Loading…
This event has ended. Create your own event on Sched.
Welcome to the Earth Science Information Partners (ESIP) 2018 Summer Meeting! The 2018 theme is Realizing the Socioeconomic Value of Data. The theme is based on one of the goals in the 2015 - 2020 ESIP Strategic Plan, which provides a framework for ESIP’s activities over the next three years.

All Presentations are being added to a Google Folder temporarily and then will be moved to FigShare and linked to the sessions here. 
View analytic

Sign up or log in to bookmark your favorites and sync them to your phone or calendar.

Monday, July 16
 

5:00pm

Registration Open
Monday July 16, 2018 5:00pm - 7:30pm
Foyer 880 E 2nd St, Tucson, AZ 85719
 
Tuesday, July 17
 

7:30am

Registration Open
Tuesday July 17, 2018 7:30am - 8:00am
Foyer 880 E 2nd St, Tucson, AZ 85719

8:00am

Breakfast Newcomers
Welcome to the ESIP Meeting! We are glad that you are here. This is an informal breakfast to meet a few ESIP leaders before the meeting starts. 

Speakers & Moderators
avatar for Annie Burgess

Annie Burgess

ESIP Lab Director, ESIP
avatar for Christine White

Christine White

Technical Advisor, Esri


Tuesday July 17, 2018 8:00am - 8:30am
Grand Canyon Ballroom
  • Remote Participation Link https://global.gotomeeting.com/join/752150301
  • Remote Participation Access Code 752-150-301
  • Remote Participation Phone # (646) 749-3129 More phone numbers Australia: +61 2 9087 3604 Austria: +43 7 2081 5427 Belgium: +32 28 93 7018 Canada: +1 (647) 497-9391 Denmark: +45 32 72 03 82 Finland: +358 942 72 1060 France: +33 170 950 594 Germany: +49 692 5736 7317 Ireland: +353 16 572 651 Italy: +39 0 247 92 13 01 Netherlands: +31 202 251 017 New Zealand: +64 9 280 6302 Norway: +47 21 93 37 51 Spain: +34 932 75 2004 Sweden: +46 853 527 827 Switzerland: +41 435 5015 61 United Kingdom: +44 20 3713 5028

8:30am

Welcome: Overview of the Week
This session brings us all together to kick-off the week, layout the general plan for the week, get oriented to Tucson and our local host, University of Arizona, and highlight things that you need to know to make your week as productive as possible. 

Speakers & Moderators
avatar for Christine White

Christine White

Technical Advisor, Esri


Tuesday July 17, 2018 8:30am - 9:30am
Grand Canyon Ballroom
  • Remote Participation Link https://global.gotomeeting.com/join/752150301
  • Remote Participation Access Code 752-150-301
  • Remote Participation Phone # (646) 749-3129 More phone numbers Australia: +61 2 9087 3604 Austria: +43 7 2081 5427 Belgium: +32 28 93 7018 Canada: +1 (647) 497-9391 Denmark: +45 32 72 03 82 Finland: +358 942 72 1060 France: +33 170 950 594 Germany: +49 692 5736 7317 Ireland: +353 16 572 651 Italy: +39 0 247 92 13 01 Netherlands: +31 202 251 017 New Zealand: +64 9 280 6302 Norway: +47 21 93 37 51 Spain: +34 932 75 2004 Sweden: +46 853 527 827 Switzerland: +41 435 5015 61 United Kingdom: +44 20 3713 5028

9:30am

Governance Transition for the Data Management Training Clearinghouse Expansion Project
The collaboratively developed and maintained Data Management Training Clearinghouse (DMTC) has recently been awarded a three year Institute of Museum & Library Science (IMLS) grant as part of its National Digital Library Program. The IMLS grant will enable the DMTC to expand and enhance its web presence, and services by expanding and diversifying the content included in the clearinghouse, enhancing cataloging and classification, and developing a method to enable feedback between trainers and trainees. The DMTC has been shepherded by the ESIP Data Management Training Working Group (DMT WG) which is nearing the end of its two year cycle. This session will be a working session where current DMT WG members work with potential DMTC Advisory Board members to define the governance structure and strategies for the future scope, activities, and services of the DMTC. Following this session, a business meeting will cover the accomplishments of the DMT WG over the past two years, and launch the IMLS expansion project.

Speakers & Moderators
avatar for Karl Benedict

Karl Benedict

Director of Research Data Services, University of New Mexico
For nearly 30 years Karl Benedict has had parallel careers in Information Technology, Data Management and Analysis, and Archaeology. Over the last 22 years at UNM he has worked as a Graduate Student in Anthropology, Research Scientist, Applied Research Center Director, and currently... Read More →
avatar for Nancy Hoebelheinrich

Nancy Hoebelheinrich

Principal, Knowledge Motifs LLC
See my LinkedIn profile at: https://www.linkedin.com/in/nancy-hoebelheinrich-0576ba3


Tuesday July 17, 2018 9:30am - 11:00am
Canyon A

9:30am

Interoperability within the DataONE federation: Participating in the network
The amount of data researchers are generating is exploding and as a result, so to are the opportunities for more comprehensive and novel analyses incorporating the work of many investigators. These data are however, scattered across the globe in different formats and accessible through a variety of mechanisms, challenging researchers, educators and others to find the specific data they need. Repositories managing these data in ways that promote precise discovery and recall are positioned to become leaders in scientific knowledge and the creation of data stewardship. DataONE enables repositories to increase visibility and exposure of data through a federated network utilizing a well tuned and consistent, shared infrastructure for search and access to data across participating repositories. The DataONE infrastructure facilitates data preservation, replication, attribution and citation, and supports existing data tools as well as registration of repository data services.

During this workshop, data and organization managers will be provided with an overview of the DataONE cyberinfrastructure and the steps necessary to integrate with DataONE. We will highlight the services and features available, provide an overview of the implementation process and review technical requirements allowing sufficient time for specific case studies, questions and answers.

Speakers & Moderators
avatar for Matt Jones

Matt Jones

Director of Informatics, UC Santa Barbara
Data Federation | Open Science | Provenance and Semantics
DV

Dave Vieglais

University of Kansas / DataONE



Tuesday July 17, 2018 9:30am - 11:00am
Canyon B

9:30am

Machine Learning Workshop Report
This session is a series of talks reporting the initial MLWS along with work performed and progress during the 90 day follow-up period.


Tuesday July 17, 2018 9:30am - 11:00am
Madera

9:30am

Optimizing Data for the Cloud

Session Description: When data is shared in the cloud, anyone can analyze it without having to download it or store it themselves, which lowers the cost of new product development, reduces the time to scientific discovery, and can accelerate innovation. However, staging large-scale datasets for analysis in the cloud requires consideration of how data should be prepared and organized to allow fast, efficient, and programmatic access from distributed computing systems. This workshop will provide a forum for members of the community to share lessons learned as they explore ways to use the cloud to expand access to data. It seeks to encourage dialog between users interested in leveraging data in the AWS Cloud for research and application development.


Data Optimization for the cloud: Tools and Services (July 17th, 9:30 am – 11:30 am):

AGENDA

Joe Flasher, AWS (10 min)

Introduction

Dan Pilone, Element84 (10 min)
Title: Interdisciplinary research, heterogeneous data, and the case for Archives of Convenience
Description: Earth Science data is measured in petabytes and represents decades of data collection, evolution of technology and practices, and provides an unparalleled view of our planet. The pace of change is only accelerating: NASA and other agencies are on their way to making hundreds of Petabytes of data available in the cloud, highly scalable processing and analysis architectures and tools are in active use with more being developed every day, and each of these brings with it opportunities for optimization and innovation. This talk demonstrates leveraging the elastic nature of the cloud using GOES-16 data to create ephemeral Archives of Convenience, targeting individual researcher needs, optimized for their problems and tool suites, instead of trying to settle on a single "cloud optimized" solution.

Ilya Khamushkin, Intertrust (10 min)
Title: Earth Data for Everyone
Description: At Intertrust, we believe that working with Earth science data should be easy. Too often file formats, transfer protocols, and cumbersome access interfaces make it difficult for users without domain knowledge to incorporate these data into their workflows. During this session we’ll share our experiences from the past five years building and operating the Planet OS Datahub, our cloud-based data as a service platform.

Marty J. Sullivan, Cornell University (10 min)
Title: The Need for Data Lakes in Climate Science
Description: Climate data is massive. The archive data formats used in the field are difficult to retrieve and analyze, they also come from so many different sources. Learn how and why Cornell University’s department of Earth & Atmospheric Sciences is moving toward the concept of building geospatial data lakes in Amazon S3 and using tools like Amazon Athena.

Sudhir Shrestha, ESRI (10 min)
Title: Scientific Earth Science Data to Cloud Optimized Web Services;
Description: Working with earth science data to extract information sometimes can be challenging due to its diversity and complexity. In this session, we will demonstrate real world examples of successful application of open earth science data in ArcGIS platform. We will share briefly the workflow of optimized scientific data management (ingesting, managing, analyzing and sharing) in cloud and how you can quickly spin up the web applications to share your information products including analytics to larger community. We will share few use cases, such as NOAA High Resolution Refresh Radar (HRRR), Sentinel data and other webmap applications that demonstrate how we access large collections of near real-time data that are stored on-premise or on the cloud, disseminate them dynamically, process and analyze them on-the-fly, and serve them to a variety of geospatial applications.

General discussion (10 min)

Breakout groups: focus on tools and services (30 minutes)

(Continue conversation over coffee - 30 minutes)




Tuesday July 17, 2018 9:30am - 11:00am
Pima
  • Subject Jump In, Deep Dive
  • Remote Participation Link https://global.gotomeeting.com/join/752150301
  • Remote Participation Access Code 752-150-301
  • Remote Participation Phone # (646) 749-3129 More phone numbers Australia: +61 2 9087 3604 Austria: +43 7 2081 5427 Belgium: +32 28 93 7018 Canada: +1 (647) 497-9391 Denmark: +45 32 72 03 82 Finland: +358 942 72 1060 France: +33 170 950 594 Germany: +49 692 5736 7317 Ireland: +353 16 572 651 Italy: +39 0 247 92 13 01 Netherlands: +31 202 251 017 New Zealand: +64 9 280 6302 Norway: +47 21 93 37 51 Spain: +34 932 75 2004 Sweden: +46 853 527 827 Switzerland: +41 435 5015 61 United Kingdom: +44 20 3713 5028
  • Tags Cloud Computing, Data Analytics

9:30am

Introduction to Jupyter technologies and how they are used in the ESIP community
You’ve heard a lot about Jupyter. There are Notebooks and Hubs, but what are they? Do they make it easier for you to do or share your work?

Participants in this session will be given an overview on how ESIP members are using the Jupyter Project’s applications to accelerate their own research. This breakout session is intended as an introduction not only to Jupyter applications and their usage in ESIP member organizations. Workshops using the technologies via ESIPhub later in the Meeting will also be discussed. We will hold a ten minute discussion after the presentations on the topics brought up during the talks and how we as a community can use the ESIPhub resource.

Frank Greguska, NASA JPL (15min)
Title: Using Apache Science Data Analytics Platform from Jupyter
Description: Apache Science Data Analytics Platform (SDAP) is an open source Apache Incubator project that, among other things, allows for analysis of scientific data on the cloud. SDAP consists of a collection of webservices that enable science and allow user interaction through Jupyter notebooks. This talk will introduce the Apache SDAP project and walk attendees through some of the algorithms that are available for use.

Tyler Erickson, Google (15min)
Title: Jupyter and Google Earth Engine
Description: Google Earth Engine is a cloud-based geospatial analysis platform that supports analysis of multi-petabyte archives via JavaScript and Python APIs. For users of the JavaScript API. the Earth Engine team maintains an online GUI. For the Python API, we promote the use of Jupyter project tools (JupyterLab, JupyterHub, Jupyter Widgets) for accessing data and developing algorithms.
Presentation: g.co/earth/esip2018-jupyter

John Readey, HDF Group (15min)
Title: HDF Kita Lab
Description: HDF Kita Lab is a Jupyter environment hosted on AWS that provides the ability to easily read and write large HDF datasets.  Users have the ability to utilize HDF Server to access data that would otherwise be too large to copy to the user disk volume.  Data used by HDF Server is stored in AWS S3, which is provides cost-effective and reliable storage.  HDF Kita Lab can be access at: https://hdflab.hdfgroup.org (HDFGroup registration is required).

Rich Signell, USGS (15min)
Title: Jupyter Success Stories from IOOS and USGS
Description: The Integrated Ocean Observing System and the US Geological Survey have been using Jupyter technologies since 2012 to help spread the use of effective and efficient tools across their communities.  These notebooks often demonstrate reproducible workflows based on catalog and data web services and come with reproducible environments made possible by the conda-forge project.  A series of notebooks will be demonstrated, from notebooks demonstrating catalog-driven workflows, to notebooks on binder that appear like web applications.

Keith Maull, NCAR Library (15min)
Title: ESIPHub Pilot | Exploring services and infrastructure to support computational geosciences research and collaboration with JupyterHub
Description: ESIPHub, a JupyterHub-based infrastructure for the ESIP community, is now available and being used within several workshops during the summer meeting.
In this talk, I will discuss the pilot of ESIPHub with UCAR/NCAR's highly successful Research Experiences for Undergraduates (REU) program, SOARS (Significant Opportunities in Atmospheric Research and Science; https://www.soars.ucar.edu). Over the last three years, we have been developing computational workshops to introduced SOARS Protégés to Python, Jupyter, computational thinking and data analysis, and this summer, we piloted ESIPHub within these workshops. I will report on the exciting potential the platform has not only for education and training, but also collaborative research.

Discussion (10)

Learn more about Jupyter and attend the other workshops using ESIPhub:

* Directly after this session is the Metadata Improvement Lab where participants will learn how to translate their xml into JSON-LD using the schema.org vocabulary Google recommends for datasets.
http://sched.co/Eype
* Wednesday afternoon is a workshop for cloud-based analysis.
http://sched.co/EyqK
* Thursday morning we'll learn about some custom widgets for earth science.
http://sched.co/EyqX

Speakers & Moderators
avatar for Tyler Erickson

Tyler Erickson

Developer Advocate, Google
avatar for Sean Gordon

Sean Gordon

Metadata Developer, The HDF Group
Talk to me about the ESIP Labs project, ESIPhub a JupyterHub based shared computational environment for workshops at Meetings.My research focuses on the connections between documentation structures and the evaluation of content for the metadata needs of diverse communities of practice... Read More →
avatar for Rich Signell

Rich Signell

Oceanographer, USGS
Ocean Modeling, Python, NetCDF, THREDDS, ERDDAP, UGRID, SGRID, CF-Conventions, Jupyter, JupyterHub, CSW, TerriaJS



Tuesday July 17, 2018 9:30am - 11:00am
Sabino

9:30am

Software and Services Citations Overview and Use Case
Join us in a brief overview of the current efforts from the Software and Services Citation Cluster including the incredible work that came from our competition last Summer ESIP. Help us gather use cases for different software and services citation needs that will be used during our follow-on workshop being held later this week developing guidance and examples of how those each use case would be represented in a citation.

Speakers & Moderators
JG

James Gallagher

President, OPeNDAP
avatar for Jessica Hausman

Jessica Hausman

Data Engineer, PO.DAAC JPL
avatar for Shelley Stall

Shelley Stall

Senior Director, Data Leadership, American Geophysical Union
Shelley Stall is the Senior Director of Data Leadership at the American Geophysical Union. Shelley has more than two decades of experience working in high-volume, complex data management environments. She has helped organizations in not-for-profit, commercial, defense, and federal... Read More →


Tuesday July 17, 2018 9:30am - 11:00am
Ventana

11:00am

11:30am

Seeking feedback from the ESIP community on two developing dataset maturity assessment models
Abstract:
In collaboration with the ESIP Data Stewardship Committee, the NCEI Use/Service Maturity Matrix (MM-Serv) Working Group has developed a complete draft maturity model for the services provided for individual datasets, where services include both automated (e.g. web services) and human-provided functions.

A complete draft of a WMO-Wide Stewardship Maturity Matrix for Climate Data (SMM-CD) has also been developed by the WMO SMM-CD Working Group in collaboration with international experts who participated in the Expert Meeting on Climate Data Modernisation at the Royal Netherlands Meteorological Institute (KNMI), 16–18 April 2018.

In this session, the latest draft version of both maturity assessment models will be introduced to the ESIP community, followed by a review/comment period in an effort to gain valuable feedback from the community.

Agenda:
Part I: Presentations
1) Opening remark & MMs status - Ge Peng
(The slides are available at: https://doi.org/10.6084/m9.figshare.6854948)

2) Introducing NCEI/ESIP-DSC MM-Serv - Ruth Duerr
(The slides are available at: https://doi.org/10.6084/m9.figshare.6854975)

3) Introducing WMO SMM-CD - Christina Lief
(The slides are available at: https://doi.org/10.6084/m9.figshare.6854993)

Part II: Working Session
Sub-group one - reviewing MM-Serv (group leads: Ge Peng, Sophie Hou, Bob Downs)
(The MM-Serv poster is available at: https://doi.org/10.6084/m9.figshare.6855020)

Sub-group two - reviewing SMM-CD (group leads: Christina Lief, Nancy Ritchey)
(The SMM-CD poster is available at: https://doi.org/10.6084/m9.figshare.6855056)

To cite this session: Peng, G., M.J. Brewer, R. Duerr, W. Wright, and C. Lief, 2018: Seeking feedback from the ESIP community on two developing dataset maturity assessment models. Session. The Earth Science Information Partners (ESIP) 2018 Summer Meeting, 17–20 July 2018, Tucson, AZ, USA.

Speakers & Moderators
avatar for Christina Lief

Christina Lief

Physical Scientist/Consultant, WMO
Recently retired from NOAA/NCEI after a 30 year career with the US Federal Government. I am a Physical Scientist specializing in global observing system climate data (remote sensed & in-situ, data access and documentation). I am presently working as a consultant for WMO as well as... Read More →
avatar for Ge Peng

Ge Peng

Research Scholar, CICS-NC/NCEI
Dataset-centric scientific data stewardship, data quality management


Tuesday July 17, 2018 11:30am - 1:00pm
Canyon A

11:30am

Collaboration among data repositories: replication, deduplication, and interoperability
Environmental data repositories are rapidly adapting to the positive changes in the culture of data publishing, as requested by funders,journals, and researchers. Repositories are increasingly being tagged as the principal site for depositing data and research products from specific sponsor programs (e.g., BCO-DMO for NSF Biological & Chemical Oceanography, EDI for NSF LTER and DEB programs, the Arctic Data Center for NSF Arctic programs, and NCEI for NOAA data of all stripes). This leads to many highly specialized repositories that serve specific communities and are responsible curators for targeted swaths of data. These repositories are then faced with the challenge of replicating copies of data to meet funder expectations while providing an integrated discovery and access system for their communities and across the broader environmental sciences community. Repository interoperability allows federated data aggregators like DataONE and ESDIS to then provide a common discovery and interoperability layer and a searchable view on top of this federated repository infrastructure.

In this session, we will…
  • Explore the concepts of data sharing, data replication, data duplication among repositories and what they mean for the user community (short intro to the problem)
  • Explore some real-word data sharing/interoperability scenarios,
  • Identify the common elements and requirements for data interoperability between repositories (e.g., Elements: Dataset, Funding Award, Persons, Organizations, Roles, etc., and Requirements: ‘Element’ Identification, ACLs, Attribution of sources, PROV, etc)
  • Try to answer the question, “Are the existing science metadata standards sufficient for data interoperability and replication among repositories?”. I.e., can they express the relationship between data in different repositories (‘primary or original’ data, synchronized copy, copy of certain version, subset associated with publication)
Agenda

1) Repository interoperability challenges (Jones) 20 minutes

  • technical: identifier practices, mutability, duplication, versioning and derived data variants, built infrastructure

  • socio-cultural: open source & open communities, NIH syndrome, tech leapfrogging, so many standards to choose from

  • DataONE crosswalk/integration experiences

2) Case studies in interoperability challenges

  • EDI / BCO-DMO (Gries) (10 minutes)

  • BCO-DMO / R2R / NCEI (Shepherd) (10 minutes)

  • Arctic Data Center / IARC/ EDI / LTER (Jones) (10 minutes)

3) Brainstorming, Discussion and Q&A (Shepherd moderates) (40 minutes)

  • What are the easy interoperability wins?

  • What are the hard interoperability challenges?

  • What does it take to build an open community where:

    • Many repositories implement the same API, share identifier and versioning models, and can replicate content without creating new identifiers, and can be searched from a common system like DataONE?


Speakers & Moderators
avatar for Matt Jones

Matt Jones

Director of Informatics, UC Santa Barbara
Data Federation | Open Science | Provenance and Semantics
avatar for Adam Shepherd

Adam Shepherd

Technical Director, Co-PI, BCO-DMO @ WHOI
schema.org | Data Containerization | Linked Data | Semantic Web | Knowledge Representation | Ontologies



Tuesday July 17, 2018 11:30am - 1:00pm
Canyon B

11:30am

Web-based Tools and Data Standards for Electronic Tagging and In situ Datasets: An Interactive & Consultative Workshop
Decision support and other earth science data applications for societal benefit increasingly rely on the integration of multivariate data from various sensors. This in turn hinges critically on interoperability aspects of such data and the means by which they are delivered. The inherent heterogeneity of oceanographic in situ datasets and their variable adherence to data standards poses a significant impediment to interoperability and their long-term data stewardship. The Oceanographic In situ data Interoperability Project (OIIP), funded under NASA/ACCESS, is a collaboration between JPL, UCAR/Unidata and the Large Pelagics Research Center (UMASS-Boston). OIIP aims to address these interoperability challenges with a focus on technology solutions for data from both conventional oceanographic sensors and emerging datasets for electronic tags deployed on biological “glider” platforms. The project seeks to deliver a reusable and accessible set of web-based and open source tools to: 1) mediate reconciliation of heterogeneous source data into a tractable number of standardized, archivable formats consistent with earth science data standards. 2) develop an improved data model supporting metadata rich in situ datasets. 3) enhance THREDDS server technology for support of point, profile, trajectory data series. 4) implement a web-based visualization tool based on JPL’s Common Mapping Client for comprehensive mapping and charting of earth science spatial data types. The objective of this session is to engage directly with stakeholders, soliciting community comment on OIIP capabilities demonstrated in an interactive workshop setting. Interactive demonstrations will focus on the following areas: 1) integrated web-based visualization of in situ and satellite remote sensing data. 2) use of ROSETTA to produce standards compliant data files from ASCII source files. 3) demonstration of new, THREDDS V5.0 support for discrete geometry data types. 4) extensions of geospatial metadata standards developed during OIIP to support “rich”, community specific metadata for enhanced, semantically aware search.



Tuesday July 17, 2018 11:30am - 1:00pm
Madera

11:30am

Optimizing Data for the Cloud
Session Description: When data is shared in the cloud, anyone can analyze it without having to download it or store it themselves, which lowers the cost of new product development, reduces the time to scientific discovery, and can accelerate innovation. However, staging large-scale datasets for analysis in the cloud requires consideration of how data should be prepared and organized to allow fast, efficient, and programmatic access from distributed computing systems. This workshop will provide a forum for members of the community to share lessons learned as they explore ways to use the cloud to expand access to data. It seeks to encourage dialog between users interested in leveraging data in the AWS Cloud for research and application development.


Data Optimization for the cloud: Data Formats (July 17th, 11:00 am – 1:00 pm):

AGENDA


Otis Brown and Jonathan Brannock, CICS-NC (10 min)
Title: Big Data Project (BDP) Data Broker Update
Description: The NOAA Big Data Project Data Broker role and current datasets being provided by CICS-NC are reviewed. NOAA datasets under consideration for provision to the cloud partners are described. An update on GOES-16 accession from AWS S3 including usage by volume and users is given. New policy challenges associated with reformatting datasets and online updated are discussed.

Rich Signell, USGS (10 min)
Title: Cloud-friendly ndarray formats
Description: There is a tremendous amount of scientific multidimensional array data (ndarray) stored in NetCDF or HDF files. Since the cloud uses object storage, not conventional filesystems, there is a need for a "cloud-friendly" storage format that can support the NetCDF and HDF data models. Several solutions have been proposed, including HSDS, Zarr, TileDB, S3-Netcdf, and can be compared with FUSE, which provides a POSIX layer to make object storage look like a filesystem. This talk will discuss what the Pangeo project is doing to explore these data formats and the challenges that remain for the community.

Rob Emanuele, Azavea (10 min)
Title: Cloud Optimized GeoTiffs: enabling efficient cloud workflows
Description: Cloud Optimized GeoTIFFs (COGs) are a raster data format that is a key component to enabling cloud-native geospatial workflows. COGs enable faster reading, writing, and processing of raster data on the cloud without the need for local copies. This talk will include a brief overview of what COGs are and show examples of how they can be used to leverage cloud deployment for research and application development.

John Readey, The HDF Group (10 min)
Title: HDF Data in the Cloud
Description: Amazon S3 is a great storage technology for the cloud: scalable, built-in redundancy, and cost-effective. However traditionally HDF5 files stored on S3 haven’t worked well (or at all) with applications that expect data to be stored on POSIX filesystems, requiring files to be copied to local storage before being accessed. In order to enable HDF data for cloud-based analytics over massive datasets, The HDF Group has developed new methods for storing HDF data on S3 that take full advantage of the storage platform, allows data to be accessed in place, and is compatible with existing applications. This talk will review these technologies and outline some future directions.

General discussion (10 min)

Breakout groups: focus on data formats (30 min)

Report findings from breakout groups (10 min)






Tuesday July 17, 2018 11:30am - 1:00pm
Pima
  • Subject Jump In, Deep Dive
  • Remote Participation Link https://global.gotomeeting.com/join/752150301
  • Remote Participation Access Code 752-150-301
  • Remote Participation Phone # (646) 749-3129 More phone numbers Australia: +61 2 9087 3604 Austria: +43 7 2081 5427 Belgium: +32 28 93 7018 Canada: +1 (647) 497-9391 Denmark: +45 32 72 03 82 Finland: +358 942 72 1060 France: +33 170 950 594 Germany: +49 692 5736 7317 Ireland: +353 16 572 651 Italy: +39 0 247 92 13 01 Netherlands: +31 202 251 017 New Zealand: +64 9 280 6302 Norway: +47 21 93 37 51 Spain: +34 932 75 2004 Sweden: +46 853 527 827 Switzerland: +41 435 5015 61 United Kingdom: +44 20 3713 5028
  • Tags Cloud Computing, Data Analytics

11:30am

Metadata Evaluation Lab at ESIP: Assessing if community metadata is ready for Schema.org
In this workshop participants will determine if metadata from around ESIP is ready to be translated to schema.org JSON-LD using the Google recommendation for datasets to enhance discoverablility in search engines and learn a simple way to translate their XML into compliant JSON-LD that they can improve for their metadata dialect's particular needs.

* Participants will hear about NOAA's experience in creating this information for their datasets and what was needed to apply it.
* Participants will learn how to analyze metadata collections for conceptual content and absolute content.
* Participants will apply a conceptual version of the Google recommendation for dataset metadata using the schema.org vocabulary to their collections and create a report detailing collection metrics, regardless of metadata dialect.
* Finally participants will learn how to create JSON-LD for the records in their collection and validate using the Google Structured Data Testing Tool.

We will use ESIPhub, a Jupyterhub based shared computational environment to run the workshop. This means you won't need to set up your computer to participate in the workshop, just bring a device with a connected web browser.

Learn more about Jupyter and attend the other workshops using ESIPhub:

* Tuesday morning before this session includes a general overview of Jupyter usage in our community.
http://sched.co/Eype
* Wednesday afternoon is a workshop for cloud-based analysis.
http://sched.co/EyqK
* Thursday morning we'll learn about some custom widgets for earth science.
http://sched.co/EyqX

Once you've learned how to make your own schema.org JSON-LD, learn how to improve and publish it in these later sessions:

* Tuesday afternoon includes a two-parter on Semantics in Action.
http://sched.co/Eypw 
* Wednesday afternoon goes into depth on Publishing schema.org datasets.
http://sched.co/EyqH

Speakers & Moderators
avatar for Sean Gordon

Sean Gordon

Metadata Developer, The HDF Group
Talk to me about the ESIP Labs project, ESIPhub a JupyterHub based shared computational environment for workshops at Meetings.My research focuses on the connections between documentation structures and the evaluation of content for the metadata needs of diverse communities of practice... Read More →
avatar for John Relph

John Relph

Disruptor, NESDIS/NCEI
OneStop, Metadata, Archival, Automation, Data Management, Canaan Dogs



Tuesday July 17, 2018 11:30am - 1:00pm
Sabino

11:30am

Supporting integrated and predictive science: Community for Data Integration focus on risk assessment
The aspiration of many data organizations that fund seed projects, including the USGS Community for Data Integration, is to support integrated, reusable, and sustainable tools and data. This year, the CDI funded several projects under the theme of risk assessment and hazard vulnerability, with the goal of coordinating and integrating the outputs. Project teams will meet during this breakout session, which occurs midway through the funding period, to report on progress, learn from each other, and coordinate to optimize their outputs. The selected projects are improving accessibility to drought modeling, hazards and assets data (for example, invasive species, landslides, and infrastructure data), and tools for knowledge extraction and data documentation. Integrating data and resources on hazards and assets improves our ability to assess strategic risk, predict future hazards impact, and realize the socioeconomic value of earth science data.

Speakers
  • Jeanne Jones (USGS) - Community for Data Integration Risk Map Project
  • Caitlin Andrews (USGS) - An Interactive Web-based Tool for Anticipating Long-term Drought Risk
  • Eric Jones (USGS) - Integrating Disparate Spatial Datasets from Local to National Scale for Open-Access Web-Based Visualization and Analysis: A Case Study Compiling U.S. Landslide Inventories
  • Daniel Wieferich (USGS) - Knowledge Extraction Algorithms (KEA): Turning Literature Into Data
  • Dennis Walworth (USGS) - Content specifications to enable USGS transition to ISO metadata standard
  • Kathy Gerst (USA National Phenology Network) - Workflows to support integrated predictive science capacity: Forecasting invasive species for natural resource planning and risk assessment

Speakers & Moderators
avatar for Leslie Hsu

Leslie Hsu

Coordinator, Community for Data Integration, U.S. Geological Survey
avatar for Daniel Wieferich

Daniel Wieferich

Physical Scientist, US Geological Survey
python, database management, landscape ecology, machine learning


Tuesday July 17, 2018 11:30am - 1:00pm
Ventana

1:00pm

2:00pm

Semantics In Action
This two part session will focus on real-life use cases for ESIP's semantic technology resources, and hands-on tutorials to help ESIP members begin taking advantage of these resources. Part 1 will feature presentations from people who are using semantic resources: how they are using them and why, and what they are getting for their efforts. Part II will feature hands-on tutorials to help ESIP members use our semantic resources in their own environments to solve actual problems. Tutorials will include using schema.org for improved search rankings; using JSON LD for linked data; using COR and Bioportal for document annotation.

AGENDA:

Semantic Search in Action in ArcGIS Hub, Pranav Kulkarni, ESRI R & D Center

Esri's ArcGIS Hub has an improved search experience by implementing semantic search using a knowledge graph. The search is context-aware and provides a great user experience in search and discovery of data on ArcGIS Hub.
Pranav Kulkarni will be talking about how his team implemented semantic search at scale and how the knowledge graph can be grown with custom vocabularies further improving the search results.

Semantics in Action in the Cryosphere, Ruth Duerr The Cryospheric science and polar regions communities have a number of organizations and activities, both global and national, that are trying to pull together the observations and data needed to understand the rapid changes occurring in the Arctic.  As part of these activities, work is going on across the full range of the semantic spectrum - everything from controlled vocabularies, to glossaries, to full blown ontologies.  Ruth will discuss some of these activities, their underlying use cases, and how they tie to other semantics activities in ESIP and Earth science generally.

Using the Environment Ontology (ENVO), Pier Luigi Buttigieg, Alfred Wegener Institute Helmholtz Centre for Polar and Marine Research 

The Environment Ontology (ENVO) is a community ontology for the machine-readable representation of environmental entities. ENVO has been built along the best practices of the Open Biological and Biomedical Ontology Foundry and Library, thus reuses and aligns to a suite of existing ontologies to express environmental entities such as geographic, astronomical, and anthropogenic features as well as the processes they participate in. The ontology’s initial uses were in the life sciences, and thus focused on entities such as biomes and ecosystems. It has become a standard resource in the genomes and microbiome communities, and is steadily being adopted in other disciplines. Most recently, ENVO has seeded and interoperates with ontologies in the domains of agronomy, food science, and - in collaboration with UN Environment - the Sustainable Development Goals. It also is providing semantic expression for a number of existing and emerging standard vocabularies, extending their functionality.

 Pier Luigi will discuss typical usage scenarios for the Environment Ontology, including its recent deployment in the UNESCO/IOC-IODE Ocean Best Practice repository and an example of combining ENVO and Gene Ontology to mobilise data  in environmental genomics.





Speakers & Moderators

Tuesday July 17, 2018 2:00pm - 3:30pm
Canyon A

2:00pm

Sustainable Data Mgt - Repository Return on Investment - Paper draft 2
Domain-specific data repositories have been established to curate, archive, and publish earth and environmental observation data in response to the need for archive per requests from funders and publishers. Their domain-specific approach has proven successful in changing the research culture and mobilizing data, developing best practices and standards, and training a data management workforce. The Sustainable Data Management Cluster draws on the collective experience of repositories to promote common collaboration and curation strategies. A major activity of this group has been metrics for calculating Return on Investment (RoI) in such repositories. In this session, we will discuss draft 1 of our paper “A discussion of Return on Investment for data repositories in earth and environmental sciences”, and begin work on draft 2.

Speakers & Moderators
avatar for Margaret O'Brien

Margaret O'Brien

Data Manager, University of California, Santa Barbara


Tuesday July 17, 2018 2:00pm - 3:30pm
Canyon B

2:00pm

Research Project Management Principles and Tools; AKA Juggling 101 - Part 1
"Strategy is a system of expedients. It is more than science, it is the translation of science into practical life, the development of an original leading thought in accordance with the ever-changing circumstances." - Helmuth von Moltke the Elder, quoted in Government and War (1918) by Spenser Wilkinson

Skills and knowledge in effective project management aren't just useful to project leads and mission planners - they are critical to all of us who find ourselves working on teams to accomplish a shared goal. Only through the development and management of a clear plan can we keep on track and identify times when we need to update our plan to reflect changing realities. By systematically identifying project goals and outcomes, who and what is needed to accomplish those objectives, and the definition of a realistic timeline for doing so we can work to ensure that our entire team is on the same page and headed in the right direction. This workshop will provide a combination of a conceptual introduction to key principles for effective project planning with hands-on practice with a powerful open source project planning and management tool - TaskJuggler. By the end of this workshop participants will have had an opportunity to learn the foundational principles of project planning and management, gain experience implementing those principles through the incremental development of an increaslingly complete project plan, and be in a position to continue to build their knowledge and skills through continued use of TaskJuggler as a planning and management tool.

You can follow-along and participate in the workshop in several ways:

  • Run the entire workshop on your personal computer - content & executable programs - Visit the Coffee & Code Content Platform (https://github.com/unmrds/cc-content-platform) GitHub repository and follow the installation and run instructions in the README.md file. Once running, the all of the workshop materials are available in the cc-taskjuggler folder in the instruction or playground Jupyter notebook folders. 
  • Follow along in the "playground" where the presentation files can be copied and executable code run on our cloud-based platform - http://cc.unmrds.net:8888 (the password will be provided in the workshop). 
  • Install TaskJuggler on your computer and create and run your own project plan locally while following along with the workshop materials that can be downloaded from the workshop GitHub repository (Zip Archive: https://github.com/unmrds/cc-taskjuggler/archive/master.zip).
  • Follow along with the static notebooks in the workshop GitHub repository - https://github.com/unmrds/cc-taskjuggler


    Speakers & Moderators
    avatar for Karl Benedict

    Karl Benedict

    Director of Research Data Services, University of New Mexico
    For nearly 30 years Karl Benedict has had parallel careers in Information Technology, Data Management and Analysis, and Archaeology. Over the last 22 years at UNM he has worked as a Graduate Student in Anthropology, Research Scientist, Applied Research Center Director, and currently... Read More →
    avatar for Ward Fleri

    Ward Fleri

    Professional Project Manager & Coach
    Ward Fleri has over 20 years of experience successfully delivering results in a variety of scientific domains and has led teams with diverse technical and scientific backgrounds on multimillion-dollar programs in small business, academic, and non-profit research environments. He... Read More →



    Tuesday July 17, 2018 2:00pm - 3:30pm
    Canyon C

    2:00pm

    Services for Data Value Filtering
    One of the more powerful features of IDL and python programming languages is the where() command (and its corrolary find() command in MATLAB). These commands allow users to filter variables of interest based on data thresholds, data values in other variables and other criteria, allowing downstream operations (in the code) on a filtered subset of the original data array values. For data access services including web services for satellite data products we want users to have the ability to perform similar operations: sub select data values from a specific geophysical variable based on data in other variables. This could be simple space/time subsetting requests or more complex requests that require inputs from quality variables, bit flag "masks" and other ancillary variables. In this session we will look at the state of several popular web services as well as database architectures that can perform these requests. The goal is determine a first iteration "gap and trade" study of the different services and data access methods.

    The  presentations will be as follows (about 20 minutes each)

    OPeNDAP: James Gallagher
    ERDDAP: Bob Simons
    Webification: Edward Armstrong
    OGC WCPS: Luis Bermudez

    Speakers & Moderators
    avatar for Ed Armstrong

    Ed Armstrong

    Technologist, NASA JPL
    avatar for Ethan Davis

    Ethan Davis

    UCAR Unidata
    JG

    James Gallagher

    President, OPeNDAP
    avatar for Bob Simons

    Bob Simons

    IT Specialist, NMFS SWFSC ERD
    I work on ERDDAP, a free and open source data server that gives you a simple, consistent way to download subsets of gridded and tabular scientific datasets in common file formats and make graphs and maps. ERDDAP has been installed and used by more than 70 organizations around the... Read More →


    WCPS pptx

    Tuesday July 17, 2018 2:00pm - 3:30pm
    Madera

    2:00pm

    REO: Work to date, lessons, and possible data/software coming from USGS use of the ESIP Testbed
    USGS has been taking advantage of the ESIP Testbed for a number of data systems that are part of our evolving Modular Science Framework, a vision for enabling scientific infrastructure. Some of our developments are reaching a level of maturity where we will start deploying them to production capacity on USGS infrastructure, but there are some other components that might be useful across the broader community. Examples are our Spatial Feature Registry, a system for integrating usable named/identified spatial features through time for analytical uses, and the Taxa Information Registry, a component that assembles best available information from across disparate data sources on biological taxa of interest in our biogeographic work. We offer this session to share what we're doing in these projects, offer opportunities for other groups to share similar work on the ESIP Testbed, and open discussion on how best to conduct this work moving forward.

    We're particularly looking for feedback on what parts of what we are doing would be best developed by us and others as ESIP common resources. We're building an inherently distributed architecture, and we can run operational components all over the web. As we take ideas from research to engineering, help us figure out the best pathway that will provide maximum value for USGS and for others.

    Speakers & Moderators
    avatar for Daniel Wieferich

    Daniel Wieferich

    Physical Scientist, US Geological Survey
    python, database management, landscape ecology, machine learning


    Tuesday July 17, 2018 2:00pm - 3:30pm
    Pima

    2:00pm

    HDF Workshop 1: Learning about HDF using Jupyter Notebooks

    Jupyter Notebooks have been developed by the HDF Group and many others to help scientists and other users understand how to use HDF to create and access datasets in many disciplines. HDF Lab is a tool for bringing these resources together with data in the cloud. The Lab will include sample datasets and notebooks that use them to demonstrate HDF capabilities at many levels. It will also be a place for sharing data examples and related notebooks from users in many disciplines. ESIP members will play an important role in building this resource and ensuring that it is a useful forum for sharing community expertise. Please join us at the ground level to make sure it works.

     


    Speakers & Moderators
    avatar for Ted Habermann

    Ted Habermann

    Metadata 2020
    I am interested in all facets of metadata needed to discover, access, use, and understand data of any kind. Also evaluation and improvement of metadata collections, translation proofing. Ask me about the Metadata Game.


    Tuesday July 17, 2018 2:00pm - 3:30pm
    Sabino

    2:00pm

    Building a Data Risk Factor Matrix
    Data collections can face a variety of risk factors. The ESIP Data Stewardship Committee is analyzing and categorizing data risk factors to develop a "data risk factor matrix." This activity is intended to inform and enable the geoscience data community to reduce the risks associated with data preservation and stewardship. This session will include presentations on the Data Stewardship committee activity, and engage the attendees in an exercise in which we compare our respective understandings of data risk categories.

    Speakers & Moderators
    SH

    Sophie Hou

    Data Curation and Stewardship Coordinator, National Center for Atmospheric Research
    data management/curation/stewardship: including but not limited to data life cycle, policies, sustainability, education and training, data quality, usability.
    avatar for Matthew Mayernik

    Matthew Mayernik

    Project Scientist and Research Data Services Specialist, NCAR/UCAR Data Library
    Matt is a Project Scientist and Research Data Services Specialist in the NCAR/UCAR Library. His work is focused on research and service development related to research data curation. His research interests include metadata practices and standards, data curation education, data citation... Read More →


    Tuesday July 17, 2018 2:00pm - 3:30pm
    Ventana

    3:30pm

    Networking Break
    Tuesday July 17, 2018 3:30pm - 4:00pm
    Foyer 880 E 2nd St, Tucson, AZ 85719

    4:00pm

    Semantics In Action
    This two part session will focus on real-life use cases for ESIP's semantic technology resources, and hands-on tutorials to help ESIP members begin taking advantage of these resources. Part 1 will feature presentations from people who are using semantic resources: how they are using them and why, and what they are getting for their efforts. Part II will feature hands-on tutorials to help ESIP members use our semantic resources in their own environments to solve actual problems. Tutorials will include using schema.org for improved search rankings; using JSON LD for linked data; using COR and Bioportal for document annotation. If you're a schema.org enthusiast, or just trying to figure out whether you might be an enthusiast, you've come to the right place this year. Other sessions on schema.org include a session on publishing using schema.org Dataset (http://sched.co/EyqH), and a session on assessing the readiness of community metadata for schema.org (http://sched.co/Eypl).

    AGENDA

    Schema.org, Doug Fils, Consortium for Ocean Leadership & Adam Shepherd Woods Hole There is an emerging practice to leverage structured metadata to aid in the discovery of web based resources.  Much of this work is taking place in the context (no pun intended) of schema.org and has extended to the resource type Dataset.  This session will present approaches, tools and references that will aid in the understanding and development of schema.org in JSON-LD and its connection to external vocabularies.  The goal of this session is to provide you the basics to generate example documents that can be tested and validated with various tools. This will form a foundation that can be used for the development of code or delivery solutions in your systems to expose data sets and their associated structured metadata following FAIR principles and leveraging schema.org.  


    Speakers & Moderators

    Tuesday July 17, 2018 4:00pm - 5:30pm
    Canyon A

    4:00pm

    Enabling transparency and reproducibility in science through practical provenance frameworks
    Reproducible science is critical to both researchers and society. Exposing the provenance of research products enables researchers to fully understand computational workflows that led to a result, and is key for computational reproducibility that builds trust in science. Provenance information includes metadata about the structure of scientific workflows, input data and parameters, output data and products like figures and graphs, and software that was executed in the workflow. With provenance, researchers can understand an analysis, guide interpretation of scientific results, propose alternative analysis, and re-execute workflows. Recording all of this in practical systems that are easy to use and available to the research community remains a challenge. During this workshop we will highlight existing and emerging solutions to provenance tracking and explore advances and best practice representing, capturing, and using provenance. Demonstrations of tools and methods supporting provenance capture, editing, and use of provenance for reproducible science will be highlighted.

    Speakers & Moderators
    AB

    Amber Budden

    Director for Community Engagement and Outreach, DataONE
    avatar for Annie Burgess

    Annie Burgess

    ESIP Lab Director, ESIP
    avatar for Matt Jones

    Matt Jones

    Director of Informatics, UC Santa Barbara
    Data Federation | Open Science | Provenance and Semantics
    DV

    Dave Vieglais

    University of Kansas / DataONE



    Tuesday July 17, 2018 4:00pm - 5:30pm
    Canyon B

    4:00pm

    Research Project Management Principles and Tools; AKA Juggling 101 - Part 2
    "Strategy is a system of expedients. It is more than science, it is the translation of science into practical life, the development of an original leading thought in accordance with the ever-changing circumstances." - Helmuth von Moltke the Elder, quoted in Government and War (1918) by Spenser Wilkinson

    Skills and knowledge in effective project management aren't just useful to project leads and mission planners - they are critical to all of us who find ourselves working on teams to accomplish a shared goal. Only through the development and management of a clear plan can we keep on track and identify times when we need to update our plan to reflect changing realities. By systematically identifying project goals and outcomes, who and what is needed to accomplish those objectives, and the definition of a realistic timeline for doing so we can work to ensure that our entire team is on the same page and headed in the right direction. This workshop will provide a combination of a conceptual introduction to key principles for effective project planning with hands-on practice with a powerful open source project planning and management tool - TaskJuggler. By the end of this workshop participants will have had an opportunity to learn the foundational principles of project planning and management, gain experience implementing those principles through the incremental development of an increaslingly complete project plan, and be in a position to continue to build their knowledge and skills through continued use of TaskJuggler as a planning and management tool.

    You can follow-along and participate in the workshop in several ways:

  • Run the entire workshop on your personal computer - content & executable programs - Visit the Coffee & Code Content Platform (https://github.com/unmrds/cc-content-platform) GitHub repository and follow the installation and run instructions in the README.md file. Once running, the all of the workshop materials are available in the cc-taskjuggler folder in the instruction or playground Jupyter notebook folders. 
  • Follow along in the "playground" where the presentation files can be copied and executable code run on our cloud-based platform - http://cc.unmrds.net:8888 (the password will be provided in the workshop). 
  • Install TaskJuggler on your computer and create and run your own project plan locally while following along with the workshop materials that can be downloaded from the workshop GitHub repository (Zip Archive: https://github.com/unmrds/cc-taskjuggler/archive/master.zip).
  • Follow along with the static notebooks in the workshop GitHub repository - https://github.com/unmrds/cc-taskjuggler
  •  

    Speakers & Moderators
    avatar for Karl Benedict

    Karl Benedict

    Director of Research Data Services, University of New Mexico
    For nearly 30 years Karl Benedict has had parallel careers in Information Technology, Data Management and Analysis, and Archaeology. Over the last 22 years at UNM he has worked as a Graduate Student in Anthropology, Research Scientist, Applied Research Center Director, and currently... Read More →
    avatar for Ward Fleri

    Ward Fleri

    Professional Project Manager & Coach
    Ward Fleri has over 20 years of experience successfully delivering results in a variety of scientific domains and has led teams with diverse technical and scientific backgrounds on multimillion-dollar programs in small business, academic, and non-profit research environments. He... Read More →



    Tuesday July 17, 2018 4:00pm - 5:30pm
    Canyon C

    4:00pm

    In-situ sensor QA/QC workflow: case studies and discussion
    The EnviroSensing Cluster is currently focused on workflow improvement for sensor-based science, particularly in the areas of quality assurance (QA) and quality control (QC) to meet FAIR Data (Findable, Accessible, Interoperable, and Reusible) objectives. Provenance for sensor data begins at the procurement/deployment phase and ends with some level of quality-controlled output product. Capture of metadata for hardware deployment, field maintenance, and data transmission represents the bulk of QA work, whereas the subsequent evaluation and flagging of observation data represents the QC process. As part of a two-session series, we will have short presentations (5-10min/ea) on case studies of QA Metadata Capture and subsequent Data Quality Control workflows, followed by group discussion of efficacy and directions for improvement. A separate session on standards for sensor QA/QC metadata annotation and related protocols will follow up on these concepts.

    Speakers & Moderators
    avatar for Renée F. Brown

    Renée F. Brown

    Information Manager, McMurdo Dry Valleys LTER
    aridland ecosystems, sensor networks in ecology, nitrogen and carbon biogeochemical cycles, climate change, long-term ecological research
    avatar for Scotty Strachan

    Scotty Strachan

    Director of Cyberinfrastructure, University of Nevada, Reno
    Institutional cyberinfrastructure, sensor-based science, mountain climate observatories!


    Tuesday July 17, 2018 4:00pm - 5:30pm
    Madera

    4:00pm

    Research Object Citation and FAIR Guidance Materials for Data Managers and Librarians
    Work with us as we define the topics and identify resources that would be valuable to data managers and librarians as they assist researchers with open and FAIR practices for data and other research products as well as best practices for citation. As journals and repositories move to requiring data citations that support your research the community needs consistent guidance that incorporates our best practices developed by our community.

    Speakers & Moderators
    avatar for Nancy Hoebelheinrich

    Nancy Hoebelheinrich

    Principal, Knowledge Motifs LLC
    See my LinkedIn profile at: https://www.linkedin.com/in/nancy-hoebelheinrich-0576ba3
    avatar for Shelley Stall

    Shelley Stall

    Senior Director, Data Leadership, American Geophysical Union
    Shelley Stall is the Senior Director of Data Leadership at the American Geophysical Union. Shelley has more than two decades of experience working in high-volume, complex data management environments. She has helped organizations in not-for-profit, commercial, defense, and federal... Read More →


    Tuesday July 17, 2018 4:00pm - 5:30pm
    Pima
    • Subject Jump In
    • Remote Participation Link https://global.gotomeeting.com/join/752150301
    • Remote Participation Access Code 752-150-301
    • Remote Participation Phone # (646) 749-3129 More phone numbers Australia: +61 2 9087 3604 Austria: +43 7 2081 5427 Belgium: +32 28 93 7018 Canada: +1 (647) 497-9391 Denmark: +45 32 72 03 82 Finland: +358 942 72 1060 France: +33 170 950 594 Germany: +49 692 5736 7317 Ireland: +353 16 572 651 Italy: +39 0 247 92 13 01 Netherlands: +31 202 251 017 New Zealand: +64 9 280 6302 Norway: +47 21 93 37 51 Spain: +34 932 75 2004 Sweden: +46 853 527 827 Switzerland: +41 435 5015 61 United Kingdom: +44 20 3713 5028
    • Tags Documentation, Education, Data Management Training, Data Citation, Software and Services Citation

    4:00pm

    HDF Workshop 2: HDF analysis from the desktop to the cloud
    The HDF Group is exploring many approaches to providing access to HDF data in the cloud with the goal of protecting data producers and users from disruption as data move to the cloud. These approaches include a restful interface and a plug-in replacement for h5py (h5pyd) that uses that interface, and an implementation of xarray that uses that plug-in. The Highly Scalable Data Server (HSDS) also uses this interface and will operate on HDF files or objects created from the metadata and data in those files. We are also developing several HDF5 library plug-ins that implement a Virtual Object Layer (REST-VOL) for access to the cloud using the restful interface and a Virtual File Driver (S3-VFD) for accessing data in the cloud. We will demonstrate use cases for all of these approaches and discuss how each alternative minimizes disruption for data providers and users.

    Speakers & Moderators
    avatar for Ted Habermann

    Ted Habermann

    Metadata 2020
    I am interested in all facets of metadata needed to discover, access, use, and understand data of any kind. Also evaluation and improvement of metadata collections, translation proofing. Ask me about the Metadata Game.


    Tuesday July 17, 2018 4:00pm - 5:30pm
    Sabino

    4:00pm

    Operational Readiness Levels: Establishing Trusted Data to Improve Situational Awareness
    The Disasters Lifecycle cluster, in collaboration with the All Hazards Consortium, is developing Operational Readiness Levels for data-driven decision-making support to improve situational awareness. The AHC’s Sensitive Information Sharing Environment (SISE) Working Group recently communicated the ORL concept to the AHC members, including federal and state agencies as well as private sector companies supporting a broad range of emergency management services. Initial criteria for the ORLs, a flowchart assessment tool, and data examples were demonstrated. We received enthusiastic feedback on the value of the ORL concept, noting that it filled an important void. Work continues on refining strategies and criteria for assessing candidate datasets for specific operational use cases.

    During this session we plan to address several challenges in meeting the trust criteria for establishing ORLs – not only for specific use cases but also across user applications and communities. Questions we expect to explore include:
    • Who can set ORLs; what is ESIP’s role; how to balance data suppliers’ input from end users’ role
    • How to manage ORLs to avoid confusion across user communities
    • How to distinguish global issues that all users would consider criteria (e.g., security, availability, …) vs specific issues related to a specific use case
    • Strategies for handling crowd-sourced information

    Agenda
    • Enabling Discovery and Access to Trusted Data, Karen Moe/NASA ESTO Emeritus
    • NASA Support of 2017 CA Wildfires: Lessons Learned, Maggi Glasscoe/Jet Propulsion Laboratory
    • SISE ORL Model Implementation Approach and Pilot, Kari Hicks/Duke Energy
    • Displaying ORL Levels in GeoCollaborate, Dave Jones/StormCenter Communications, Inc.
    • End User Community Readiness Ranking – Augmented Metadata, Robert Downs/CIESIN Columbia Univ.

    Speakers & Moderators
    avatar for Karen Moe

    Karen Moe

    Emeritus, NASA ESTO
    Co-chair the Disasters Lifecycle Cluster, ESIP Board Member at Large



    Tuesday July 17, 2018 4:00pm - 5:30pm
    Ventana
     
    Wednesday, July 18
     

    7:30am

    Registration Open
    Wednesday July 18, 2018 7:30am - 8:00am
    Foyer 880 E 2nd St, Tucson, AZ 85719

    8:30am

    Plenary Welcome
    The 2018 theme is Realizing the Socioeconomic Value of Data. The theme is based on one of the goals in the 2015 - 2020 ESIP Strategic Plan, which provides a framework for ESIP’s activities over the next three years. In this short introduction, we will highlight a few relevant ESIP activities related to the theme and introduce the plenary.
    To view a live broadcast of this event streaming on Youtube, click here.

    Speakers & Moderators
    avatar for Christine White

    Christine White

    Technical Advisor, Esri


    Wednesday July 18, 2018 8:30am - 8:45am
    Grand Canyon Ballroom
    • Remote Participation Link https://global.gotomeeting.com/join/752150301
    • Remote Participation Access Code 752-150-301
    • Remote Participation Phone # (646) 749-3129 More phone numbers Australia: +61 2 9087 3604 Austria: +43 7 2081 5427 Belgium: +32 28 93 7018 Canada: +1 (647) 497-9391 Denmark: +45 32 72 03 82 Finland: +358 942 72 1060 France: +33 170 950 594 Germany: +49 692 5736 7317 Ireland: +353 16 572 651 Italy: +39 0 247 92 13 01 Netherlands: +31 202 251 017 New Zealand: +64 9 280 6302 Norway: +47 21 93 37 51 Spain: +34 932 75 2004 Sweden: +46 853 527 827 Switzerland: +41 435 5015 61 United Kingdom: +44 20 3713 5028

    8:45am

    Raskin Scholar
    In March 2012, ESIP lost long-time member, Rob Raskin. Rob was soft-spoken, easy-going, wise and an Earth science information partner extraordinaire. Rob was a mentor to many aspiring Earth science data professionals. In collaboration with Rob’s family, ESIP remembers Rob and his dedication to support the next generation of Earth science data and technology leaders through the Robert G. Raskin Scholarship. The scholarship is awarded annually to an individual in the Earth or computer sciences who has an interest in community evolution of Earth science data systems. The Raskin Scholarship seeks to promote collaboration, research support, and exposure for talented students in the Earth or computer sciences. Special attention will be given to applicants demonstrating an interest in semantics, GIS, cyberinfrastructure and computing in the geosciences.

    ESIP awarded the 2018 Robert G. Raskin scholarship to Sara Lafia, a graduate student in the Geography Department at the University of California Santa Barbara.

    Lafia’s research focuses on enabling the spatial discovery of researcher datasets and publications. In support of this goal, she has develop linked data models and spatializations. During her time at UCSB, she has collaborated with the UCSB Library and the Center for Spatial Studies to advance the use of geospatial platforms, like Esri Open Data, to support spatial data curation and discovery. Sara’s expertise is in GIScience and increasingly in geo­semantics; she is gaining complementary domain understanding of advances in Earth science computing through the ESIP community.

    To view a live broadcast of this event streaming on Youtube, click here.


    Speakers & Moderators
    avatar for Ed Armstrong

    Ed Armstrong

    Technologist, NASA JPL
    avatar for Sara Lafia

    Sara Lafia

    Graduate Student, University of California - Santa Barbara
    Sara is a graduate student in the Department of Geography at UCSB. Sara's background is in Urban and Regional Planning and GIS. Her current research focuses on advancing the spatial discovery of research data. Sara's research interests include geographic information retrieval, data... Read More →
    avatar for Christine White

    Christine White

    Technical Advisor, Esri


    Wednesday July 18, 2018 8:45am - 9:05am
    Grand Canyon Ballroom
    • Remote Participation Link https://global.gotomeeting.com/join/752150301
    • Remote Participation Access Code 752-150-301
    • Remote Participation Phone # (646) 749-3129 More phone numbers Australia: +61 2 9087 3604 Austria: +43 7 2081 5427 Belgium: +32 28 93 7018 Canada: +1 (647) 497-9391 Denmark: +45 32 72 03 82 Finland: +358 942 72 1060 France: +33 170 950 594 Germany: +49 692 5736 7317 Ireland: +353 16 572 651 Italy: +39 0 247 92 13 01 Netherlands: +31 202 251 017 New Zealand: +64 9 280 6302 Norway: +47 21 93 37 51 Spain: +34 932 75 2004 Sweden: +46 853 527 827 Switzerland: +41 435 5015 61 United Kingdom: +44 20 3713 5028

    9:10am

    Falkenberg Awardee
    Hook Hua was awarded the 2017 Charles S. Falkenberg Award at the American Geophysical Union Fall Meeting Honors Ceremony, held on 13 December 2017 in New Orleans, La. The award is for “an early- to middle-career scientist who has contributed to the quality of life, economic opportunities and stewardship of the planet through the use of Earth science information and to the public awareness of the importance of understanding our planet.”

    The Falkenberg is a joint AGU-ESIP award given in honor of Charles Falkenberg. As part of this joint award we are providing a plenary presentation opportunity to showcase the awardee's work.

    To view a live broadcast of this event streaming on Youtube, click here.


    Speakers & Moderators
    avatar for Hook Hua

    Hook Hua

    Data Scientist, JPL/Caltech
    avatar for Christine White

    Christine White

    Technical Advisor, Esri


    Wednesday July 18, 2018 9:10am - 9:40am
    Grand Canyon Ballroom
    • Remote Participation Link https://global.gotomeeting.com/join/752150301
    • Remote Participation Access Code 752-150-301
    • Remote Participation Phone # (646) 749-3129 More phone numbers Australia: +61 2 9087 3604 Austria: +43 7 2081 5427 Belgium: +32 28 93 7018 Canada: +1 (647) 497-9391 Denmark: +45 32 72 03 82 Finland: +358 942 72 1060 France: +33 170 950 594 Germany: +49 692 5736 7317 Ireland: +353 16 572 651 Italy: +39 0 247 92 13 01 Netherlands: +31 202 251 017 New Zealand: +64 9 280 6302 Norway: +47 21 93 37 51 Spain: +34 932 75 2004 Sweden: +46 853 527 827 Switzerland: +41 435 5015 61 United Kingdom: +44 20 3713 5028

    9:40am

    Plenary
    • Dr. Stephanie Rainie, (20+5)
    • Dr. Christopher Guiterman, Tree Ring Lab (20+5) 
    • Dr. Jon Chorover, Critical Zone Observatory (20 mins + 5 for questions)
    To view a live broadcast of this event streaming on Youtube, click here.


    Speakers & Moderators
    avatar for Jon Chorover

    Jon Chorover

    University of Arizona
    Professor of Environmental Chemistry and heads the Environmental Biogeochemistry group in the Department of Soil; Water and Environmental Science; University of Arizona, His research group focuses on biogeochemical processes occurring in soil, sediment and water. Of particular interest... Read More →
    avatar for Christopher Guiterman

    Christopher Guiterman

    University of Arizona
    Chris Guiterman is a forest and fire ecologist, with a Masters in Forestry from the University of Maine and a doctorate in Natural Resource Studies from the University of Arizona. His research interests include fire history, dendroecology, human-environmental interactions, and the... Read More →
    avatar for Stephanie Carroll Rainie

    Stephanie Carroll Rainie

    University of Arizona
    Stephanie Carroll Rainie (Ahtna Athabascan), DrPH, MPH is Assistant Professor in  Public Health Policy and Management at College of Public Health. She works in the Community, Environment and Policy Department, Mel and Enid Zuckerman College of Public Health (MEZCOPH); Assistant Research... Read More →



    Wednesday July 18, 2018 9:40am - 11:00am
    Grand Canyon Ballroom
    • Remote Participation Link https://global.gotomeeting.com/join/752150301
    • Remote Participation Access Code 752-150-301
    • Remote Participation Phone # (646) 749-3129 More phone numbers Australia: +61 2 9087 3604 Austria: +43 7 2081 5427 Belgium: +32 28 93 7018 Canada: +1 (647) 497-9391 Denmark: +45 32 72 03 82 Finland: +358 942 72 1060 France: +33 170 950 594 Germany: +49 692 5736 7317 Ireland: +353 16 572 651 Italy: +39 0 247 92 13 01 Netherlands: +31 202 251 017 New Zealand: +64 9 280 6302 Norway: +47 21 93 37 51 Spain: +34 932 75 2004 Sweden: +46 853 527 827 Switzerland: +41 435 5015 61 United Kingdom: +44 20 3713 5028

    11:00am

    Networking Break
    Wednesday July 18, 2018 11:00am - 11:15am
    Foyer 880 E 2nd St, Tucson, AZ 85719

    11:00am

    ESIP Educators Workshop - Computing in the Classroom - Coding, Gadgets, and STEM
    ESIP Education is pleased to offer a 1-day workshop focusing on computer tools that facilitate data exploration. Coding, and coding gadgets (example: Raspberry Pi) are activities that function on a level just above programming yet employ similar critical thinking skills to enable digital data manipulation for specific results. Fifteen regional educators will join the ESIP education community at this stimulating hands-on workshop.

    Speakers & Moderators
    avatar for LuAnn Dahlman

    LuAnn Dahlman

    Science Writer and Editor, NOAA Climate Program Office
    The updated Climate Explorer application.
    avatar for Kalo Haslem

    Kalo Haslem

    STEAM Teacher, TCEA
    STEAM Teacher at a charter school part of the Education cohort
    avatar for Shelley Olds

    Shelley Olds

    Science Education Specialist, UNAVCO
    Data visualization tools, Earth science education, human dimensions of natural hazards, disaster risk reduction (DRR), resilience building.


    Wednesday July 18, 2018 11:00am - 12:45pm
    Canyon C

    11:15am

    Plenary - Informatics activities in Tucson
    • Dr. Bill Smith, Natural Resources & Environment, University of AZ (15+5)
    • Rowena Davis, Belmont Forum e-Infra (15 mins+5)
    • Dr. Bryan Heidorn & Gretchen Stahlman, School of Information, Long-tail Dark Data (15 mins + 5)
    • Dr. Tyson Swetnam, CyVerse (15 mins + 5)
    Plenary wrap-up & Lunch time discussion

    To view a live broadcast of this event streaming on Youtube, click here.


    Speakers & Moderators
    avatar for Rowena Davis

    Rowena Davis

    AZGS
    Rowena Davis is a project coordinator for the Belmont Forum e-Infrastructures and Data Management project. The Belmont Forum is a group of the world's major and emerging funders of global environmental change research. The e-I&DM project aims to develop a coordinated approach to sharing... Read More →
    BH

    Bryan Heidorn

    University of Arizona
    P. Bryan Heidorn is the Director of the University of Arizona School of Information. Prior to coming to the UA, Heidorn was a faculty member of the Graduate School of Library and Information Science at the University of Illinois at Urbana-Champaign. For the last two years he also... Read More →
    avatar for Bill Smith

    Bill Smith

    I am a member of the Earth Dynamics Observatory and PI of the Ecosystem Climate Dynamics lab at the University of Arizona. Our research focuses on understanding the complex responses of the terrestrial biosphere to rising atmospheric CO2, climate change, and land-use change across... Read More →
    avatar for Gretchen Stahlman

    Gretchen Stahlman

    University of Arizona
    Gretchen Stahlman is a PhD candidate in the University of Arizona School of Information. She holds a Master of Science degree in Library Science from Clarion University of Pennsylvania. As a UA iSchool doctoral student, Gretchen has participated in several projects exploring cyberinfrastructure... Read More →
    avatar for Tyson Swetnam

    Tyson Swetnam

    University of Arizona
    My primary responsibilities at CyVerse involve the deployment of Spatial Data Infrastructure (SDI) for life science and agricultural research. I also work closely with the NSF Critical Zone Observatory NetworkOpenTopography, and XSEDEdeploying scalable GIS applications running on CyVerse resources... Read More →


    Wednesday July 18, 2018 11:15am - 12:45pm
    Grand Canyon Ballroom
    • Remote Participation Link https://global.gotomeeting.com/join/752150301
    • Remote Participation Access Code 752-150-301
    • Remote Participation Phone # (646) 749-3129 More phone numbers Australia: +61 2 9087 3604 Austria: +43 7 2081 5427 Belgium: +32 28 93 7018 Canada: +1 (647) 497-9391 Denmark: +45 32 72 03 82 Finland: +358 942 72 1060 France: +33 170 950 594 Germany: +49 692 5736 7317 Ireland: +353 16 572 651 Italy: +39 0 247 92 13 01 Netherlands: +31 202 251 017 New Zealand: +64 9 280 6302 Norway: +47 21 93 37 51 Spain: +34 932 75 2004 Sweden: +46 853 527 827 Switzerland: +41 435 5015 61 United Kingdom: +44 20 3713 5028

    12:30pm

    Lunch
    Lunch will include a facilitated discussion about how meeting participants have made data matter together over the last two decades and how they are making data matter going forward. 

    Speakers & Moderators
    avatar for Steve Diggs

    Steve Diggs

    Technical Director, CCHDO, Scripps Institution of Oceanography / UCSD
    avatar for Christine White

    Christine White

    Technical Advisor, Esri


    Wednesday July 18, 2018 12:30pm - 2:00pm
    Grand Canyon Ballroom

    2:00pm

    Standards and technologies for sensor QA/QC annotations, metadata capture, and automated workflows
    As sensor networks become more ubiquitous and complex, the scientific community requires a convergence on standards for metadata (e.g., deployment conditions and QC annotations) as well as technology tools to facilitate semi- or fully-automated workflows. Protocols such as SensorML and other emerging standards exist, so how well do they fit these needs and how might they be incorporated into new software tools and frameworks? Where are the gaps and opportunities? This session will incorporate a mix of short presentations (5-10min/ea) on existing tools and forward-looking ideas, as well as group discussion.

    Speakers & Moderators
    avatar for Renée F. Brown

    Renée F. Brown

    Information Manager, McMurdo Dry Valleys LTER
    aridland ecosystems, sensor networks in ecology, nitrogen and carbon biogeochemical cycles, climate change, long-term ecological research
    avatar for Scotty Strachan

    Scotty Strachan

    Director of Cyberinfrastructure, University of Nevada, Reno
    Institutional cyberinfrastructure, sensor-based science, mountain climate observatories!


    Wednesday July 18, 2018 2:00pm - 3:30pm
    Canyon B

    2:00pm

    ESIP Educators Workshop - Computing in the Classroom - Coding, Gadgets, and STEM
    ESIP Education is pleased to offer a 1-day workshop focusing on computer tools that facilitate data exploration. Coding, and coding gadgets (example: Raspberry Pi) are activities that function on a level just above programming yet employ similar critical thinking skills to enable digital data manipulation for specific results. Fifteen regional educators will join the ESIP education community at this stimulating hands-on workshop.

    Speakers & Moderators
    avatar for LuAnn Dahlman

    LuAnn Dahlman

    Science Writer and Editor, NOAA Climate Program Office
    The updated Climate Explorer application.
    avatar for Kalo Haslem

    Kalo Haslem

    STEAM Teacher, TCEA
    STEAM Teacher at a charter school part of the Education cohort
    avatar for Shelley Olds

    Shelley Olds

    Science Education Specialist, UNAVCO
    Data visualization tools, Earth science education, human dimensions of natural hazards, disaster risk reduction (DRR), resilience building.


    Wednesday July 18, 2018 2:00pm - 3:30pm
    Canyon C

    2:00pm

    Advancing netCDF-CF - Part 1
    Update and discussion of recent netCDF-CF activities and next steps.

    Speakers & Moderators

    Wednesday July 18, 2018 2:00pm - 3:30pm
    Madera

    2:00pm

    Using Jupyter for Cloud-based Analysis
    Python and Cloud computing have become ubiquitous in the Earth Science community. Jupyter provides a browser-based workbench environment for researchers to create sharable notebooks with code snippets to interact with data and services on the internet. Participants of this workshop will use ESIPhub to learn about using Jupyter notebooks to interact with cloud-based services for scientific research and analysis.

    Speakers & Moderators
    avatar for Sean Gordon

    Sean Gordon

    Metadata Developer, The HDF Group
    Talk to me about the ESIP Labs project, ESIPhub a JupyterHub based shared computational environment for workshops at Meetings.My research focuses on the connections between documentation structures and the evaluation of content for the metadata needs of diverse communities of practice... Read More →
    avatar for Thomas Huang

    Thomas Huang

    Technical Group Supervisor, JPL



    Wednesday July 18, 2018 2:00pm - 3:30pm
    Pima
    • Subject Jump In
    • Remote Participation Link https://global.gotomeeting.com/join/752150301
    • Remote Participation Access Code 752-150-301
    • Remote Participation Phone # (646) 749-3129 More phone numbers Australia: +61 2 9087 3604 Austria: +43 7 2081 5427 Belgium: +32 28 93 7018 Canada: +1 (647) 497-9391 Denmark: +45 32 72 03 82 Finland: +358 942 72 1060 France: +33 170 950 594 Germany: +49 692 5736 7317 Ireland: +353 16 572 651 Italy: +39 0 247 92 13 01 Netherlands: +31 202 251 017 New Zealand: +64 9 280 6302 Norway: +47 21 93 37 51 Spain: +34 932 75 2004 Sweden: +46 853 527 827 Switzerland: +41 435 5015 61 United Kingdom: +44 20 3713 5028
    • Tags Cloud Computing

    2:00pm

    Publishing schema.org Dataset: Lessons Learned and Paths Forward
    Progress surrounding the schema.org type Dataset have made it an attractive way for repositories to expose dataset metadata to search engines. The NSF EarthCube initiative funded a short-term project, P418, to explore what could be achieved if repositories could adopt schema.org as a mechanism for self-publishing information using a common schema. As part of this project a number of repositories volunteered to try publishing schema.org by embedding it in their websites.

    In this session, we will:
    introduce the P418 project goals and the philosophy of behind using schema.org, (15min)
    and then explore some real-word schema.org publishing stories to: (30min)
    hear about the various techniques used and challenges encountered for embedding the schema.org markup in web pages,
    understand how well schema.org covers a repository’s own metadata model,
    discuss where schema.org needs extensions and how the geoscience community can collectively move forward to improving the quality of the markup.

    For more schema.org sessions see:
    Tuesday, July 17 • 11:30am - 1:00pm Metadata Evaluation Lab at ESIP: Assessing if community metadata is ready for a Schema.org
    Tuesday, July 17 • 4:00pm - 5:30pm Semantics in Action

    Speakers & Moderators
    avatar for John Relph

    John Relph

    Disruptor, NESDIS/NCEI
    OneStop, Metadata, Archival, Automation, Data Management, Canaan Dogs
    avatar for Adam Shepherd

    Adam Shepherd

    Technical Director, Co-PI, BCO-DMO @ WHOI
    schema.org | Data Containerization | Linked Data | Semantic Web | Knowledge Representation | Ontologies



    Wednesday July 18, 2018 2:00pm - 3:30pm
    Sabino

    2:00pm

    Software and Services Workshop - Development of Examples to address Use Cases
    Work with us as we expand on the use cases gathered earlier in the week to create guidance and examples for how to represent software or services citation in your work and publications. The final product will be a reference guide for data managers, librarians, and journals to use as guidance for authors and researchers on Best Practices for Software and Services Citations.

    Speakers & Moderators
    JG

    James Gallagher

    President, OPeNDAP
    avatar for Jessica Hausman

    Jessica Hausman

    Data Engineer, PO.DAAC JPL
    avatar for Shelley Stall

    Shelley Stall

    Senior Director, Data Leadership, American Geophysical Union
    Shelley Stall is the Senior Director of Data Leadership at the American Geophysical Union. Shelley has more than two decades of experience working in high-volume, complex data management environments. She has helped organizations in not-for-profit, commercial, defense, and federal... Read More →


    Wednesday July 18, 2018 2:00pm - 3:30pm
    Ventana

    3:30pm

    Networking Break
    Wednesday July 18, 2018 3:30pm - 4:00pm
    Foyer 880 E 2nd St, Tucson, AZ 85719

    4:00pm

    Earth Science Data Uncertainty – White Paper Development
    During the 2017 ESIP Summer Meeting, the Information Quality Cluster (IQC) sponsored a plenary panel session and a breakout session focused on Earth science data uncertainty. Expert panellists presented to the ESIP audience key aspects of scientific quality and addressed questions such as "How is uncertainty determined and characterized in the products of their research or application? What are the major side effects and limitations of common statistical techniques used to quantify and characterize uncertainty? What is the impact of uncertainty on the quality of their data products? How is data uncertainty accounted for when multiple sources of data are spliced and woven into a single product? How do they document and convey the information about uncertainty to other scientific users? What is the best way of conveying uncertainty to the (possibly skeptical) public?" Following considerable discussion during the breakout session, a one of the key action items recommended was that a clear understanding of the concept of uncertainty, and its communication to users was essential, and that the IQC should develop a white paper to satisfy this objective. During the 2017 ESIP Winter Meeting, the IQC held a session titled "Formulation of a White Paper on Earth Science Data Uncertainty" where further presentations by experts was held about the mathematical basis for uncertainty as well as uncertainty from the point of view of scientific data producers and applications' users, followed by subgroup discussions for formulating the white paper. An outline of the white paper has been developed and reviewed by IQC members and a number of individuals have signed up to be co-authors and/or reviewers.

    The purpose of this breakout session is to continue this progress by acquiring a final set considerations and recommendations from additional domain experts spanning multiple disciplines of Earth science and data science. This will be followed by a working session to finalize the outline and establish the final points and issues that should be addressed by the paper.

    The following is a list of speakers, titles and brief descriptions of the talks to be delivered by the panelists:

    Michael Little – NASA AIST Program
    Presentation Title: “An AIST Program View of the Significance of Uncertainty in Data Exploitation”
    Presentation Summary: While uncertainty is important, everyone has a different idea about what it means, how to characterize it and what it should be used for. I hope to provide a brief overview of how I think it’s used in programmatic decisions.

    Jeff Privette – NOAA/NCEI
    Presentation Title: “NOAA/NCEI’s Approaches to Informing Users on Uncertainty”
    Presentation Description: Determining uncertainty in environmental data is critical but typically challenging and expensive. Therefore, many scientists either ignore, guess at, or use validation studies to estimate it. This can lead to misleading results in climate monitoring, especially in derivative or time-series results such as climate anomalies or rankings. NCEI is adopting a policy to inform data users of what uncertainty, validation, and quality assurance has been applied so that users are informed prior to ordering the data.

    Faozi Said – NOAA/NESDIS
    Presentation Title: “Leveraging Cal/Val to Effectively Communicate the Physical Data Limitations”
    Presentation Summary: Using a real world example, we explore on a high level the calibration and validation (Cal/Val) challenges we face and how the possible issues and uncertainty in the data are communicated to the end user.

    Jonathan Hobbs – Jet Propulsion Laboratory
    Presentation Title: “Probability as a Foundation for Data Uncertainty: Applications in Remote Sensing”
    Presentation Summary: Earth science data records often include products that combine models with indirect observations of their quantity of interest. Probability serves as a foundational tool for characterizing uncertainty for these complex methods. This presentation will illustrate these ideas in the context of remote sensing retrievals.


    Speakers & Moderators
    ML

    Mike Little

    AIST Program Manager, NASA
    DM

    David Moroni

    Data Stewardship and User Services Team Lead, Jet Propulsion Laboratory, Physical Oceanography Distributed Active Archive Center
    I am a Senior Science Data Systems Engineer at the Jet Propulsion Laboratory and Data Stewardship and User Services Team Lead for the PO.DAAC Project, which provides users with data stewardship services including discovery, access, sub-setting, visualization, extraction, documentation... Read More →
    avatar for Ge Peng

    Ge Peng

    Research Scholar, CICS-NC/NCEI
    Dataset-centric scientific data stewardship, data quality management
    JP

    Jeff Privette

    NOAA/NCEI
    avatar for Hampapuram Ramapriyan

    Hampapuram Ramapriyan

    Research Scientist/SME, Science Systems and Applications, Inc.
    Information Quality, Data Stewardship, Provenance, Preservation Standards
    FS

    Faozi Said

    NOAA/NESDIS


    Wednesday July 18, 2018 4:00pm - 5:30pm
    Canyon A

    4:00pm

    Community Resilience: Demonstrating the Socioeconomic Value of Earth Science data (Part II)
    Can Earth Science data contribute socioeconomic value by enhancing place-based community resilience? In January, we held Part I - and from that session we developed a roadmap for engaging within the ESIP community to host a data-specific community resilience session. This session will be a breakout session to follow on from that work, with the intention to facilitate collaboration between place-based community resilience data consumers and decision-makers, and ESIP data practitioners and community members.

    This session will seek engagement from city planners, resilience officers, or data consumers to talk about what their objectives are and what data-related pain points they have (e.g. issues they might be having with access, discovery, wrangling or analyzing the earth science data they need). There will be collaborative participation from ESIP data community members (e.g. Semantic Tech committee and ESDA) to work together to develop possible solutions. The session would be run in a workshop-style, with

    Speakers & Moderators

    Wednesday July 18, 2018 4:00pm - 5:30pm
    Canyon B

    4:00pm

    ESIP Educators Workshop - Computing in the Classroom - Coding, Gadgets, and STEM
    ESIP Education is pleased to offer a 1-day workshop focusing on computer tools that facilitate data exploration. Coding, and coding gadgets (example: Raspberry Pi) are activities that function on a level just above programming yet employ similar critical thinking skills to enable digital data manipulation for specific results. Fifteen regional educators will join the ESIP education community at this stimulating hands-on workshop.

    Speakers & Moderators
    avatar for LuAnn Dahlman

    LuAnn Dahlman

    Science Writer and Editor, NOAA Climate Program Office
    The updated Climate Explorer application.
    avatar for Kalo Haslem

    Kalo Haslem

    STEAM Teacher, TCEA
    STEAM Teacher at a charter school part of the Education cohort
    avatar for Shelley Olds

    Shelley Olds

    Science Education Specialist, UNAVCO
    Data visualization tools, Earth science education, human dimensions of natural hazards, disaster risk reduction (DRR), resilience building.


    Wednesday July 18, 2018 4:00pm - 5:30pm
    Canyon C

    4:00pm

    Advancing netCDF-CF - Part 2
    Update and discussion of recent netCDF-CF activities and next steps.

    Speakers & Moderators

    Wednesday July 18, 2018 4:00pm - 5:30pm
    Madera

    4:00pm

    What to do at a PROV roadblock
    To make the PROV graph sing, all nodes should be identifiers to references and should be resolvable and maintained by the appropriate community group. So, when this isn't the case... how do we move ahead anyway?

    This session will focus on:

    A. How to come to community consensus when roadblocks arise. 
    B. How to resolve issues independently and move forward! 

    The session will also explore how the ESIP Community Ontology Repository fits into these challenges.

    Shared Notes: http://bit.ly/PROVroadblocks

    Slide: https://docs.google.com/presentation/d/17NI-U4uRiIkjrTQLd0kB8WgDBIwu-wN_hJxgn4HPNhE/edit?usp=sharing

    Speakers & Moderators
    avatar for Annie Burgess

    Annie Burgess

    ESIP Lab Director, ESIP


    Wednesday July 18, 2018 4:00pm - 5:30pm
    Pima
    • Subject Deep Dive
    • Remote Participation Link https://global.gotomeeting.com/join/752150301
    • Remote Participation Access Code 752-150-301
    • Remote Participation Phone # (646) 749-3129 More phone numbers Australia: +61 2 9087 3604 Austria: +43 7 2081 5427 Belgium: +32 28 93 7018 Canada: +1 (647) 497-9391 Denmark: +45 32 72 03 82 Finland: +358 942 72 1060 France: +33 170 950 594 Germany: +49 692 5736 7317 Ireland: +353 16 572 651 Italy: +39 0 247 92 13 01 Netherlands: +31 202 251 017 New Zealand: +64 9 280 6302 Norway: +47 21 93 37 51 Spain: +34 932 75 2004 Sweden: +46 853 527 827 Switzerland: +41 435 5015 61 United Kingdom: +44 20 3713 5028
    • Tags Disaster Lifecycle, Documentation, Information Quality, Semantic Technologies

    4:00pm

    TaskAPI - A Scalable Computing Platform for Large Scientific Data Systems
    TaskAPI is a workflow platform and DSL (Domain Specific Language) that provides automatic horizontal and vertical scaling of multi-language data-intensive scientific software systems using a functionally declarative workflow paradigm. TaskAPI is capable of quickly wrapping legacy systems, provides structured guidance for best-practices in continued or new development via its JSON DSL, and automatically provides system components with a unified, straightforward API for centralized logging, job and task killing, and configurable property use.

    TaskAPI was developed to serve as the backbone for the reengineered US ASOS Ingest software system and exists as its own distributable package for use by other large polyglot systems.

    This session will begin by providing a broad overview (surface skim) of the TaskAPI platform, including motivation, capabilities, current and potential use cases, and design and performance characteristics.

    The summary will lead into a more detailed look at the TaskAPI structure, including the DSL setup, workflow branching, task types, multi-language parallelization techniques in Java, C, and Fortran, and current and planned language support (Python via Jep, Clojure, Scala via drivers), and other features (Kafka messaging queues).

    After we thoroughly explain the system and its capabilities, we will deep dive into a live example of TaskAPI as it was implemented in ASOS, examining real-life challenges we faced and how to think about and implement best use practices.

    Finally, we will assist session attendees and participants in determining if this system could serve their own projects, provide assistance in TaskAPI download and setup, and solicit feature requests and needs.

    Speakers & Moderators
    avatar for Ryan Berkheimer

    Ryan Berkheimer

    Software Research, GST at NOAA NCEI



    Wednesday July 18, 2018 4:00pm - 5:30pm
    Sabino

    4:00pm

    Data Management Training Working Group Business Meeting & IMLS Project Launch
    The Data Management Training Working Group (DMT WG) is nearing the end of its two year cycle, so this session will be both a business meeting in which participants apprise interested parties of the many activities and accomplishments of the DMT WG during the past two years, and also launch a follow up project focused upon the Data Management Training Clearinghouse (DMTC). The collaboratively developed and maintained Data Management Training Clearinghouse (DMTC) has recently been awarded a three year Institute of Museum & Library Science (IMLS) grant as part of its National Digital Library Program. The IMLS grant will enable the DMTC to expand and enhance its web presence and services by extending and diversifying the content included in the clearinghouse, enhancing cataloging and classification, and developing a method to enable feedback between trainers and trainees. This session is designed as a follow up to a working session in which governance of the DMTC moves from an ESIP Working Group to an Advisory Board structure.

    Speakers & Moderators
    avatar for Karl Benedict

    Karl Benedict

    Director of Research Data Services, University of New Mexico
    For nearly 30 years Karl Benedict has had parallel careers in Information Technology, Data Management and Analysis, and Archaeology. Over the last 22 years at UNM he has worked as a Graduate Student in Anthropology, Research Scientist, Applied Research Center Director, and currently... Read More →
    avatar for Nancy Hoebelheinrich

    Nancy Hoebelheinrich

    Principal, Knowledge Motifs LLC
    See my LinkedIn profile at: https://www.linkedin.com/in/nancy-hoebelheinrich-0576ba3
    SH

    Sophie Hou

    Data Curation and Stewardship Coordinator, National Center for Atmospheric Research
    data management/curation/stewardship: including but not limited to data life cycle, policies, sustainability, education and training, data quality, usability.


    Wednesday July 18, 2018 4:00pm - 5:30pm
    Ventana

    6:30pm

    Research as Art
    Once again at this year’s ESIP Summer Meeting we’ll hold a Research as Art event on the evening of Wednesday, July 18. Our goal is to encourage the ESIP community to use visual media to communicate their data and research; and to think about their research as an ongoing narrative that can be told through visual media. This event is about showing how the ESIP community uses data. You don’t need to consider yourself an artist in order to submit a piece. The idea is to have a range of entries that show the diversity of research done by members of our community, as well as their creativity and the impact of their work, in an engaging and accessible way.

    Research as Art submission call closes July 6.

    Speakers & Moderators
    avatar for Christine White

    Christine White

    Technical Advisor, Esri


    Wednesday July 18, 2018 6:30pm - 8:00pm
    CyVerse 1601 E Helen St, Tucson, AZ 85719

    6:30pm

    20th Anniversary Reception
    Making data matter together. ESIP has supported this vision for 20 years and we are delighted to celebrate this! CyVerse is hosting a reception in their new building.

    We will be welcomed by Cyverse and have brief remarks from Esri and the University of Arizona as our meeting sponsors. 

    Speakers & Moderators
    avatar for Christine White

    Christine White

    Technical Advisor, Esri


    Wednesday July 18, 2018 6:30pm - 8:30pm
    CyVerse 1601 E Helen St, Tucson, AZ 85719
     
    Thursday, July 19
     

    7:30am

    Registration Open
    Thursday July 19, 2018 7:30am - 8:00am
    Foyer 880 E 2nd St, Tucson, AZ 85719

    8:00am

    ESIP Lab: Incubator Outcomes, Google Summer of Code Update, Advances in Provenance, USGS + ESIP Lab Partnership
    A little more than a year ago, we created the ESIP Lab, run by Annie Burgess, the ESIP Lab Director. The ESIP Lab empowers the scientific community by supporting ideation, incubation and evaluation of Earth sciences cyberinfrastructure and encompasses most of the ESIP's funded projects - incubator, FUNding Friday and tech evaluation. In this session, we will give a short primer on the lab and focus on recent outcomes of 2017 funded projects.  

    ESIP Lab and USGS
    • Sky Bristol
    Provenance
    • Lewis McGibbney
    • Doug Fils
    Google Summer of Code
    • Ryan Berkheimer 
    • Lewis McGibbney
    Fall 2017 Funded Incubator Projects 
    Fall 2017 projects are wrapping up their development cycle and will be presented during the Thursday plenary session at the Summer Meeting. The project summaries are here:

    Speakers & Moderators
    avatar for Ryan Berkheimer

    Ryan Berkheimer

    Software Research, GST at NOAA NCEI
    avatar for Annie Burgess

    Annie Burgess

    ESIP Lab Director, ESIP
    avatar for Sean Gordon

    Sean Gordon

    Metadata Developer, The HDF Group
    Talk to me about the ESIP Labs project, ESIPhub a JupyterHub based shared computational environment for workshops at Meetings.My research focuses on the connections between documentation structures and the evaluation of content for the metadata needs of diverse communities of practice... Read More →
    avatar for Andrea Thomer

    Andrea Thomer

    Assistant Professor, University of Michigan, School of Information
    I'm an information scientist interested in biodiversity and earth science informatics, natural history museum data, data curation, information organization, and computer-supported cooperative work! I'm looking for students!
    avatar for Christine White

    Christine White

    Technical Advisor, Esri




    Thursday July 19, 2018 8:00am - 9:15am
    Grand Canyon Ballroom
    • Remote Participation Link https://global.gotomeeting.com/join/752150301
    • Remote Participation Access Code 752-150-301
    • Remote Participation Phone # (646) 749-3129 More phone numbers Australia: +61 2 9087 3604 Austria: +43 7 2081 5427 Belgium: +32 28 93 7018 Canada: +1 (647) 497-9391 Denmark: +45 32 72 03 82 Finland: +358 942 72 1060 France: +33 170 950 594 Germany: +49 692 5736 7317 Ireland: +353 16 572 651 Italy: +39 0 247 92 13 01 Netherlands: +31 202 251 017 New Zealand: +64 9 280 6302 Norway: +47 21 93 37 51 Spain: +34 932 75 2004 Sweden: +46 853 527 827 Switzerland: +41 435 5015 61 United Kingdom: +44 20 3713 5028

    9:30am

    Preparing for the CoreTrustSeal - Insights and Lessons Learned
    As part of a coalition convened by the American Geophysical Union (AGU) and also in collaboration with the Research Data Alliance, ESIP is fostering capabilities to make data Findable, Accessible, Interoperable, and Reusable (FAIR) by contributing to the Enabling FAIR Data project. Among many key contributors and stakeholders of the effort, data repositories and their capabilities are vital to enable and facilitate the FAIR principles as developed by Force11.org.

    In order to understand the data repositories’ services and their maturity levels, there are several options for assessing a data repository, and one possibility that is being considered by the Enabling FAIR Data project is the CoreTrustSeal. While the CoreTrustSeal has provided guidelines and tutorials regarding the assessment process, a data repository might still have questions regarding the best approach in evaluating its services per the CoreTrustSeal requirements.

    During this session, the speakers will present their experiences to advise potential or would-be CoreTrustSeal applicants on several key areas including licensing, ethical norms, funding and preservation level. The speakers will also interact with the audience to answer any additional questions or concerns that the audience might have.

    Session Agenda:
    1. Introduction - Sophie Hou and Doug Schuster (National Center for Atmospheric Research)
    2. Presentations:
      • Lisa Johnston – University of Minnesota
      • Roger Weaver – Missouri University of Science and Technology
      • Bob Downs – CIESIN-SEDAC, Columbia University
    3. Discussion/Q&As
    4. Next steps

    Speakers & Moderators
    SH

    Sophie Hou

    Data Curation and Stewardship Coordinator, National Center for Atmospheric Research
    data management/curation/stewardship: including but not limited to data life cycle, policies, sustainability, education and training, data quality, usability.
    avatar for Kerstin Lehnert

    Kerstin Lehnert

    IGSN e.V.
    Kerstin Lehnert, Chair of the EarthCube Leadership Council (December 2015 - May 2018), is Senior Research Scientist at the Lamont-Doherty Earth Observatory of Columbia University and Director of the NSF-funded data facility IEDA (Interdisciplinary Earth Data Alliance). Kerstin holds... Read More →
    avatar for Roger Weaver

    Roger Weaver

    Scholarly Communications Librarian, Missouri University of Science and Technology


    Thursday July 19, 2018 9:30am - 11:00am
    Canyon A

    9:30am

    Quantifying (Yes Quantifying!) Value of EO Data via Socioeconomics
    If putting a dollar value on your research makes you queasy, yet you must do it to justify and continue your work, this session is here to help. This session will feature 3 presentations on approaches and tools for tying your Earth Observations (EO) data with quantifiable business value. Building upon other ESIP threads such as the benefits of using EO data to build resilient communities, and spatial information as a key to link science, demographics, and economic value, presentations will focus on business use cases for EO, how to connect observation data to economic datasets, and how to position your EO research products. During the latter half of the session, we invite discussion on real world experiences, and challenges and successes you encounter.

    Presentations:
    1) What's the Value of Integrating Socioeconomic and Earth Observations Data? - Bob Downs (CIESIN), Bob Chen (CIESIN), and Karen Moe (NASA GSFC)
    2) Increase the Relevance, Impact, and Efficiency of Your Research - Ben Hickson (University of Arizona)
    3) Understanding Value to Articulate Worth in EO Data - Christine White (Esri), Laura McNulty (Esri), and Tripp Corbett (Esri)

    Speakers & Moderators
    avatar for Christine White

    Christine White

    Technical Advisor, Esri


    Thursday July 19, 2018 9:30am - 11:00am
    Canyon B

    9:30am

    EarthCube CDF General Assembly Meeting
    The Council of Data Facilities (CDF) is committed to working with relevant agencies, professional associations, initiatives, and other complementary efforts to enable transformational science, innovative education, and informed public policy through increased coordination, collaboration, and innovation in the acquisition, curation, preservation, and dissemination of geoscience data, tools, models, and services. Existing and emerging geoscience data facilities – through the Council – are committed to serving as an effective foundation for EarthCube. The General Assembly meeting is open to the official representatives from all member data facilities, additional member organization personnel as desired by the members, as well as observers. Agenda 9:30am Intro and sign-in of CDF members and guests {Tim Ahern, Lynne Schreiber) 10:00am P418/19 and the CDF (Mohan Ramamurthy) 10:30am Infrastructure sharing for CDF (Kerstin/Tim Ahern) 11:00-11:20am Break 11:20am Proposed Changes in CDF Charter, Formal Vote on Charter Changes by Active members (Tim Ahern) Election of new CDF Exec, Introduction of Slate of Candidates (Lindsey Powers, Tim Ahern) 12:10pm ROI metrics (Corinna Gries) 12:40pm ORCiD (Eric Olson, ORCiD) 13:10pm Election Results (Lynne Schreiber) 13:15pm End of Meeting

    Speakers & Moderators
    avatar for Eric Olson

    Eric Olson

    Engagement Lead, North America., ORCID
    Eric supports ORCID members as they develop new and existing integrations and workflows. Before joining ORCID, Eric worked on the PressForward publishing software at the Roy Rosenzweig Center for History and New Media, where he recruited and trained research organizations to utilize... Read More →


    Thursday July 19, 2018 9:30am - 11:00am
    Canyon C

    9:30am

    How can CHRS RainSphere and USDA AgRisk Viewer teams promote the use of their tools for climate resilience?
    This session-within-a-session comprises two parts. Attendees will first hear how CHRS (Center for Hydrometeorology and Remote Sensing) RainSphere staff and USDA SWCH (Southwest Climate Hub) staff help make climate data and climate projections more easily understood and accessible to their respective users. Then, session attendees will draw from each presentation to outline a case study for the Climate Resilience Toolkit (CRT). Attendees can use strategies from the session to highlight their own projects' accomplishments.

    This is the 5th workshop sponsored by ESIP's Agriculture & Climate Cluster to develop CRT case studies.

    Workshop agenda:

    [5 min] Introduction to the workshop, logistics, larger goal to establish a CRT pipeline at the ESIP level

    [5 min] LuAnn Dahlman, NOAA, Introduction to CRT, including a backwards journey from a CRT case study to the story template

    [15 min] Hoang Tran (Univ. of California, Irvine), CHRS RainSphere - a new user friendly tool for analyzing global remotely sensed rainfall estimates (HoangT abstract)

    [15 min] Julian Reyes (USDA Southwest Climate Hub), Toward accessible, discoverable, and usable crop insurance data: Multi-scale analysis and visualization of cause of loss (JulianR abstract)

    [45 min] Group discussion on and drafting of an incipient story, for each presentation, that would become a CRT case study

    [5 min] Wrap-up: Next steps.

    Speakers & Moderators
    avatar for LuAnn Dahlman

    LuAnn Dahlman

    Science Writer and Editor, NOAA Climate Program Office
    The updated Climate Explorer application.
    avatar for Nancy Hoebelheinrich

    Nancy Hoebelheinrich

    Principal, Knowledge Motifs LLC
    See my LinkedIn profile at: https://www.linkedin.com/in/nancy-hoebelheinrich-0576ba3
    avatar for Julian Reyes

    Julian Reyes

    Climate Hub Fellow, USDA Southwest Climate Hub
    avatar for Bill Teng

    Bill Teng

    Principal Scientist, NASA GES DISC (ADNET)
    avatar for Hoang Viet Tran

    Hoang Viet Tran

    Ph.D. Candidate, University of California, Irvine
    Ph.D. student from University of California, Irvine



    Thursday July 19, 2018 9:30am - 11:00am
    Madera

    9:30am

    Using Cloud Object Stores for Data Storage and Data Services
    This session will review and discuss a variety of approaches for using cloud object store technologies to support earth system science (ESS) data storage and data services. On the data storage side, we will discuss a range of projects from those storing existing ESS data files in object stores to projects developing data formats designed with object stores in mind (Zarr, TileDB, etc.).

    On the data service end, we will discuss architectures and solutions to leverage object stores for improving data access and analysis (NEXUS, WSWM, HSDS, etc.). Agenda:
    • Intro
    • James Gallagher: “Adapting existing software to the cloud and preserving the user experience”
    • Frank Greguska: “Using S3 in Apache Science Data Analytics Platform (SDAP) for data ingestion and analytics
    • Lauren Frederick: “Cumulus”
    • John Readey: "HDF in the Cloud - HSDS"
    • Rich Signell: "Cloud-friendly ndarray formats"
    • Discussion

    Speakers & Moderators
    avatar for Ethan Davis

    Ethan Davis

    UCAR Unidata
    JG

    James Gallagher

    President, OPeNDAP
    avatar for Thomas Huang

    Thomas Huang

    Technical Group Supervisor, JPL


    Thursday July 19, 2018 9:30am - 11:00am
    Pima
    • Subject Deep Dive
    • Remote Participation Link https://global.gotomeeting.com/join/752150301
    • Remote Participation Access Code 752-150-301
    • Remote Participation Phone # (646) 749-3129 More phone numbers Australia: +61 2 9087 3604 Austria: +43 7 2081 5427 Belgium: +32 28 93 7018 Canada: +1 (647) 497-9391 Denmark: +45 32 72 03 82 Finland: +358 942 72 1060 France: +33 170 950 594 Germany: +49 692 5736 7317 Ireland: +353 16 572 651 Italy: +39 0 247 92 13 01 Netherlands: +31 202 251 017 New Zealand: +64 9 280 6302 Norway: +47 21 93 37 51 Spain: +34 932 75 2004 Sweden: +46 853 527 827 Switzerland: +41 435 5015 61 United Kingdom: +44 20 3713 5028
    • Tags Cloud Computing

    9:30am

    Custom Built Jupyter Widgets for Earth Science
    Presentation slides: g.co/earth/esip2018-widgets

    For many scientific questions in the Earth sciences, the sheer volume of observed and/or modeled data is a barrier to progress, as it is difficult to explore and analyze using the traditional paradigm of downloading datasets to a local computer for analysis. Furthermore, methods for communicating Earth science algorithms that operate on large datasets in an easily understandable and reproducible way are needed. The Jupyter project has created several tools for general data science that can be leverage for exploratory data analysis of tera- to peta-byte scale geospatial data datasets.

    This session will be a hands-on introduction to:
    • JupyterLab (the Jupyter project's next-generation UI)
    • Jupyter Widgets (the interconnection between the UI and a Python kernel)
    • Earth Engine (Google's cloud-based geospatial analysis API)
    • Examples of satellite data exploration and analysis
    In addition, we will be using the following technologies:
    • JupyterHub for hosting the multi-user environment
    • Docker for packaging up JupyterLab, the Earth Engine Python API, and dozens of scientific Python packages
    • GitHub for sharing all of the session content
    Learn more about Jupyter and attend the other workshops using ESIPhub:

    * Tuesday morning includes a general overview of Jupyter usage in our community.
    http://sched.co/Eype
    * Just after the overview session is a Metadata Improvement Lab focused on schema.org for datasets.  
    http://sched.co/Eypl
    * Wednesday afternoon is a workshop for cloud-based analysis.
    http://sched.co/EyqK


    Speakers & Moderators
    avatar for Tyler Erickson

    Tyler Erickson

    Developer Advocate, Google
    avatar for Sean Gordon

    Sean Gordon

    Metadata Developer, The HDF Group
    Talk to me about the ESIP Labs project, ESIPhub a JupyterHub based shared computational environment for workshops at Meetings.My research focuses on the connections between documentation structures and the evaluation of content for the metadata needs of diverse communities of practice... Read More →



    Thursday July 19, 2018 9:30am - 11:00am
    Sabino

    9:30am

    JSON Encodings for Spatial Data: Data Modeling, Dialects and Languages
    A number of JSON serializations exist for representing Earth Observation data however further work needs to be undertaken to align efforts, reduce overlap and expand/evangelize usage and software implementations. Some examples include TopoJSON, JSON-LD, hdf5-json, GeoJSON, CovJSON, CF-JSON, NCO-JSON, STAR JSON and there are several others.
    Due to initiatives such as the W3C + OGC Spatial Data on the Web Working Group (which resulted in a significant advancement of the CovJSON standard) this issue is gathering so much interest that NASA has recently initiated a dedicated Earth Science Data Systems Working Group to investigate, evaluate and provide a formal NASA recommendation for use of JSON Encodings for Spatial Data.
    HDF5/JSON preserves the data and metadata of any HDF5 dataset through a round-trip encoding (i.e., HDF5 -> JSON -> HDF5). Thus HDF5/JSON is automatically 100% lossless. NCO-JSON is a compa-rable turnkey solution that serves for netCDF a similar role as HDF5/JSON for HDF5. Although netCDF can be implemented as a subset of HDF5, the two APIs and their vocabularies are so differ-ent that using or extending HDF5/JSON to represent netCDF files would be unnecessarily complex. CF-JSON, ERDDAP, and STAR JSON all implement the NCO-JSON dialect, designed to represent any data stored in netCDF format. Differences include that CF-JSON is designed to extend to higher-level CF constructs while STAR JSON includes a library for potentially faster conversion to and from JSON. NCO-JSON also provides lossy options to reduce JSON verbosity and size and increase legibil-ity.
    Critical production-grade software infrastructure such as OPeNDAP also provides several mecha-nisms for serializing JSON data retrievals. Very recently, OPeNDAP developers have provided the ability to retrieve CovJSON responses from OPeNDAP queries. We anticipate that this functionality will enable a new generation of high performance applications.

    Some stakeholders and interested parties in this area include:
    · Web application developers tasked with designing and developing applications which con-sume EO spatial data.
    · Parties interested in serving and consuming Spatial data on the Web.
    · Developers at data centers who currently distribute/expose endpoints/resources which serve spatial data
    Two back-to-back sessions will therefore cover (i) data modeling issues; providing an opportunity to evaluate the list of candidate JSON encodings and exploring model semantics, (ii) applications which leverage JSON serialization for EO data, and (iii) use cases which could explore and possibly benefit from use of JSON serializations for improving application performance.
    The first session will offer four 20 mins presentations with the second workshop offering a hands on investigation into then extension of existing systems’ ability to return richer JSON encodings.

    Agenda: 
    9:30am-9:50am - NCO-JSON, Charlie Zender, UCI <zender@uci.edu>10:00am-10:20am - CoverageJSON, Jon Blower, The Institute for Environmental Analytics <j.blower@the-iea.org>10:30am-10:50am - HDF5-JSON, Aleksandar Jelenak, HDF Group <ajelenak@hdfgroup.org>



    Thursday July 19, 2018 9:30am - 11:00am
    Ventana

    11:00am

    Networking Break
    Thursday July 19, 2018 11:00am - 11:30am
    Foyer 880 E 2nd St, Tucson, AZ 85719

    11:30am

    Earth Science Data Uncertainty – White Paper Development – Working Session
    During the 2017 ESIP Summer Meeting, the Information Quality Cluster (IQC) sponsored a plenary panel session and a breakout session focused on Earth science data uncertainty. Expert panellists presented to the ESIP audience key aspects of scientific quality and addressed questions such as "How is uncertainty determined and characterized in the products of their research or application? What are the major side effects and limitations of common statistical techniques used to quantify and characterize uncertainty? What is the impact of uncertainty on the quality of their data products? How is data uncertainty accounted for when multiple sources of data are spliced and woven into a single product? How do they document and convey the information about uncertainty to other scientific users? What is the best way of conveying uncertainty to the (possibly skeptical) public?" Following considerable discussion during the breakout session, a one of the key action items recommended was that a clear understanding of the concept of uncertainty, and its communication to users was essential, and that the IQC should develop a white paper to satisfy this objective. During the 2017 ESIP Winter Meeting, the IQC held a session titled "Formulation of a White Paper on Earth Science Data Uncertainty" where further presentations by experts was held about the mathematical basis for uncertainty as well as uncertainty from the point of view of scientific data producers and applications' users, followed by subgroup discussions for formulating the white paper. An outline of the white paper has been developed and reviewed by IQC members and a number of individuals have signed up to be co-authors and/or reviewers.

    This IQC session represents a follow-on to the preceding ESIP 2018 Summer Meeting IQC breakout session which intends to provide an opportunity to acquire a final set considerations and recommendations from additional domain experts toward the development of the white paper. This purpose of this working session is to finalize the white paper outline and establish the final points and issues that should be addressed by the paper. This session will also function as a “last call” for any additional requests for co-authorship or reviewers for the proposed paper.


    Speakers & Moderators
    DM

    David Moroni

    Data Stewardship and User Services Team Lead, Jet Propulsion Laboratory, Physical Oceanography Distributed Active Archive Center
    I am a Senior Science Data Systems Engineer at the Jet Propulsion Laboratory and Data Stewardship and User Services Team Lead for the PO.DAAC Project, which provides users with data stewardship services including discovery, access, sub-setting, visualization, extraction, documentation... Read More →
    avatar for Ge Peng

    Ge Peng

    Research Scholar, CICS-NC/NCEI
    Dataset-centric scientific data stewardship, data quality management
    avatar for Hampapuram Ramapriyan

    Hampapuram Ramapriyan

    Research Scientist/SME, Science Systems and Applications, Inc.
    Information Quality, Data Stewardship, Provenance, Preservation Standards


    Thursday July 19, 2018 11:30am - 1:00pm
    Canyon A

    11:30am

    Natural history museum informatics: new methods, old data
    Natural history museums and related databases (e.g. the Paleobiology database, GBIF, Neotoma, Pangea) house a wealth of rich data about our planet and its biological and geological history. Next generation, "Big Data" approaches (e.g. machine learning, network analysis, natural language processing) offer exciting new ways of analyzing this data (paleoinformatics! Network paleoecology! etc). However, museum data collections must be well curated and accessible in order to make this work possible; and informaticians using these data must be aware of public datasets’ potential limitations in their work.

    In this session, we'll present recent work at the intersection of new computational methods and old data, and discuss the infrastructures, algorithms, curatorial workflows and other considerations needed for this work. We mean "old data" in both the geologic sense, and in the "this has been on a museum shelf for 100 years" sense, and anticipate that this session will be of interest to folks in paleontology, natural history museum collections management, data curation, and more.

    We intend this to be an interactive session, aimed at fostering discussion and building community around natural history informatics projects. What projects are you working on that make use of data derived from museum specimens or physical samples? What new approaches are you using? What new methods would you like to use?

    Tentative schedule:
    Short talks
    Introduction to and motivations for this session - Andrea Thomer
    Quantifying ecological impacts of mass extinctions with network analysis of fossil communities - A.D. Muscante
    Work on 3d fossil scans - Gary Motz
    Physical samples and schema.org - Doug Fils

    Discussion - convened by Peter Fox and Andrea Thomer

    Notes doc: http://bit.ly/2L2O9Si

    Speakers & Moderators
    avatar for Gary Motz

    Gary Motz

    Chief Information Officer, Assistant Director for Information Services, Indiana University | Indiana Geological and Water Survey
    Gary is an earth scientist and curator of data, metadata, and natural history collections. In his role at the Indiana Geological and Water survey, he oversees the information services division which provides cartographic, information technology, cyberinfrastructure, data science... Read More →
    avatar for Andrea Thomer

    Andrea Thomer

    Assistant Professor, University of Michigan, School of Information
    I'm an information scientist interested in biodiversity and earth science informatics, natural history museum data, data curation, information organization, and computer-supported cooperative work! I'm looking for students!


    Thursday July 19, 2018 11:30am - 1:00pm
    Canyon B

    11:30am

    EarthCube CDF General Assembly Meeting
    The Council of Data Facilities (CDF) is committed to working with relevant agencies, professional associations, initiatives, and other complementary efforts to enable transformational science, innovative education, and informed public policy through increased coordination, collaboration, and innovation in the acquisition, curation, preservation, and dissemination of geoscience data, tools, models, and services. Existing and emerging geoscience data facilities – through the Council – are committed to serving as an effective foundation for EarthCube. The General Assembly meeting is open to the official representatives from all member data facilities, additional member organization personnel as desired by the members, as well as observers. Agenda 9:30am Intro and sign-in of CDF members and guests {Tim Ahern, Lynne Schreiber) 10:00am P418/19 and the CDF (Mohan Ramamurthy) 10:30am Infrastructure sharing for CDF (Kerstin/Tim Ahern) 11:00-11:20am Break 11:20am Proposed Changes in CDF Charter, Formal Vote on Charter Changes by Active members (Tim Ahern) Election of new CDF Exec, Introduction of Slate of Candidates (Lindsey Powers, Tim Ahern) 12:10pm ROI metrics (Corinna Gries) 12:40pm ORCiD (Eric Olson, ORCiD) 13:10pm Election Results (Lynne Schreiber) 13:15pm End of Meeting

    Speakers & Moderators
    avatar for Eric Olson

    Eric Olson

    Engagement Lead, North America., ORCID
    Eric supports ORCID members as they develop new and existing integrations and workflows. Before joining ORCID, Eric worked on the PressForward publishing software at the Roy Rosenzweig Center for History and New Media, where he recruited and trained research organizations to utilize... Read More →


    Thursday July 19, 2018 11:30am - 1:00pm
    Canyon C

    11:30am

    Metadata Tools: Talks and Discussion – Working focus on tools for publishing, curation, evaluation, and other metadata tasks from around the ESIP community
    Whether we like it or not, metadata is an essential component of the entire data life cycle. This session will highlight solutions from various organizations and groups that are developing and integrating metadata curation, publishing, and evaluation tools. The lightning-style talks will convey:

    - Purpose of the Tool
    - Strengths of the Tool
    - Weaknesses of the Tool
    - Planned Improvements
    - Dreams of New Capabilities

    Following the talks, there will be a collaborative discussion. Slido will be used to capture the responses and attendees will be able to vote on what topics they want to discuss. Please bring your metadata tools questions and ideas to the session!

    Scheduled Talks:
    1. NASA: Metadata Curation Dashboard - Jeanne le Roux
    2. NOAA/NCEI: Collection Metadata Editing Tool (CoMET) - John Relph
    3. NOAA/NCEI: Completeness Rubrics, templates, GCMD Keyword Guidance and GCMD Checker and/or OneStop Discovery and Access UI - Anna Milan
    4. USGS: ADIwg Open Source Metadata Toolkit - Dennis Walworth
    5. CEDAR: Metadata Workbench - John Graybeal
    6. DataOne: DataONE framework - Amber Budden, Dave Vieglais and Matt Jones
    7. CINERGI/DDH - Steve Richard


    Speakers & Moderators
    AB

    Amber Budden

    Director for Community Engagement and Outreach, DataONE
    avatar for John Graybeal

    John Graybeal

    Technical Program Manager, CEDAR and BioPortal, Stanford University
    Metadata, semantics, and cool repositories for metadata and semantics. | Cool Earth Science (or biomedical) projects that will change the world. | Or at least, change the way we manage metadata about the world.
    avatar for Matt Jones

    Matt Jones

    Director of Informatics, UC Santa Barbara
    Data Federation | Open Science | Provenance and Semantics
    avatar for Anna Milan

    Anna Milan

    NOAA National Centers for Environmental Information (NCEI)
    ~*~Metadata Adds Meaning~*~
    avatar for John Relph

    John Relph

    Disruptor, NESDIS/NCEI
    OneStop, Metadata, Archival, Automation, Data Management, Canaan Dogs
    JL

    Jeanne le Roux

    Research Associate, DSIG/ ESSC
    avatar for Tyler Stevens

    Tyler Stevens

    CMR Metadata Quality Team, NASA EED-2 / SGT
    DV

    Dave Vieglais

    University of Kansas / DataONE



    Thursday July 19, 2018 11:30am - 1:00pm
    Madera

    11:30am

    Interactive Data Analysis on Cloud Environment
    From hype to real world applications, cloud computing is proven as the platform to tackle our big data challenges. With the elasticity of the cloud, it is possible for us to develop solutions for our community to analysis and interact large collection of data where the actual computing is performed on the cloud next to the data. This session welcomes speakers to discuss and demonstrate their cloud-based solution for interactive science analysis. Invited speakers: Emily Law/JPL - JPL's Trek technology for interactive exploration of planetary and earth data; Rich Signell/USGS - Interactive, data-proximate analysis of earth system model data on the Cloud; Sudhir Shrestha/ESRI - Improved decision support system with Real time flood inundation forecast Applications; Joe Jacob/JPL - OceanWorks: Ocean Science Data Analytics using Apache Science Data Analytics Platform

    Speakers & Moderators
    avatar for Thomas Huang

    Thomas Huang

    Technical Group Supervisor, JPL
    avatar for Rich Signell

    Rich Signell

    Oceanographer, USGS
    Ocean Modeling, Python, NetCDF, THREDDS, ERDDAP, UGRID, SGRID, CF-Conventions, Jupyter, JupyterHub, CSW, TerriaJS


    Thursday July 19, 2018 11:30am - 1:00pm
    Pima
    • Subject Jump In
    • Remote Participation Link https://global.gotomeeting.com/join/752150301
    • Remote Participation Access Code 752-150-301
    • Remote Participation Phone # (646) 749-3129 More phone numbers Australia: +61 2 9087 3604 Austria: +43 7 2081 5427 Belgium: +32 28 93 7018 Canada: +1 (647) 497-9391 Denmark: +45 32 72 03 82 Finland: +358 942 72 1060 France: +33 170 950 594 Germany: +49 692 5736 7317 Ireland: +353 16 572 651 Italy: +39 0 247 92 13 01 Netherlands: +31 202 251 017 New Zealand: +64 9 280 6302 Norway: +47 21 93 37 51 Spain: +34 932 75 2004 Sweden: +46 853 527 827 Switzerland: +41 435 5015 61 United Kingdom: +44 20 3713 5028
    • Tags Cloud Computing, Data Analytics, Science Software

    11:30am

    Machine Learning Working Session
    Machine Learning engagement activities to increase the connectivity among data providers, Earth scientists, machine learning practicioners and computer service providers

    Speakers & Moderators

    Thursday July 19, 2018 11:30am - 1:00pm
    Sabino

    11:30am

    JSON Encodings for Spatial Data: Applications and Use Cases
    A number of JSON serializations exist for representing Earth Observation data however further work needs to be undertaken to align efforts, reduce overlap and expand/evangelize usage and software implementations. Some examples include TopoJSON, JSON-LD, hdf5-json, GeoJSON, CovJSON, CF-JSON, NCO-JSON, STAR JSON and there are several others.
    Due to initiatives such as the W3C + OGC Spatial Data on the Web Working Group (which resulted in a significant advancement of the CovJSON standard) this issue is gathering so much interest that NASA has recently initiated a dedicated Earth Science Data Systems Working Group to investigate, evaluate and provide a formal NASA recommendation for use of JSON Encodings for Spatial Data.
    HDF5/JSON preserves the data and metadata of any HDF5 dataset through a round-trip encoding (i.e., HDF5 -> JSON -> HDF5). Thus HDF5/JSON is automatically 100% lossless. NCO-JSON is a compa-rable turnkey solution that serves for netCDF a similar role as HDF5/JSON for HDF5. Although netCDF can be implemented as a subset of HDF5, the two APIs and their vocabularies are so differ-ent that using or extending HDF5/JSON to represent netCDF files would be unnecessarily complex. CF-JSON, ERDDAP, and STAR JSON all implement the NCO-JSON dialect, designed to represent any data stored in netCDF format. Differences include that CF-JSON is designed to extend to higher-level CF constructs while STAR JSON includes a library for potentially faster conversion to and from JSON. NCO-JSON also provides lossy options to reduce JSON verbosity and size and increase legibil-ity.
    Critical production-grade software infrastructure such as OPeNDAP also provides several mecha-nisms for serializing JSON data retrievals. Very recently, OPeNDAP developers have provided the ability to retrieve CovJSON responses from OPeNDAP queries. We anticipate that this functionality will enable a new generation of high performance applications.

    Some stakeholders and interested parties in this area include:
    · Web application developers tasked with designing and developing applications which con-sume EO spatial data.
    · Parties interested in serving and consuming Spatial data on the Web.
    · Developers at data centers who currently distribute/expose endpoints/resources which serve spatial data
    Two back-to-back sessions will therefore cover (i) data modeling issues; providing an opportunity to evaluate the list of candidate JSON encodings and exploring model semantics, (ii) applications which leverage JSON serialization for EO data, and (iii) use cases which could explore and possibly benefit from use of JSON serializations for improving application performance.
    The first session will offer four 20 mins presentations with the second workshop offering a hands on investigation into then extension of existing systems’ ability to return richer JSON encodings.

    Speakers & Moderators
    JG

    James Gallagher

    President, OPeNDAP
    avatar for David LeBauer

    David LeBauer

    University of Arizona
    avatar for Bob Simons

    Bob Simons

    IT Specialist, NMFS SWFSC ERD
    I work on ERDDAP, a free and open source data server that gives you a simple, consistent way to download subsets of gridded and tabular scientific datasets in common file formats and make graphs and maps. ERDDAP has been installed and used by more than 70 organizations around the... Read More →


    Thursday July 19, 2018 11:30am - 1:00pm
    Ventana

    1:00pm

    Lunch | Unconference Session Pitch-it
    Are there topics that you want to discuss but not a session to do it in? Was there a session that ended before you were through hashing out the details? This is the opportunity.

    Unconference Schedule: https://docs.google.com/presentation/d/1ZWR-_KCdCvambawddc67_btnJaRUmm-bapqyYQYvdpM/edit?usp=sharing


     

    Speakers & Moderators
    avatar for Annie Burgess

    Annie Burgess

    ESIP Lab Director, ESIP


    Thursday July 19, 2018 1:00pm - 2:10pm
    Grand Canyon Ballroom

    2:15pm

    Unconference
    Thursday July 19, 2018 2:15pm - 3:00pm
    Grand Canyon Ballroom

    3:15pm

    Unconference
    Thursday July 19, 2018 3:15pm - 4:00pm
    Grand Canyon Ballroom

    4:15pm

    Unconference
    Thursday July 19, 2018 4:15pm - 5:00pm
    Grand Canyon Ballroom

    5:00pm

    6:30pm

    FUNding Friday Networking + Poster Making
    Join us at Gentle Ben's to make a poster or find a team + make a poster for FUNding Friday (FF).

    FUNding Friday is an annual mini-grant competition associated with ESIP’s Summer Meeting. The mini-grants are available to ESIP members ($5000) and to students and Education Committee workshop participants ($3000), with total number of awards specified annually and generally 2-4 awards per participant group.
    Interested participants must exhibit a poster describing the project during the Poster Pitch session (Friday morning, check the Summer Meeting schedule for specific time and place). The poster should be hung in the provided space before the pitch session begins.
    The poster size is limited to 25 by 30 inches. It can be hand-drawn; materials for the posters are provided to interested participants during the FF Poster event Thursday night. 


    Speakers & Moderators
    avatar for Annie Burgess

    Annie Burgess

    Lab Director, ESIP
    Annie a postdoctoral fellow in the Computer Science Department at the University of Southern California and Project Assistant at NASA/JPL. She has a PhD in Geography with a focus on satellite remote sensing of snow and ice. Annie is an ASF member, Apache Tika PMC committer, and advocate... Read More →
    avatar for Annie Burgess

    Annie Burgess

    ESIP Lab Director, ESIP
    avatar for Steve Diggs

    Steve Diggs

    Technical Director, CCHDO, Scripps Institution of Oceanography / UCSD


     
    Friday, July 20
     

    7:30am

    Registration Open
    Friday July 20, 2018 7:30am - 8:00am
    Foyer 880 E 2nd St, Tucson, AZ 85719

    8:30am

    FUNding Friday Pitch-it

    Speakers & Moderators
    avatar for Annie Burgess

    Annie Burgess

    Lab Director, ESIP
    Annie a postdoctoral fellow in the Computer Science Department at the University of Southern California and Project Assistant at NASA/JPL. She has a PhD in Geography with a focus on satellite remote sensing of snow and ice. Annie is an ASF member, Apache Tika PMC committer, and advocate... Read More →
    avatar for Steve Diggs

    Steve Diggs

    Technical Director, CCHDO, Scripps Institution of Oceanography / UCSD


    Friday July 20, 2018 8:30am - 9:15am
    Grand Canyon Ballroom

    9:30am

    Science Gateways in the Cloud, a Platform for Providing Modern Scientific Workflows for Reproducible Research and Collaboration
    The advent and maturity of cloud computing technologies and tools have opened new avenues for addressing both Big Data and Open Science challenges to accelerate scientific discoveries. There is broad consensus that as data volumes grow rapidly, it is particularly important to reduce data movement and bring processing and computations to the data. Data providers also need to give scientists an ecosystem that includes data, tools, workflows and other end-to-end applications and services needed to perform analysis, integration, interpretation, and synthesis - all in the same environment or platform. Instead of moving data to processing systems near users, as is the tradition, one will need to bring processing, computing, analysis and visualization to data – so called data-proximate workbench capabilities, also known as server-side processing.

    Cloud-based Science Gateways, through online portals and user-friendly interfaces, provide access to a range of resources that are of interest to a community of researchers, educators, and students, including datasets, tools, services, and workspaces. These offerings permit researchers to access a suite of capabilities to not only achieve reproducible science in a web-based workspace but also provide a platform for collaboration and conducting team science. In this session, speakers will present on-going efforts to develop cloud-based Science Gateways to facilitate end-to-end scientific workflows for communities of researchers, educators, and students in the geosciences.

    Google Drive for this Session

    Nancy Wilkins-Diehr, San Diego Supercomputing Center (15 minutes)
    Title: Science Gateways Overview

    Mohan Ramamurthy, Unidata (15 minutes)
    Title: Unidata Science Gateway

    Eric Lingerfelt, EarthCube Science Support Office (15 minutes)
    Title: Towards an Earthcube Science Gateway

    Julien Chastang, Unidata (15 minutes)
    Title: Unidata Science Gateway JupyterHub

    Discussion (with whatever time remains)

    Speakers & Moderators
    avatar for Julien Chastang

    Julien Chastang

    Software Engineer, UCAR - Unidata
    Scientific software developer at UCAR-Unidata.
    avatar for Eric Lingerfelt

    Eric Lingerfelt

    Technical Officer, EarthCube
    Eric Lingerfelt is the EarthCube Science Support Office Technical officer and comes to ESSO from Oak Ridge National Laboratory in Oak Ridge, Tennessee, where he specialized in the design, development, and deployment of full stack application systems in support of multiple areas of... Read More →
    avatar for Nancy Wilkins-Diehr

    Nancy Wilkins-Diehr

    Associate Director, San Diego Supercomputer Center
    Science gateways and running



    Friday July 20, 2018 9:30am - 11:00am
    Canyon A

    9:30am

    Enhancing the Robustness of Data - Information Quality and Usability Principles Can Help
    The session will start with an introduction to key areas of data quality, how data quality measures have been applied to Earth Science data, and why it's important. The discussion will continue with how usability can provide utility to various applications and disciplines. These two presentations will lead to an overview of a web application for time series data quality control.

    After these respective overviews, a more specific demo will be provided for the quality control software, including an explanation for how it works and how it represents a synthesis of information quality and usability. During this demo, audience members will participate in usability testing and provide live feedback about how intuitive the application's various interfaces are. In the end, participants will obtain a clearer understanding of the importance of information quality and usability, and how the related principles can help quality control software to be useful to its target community.

    Speakers & Moderators
    SH

    Sophie Hou

    Data Curation and Stewardship Coordinator, National Center for Atmospheric Research
    data management/curation/stewardship: including but not limited to data life cycle, policies, sustainability, education and training, data quality, usability.
    DM

    David Moroni

    Data Stewardship and User Services Team Lead, Jet Propulsion Laboratory, Physical Oceanography Distributed Active Archive Center
    I am a Senior Science Data Systems Engineer at the Jet Propulsion Laboratory and Data Stewardship and User Services Team Lead for the PO.DAAC Project, which provides users with data stewardship services including discovery, access, sub-setting, visualization, extraction, documentation... Read More →
    avatar for Ge Peng

    Ge Peng

    Research Scholar, CICS-NC/NCEI
    Dataset-centric scientific data stewardship, data quality management
    avatar for Hampapuram Ramapriyan

    Hampapuram Ramapriyan

    Research Scientist/SME, Science Systems and Applications, Inc.
    Information Quality, Data Stewardship, Provenance, Preservation Standards


    Friday July 20, 2018 9:30am - 11:00am
    Canyon B

    9:30am

    Enabling FAIR Data - Project Status
    The Enabling FAIR Data project will be moving quickly towards implementation. This session will provide the current status of the Commitment Statement and other project activities and outcome. This is a community-driven effort convened by the AGU. The purpose is to make scientific data open and FAIR across the Earth, space, and environmental science community. The FAIR Guiding Principles promote that data (including software and other research products) are Findable, Accessible, Interoperable, and Reusable.

    This project provides common guidelines, recommendations, and policies for all journals and repositories in support of data (and other digital research products) being submitted and preserved in repositories with proper citation in the scholarly paper. This will allow data to be discoverable as a digital research product and not embedded in a supplement with inadequate metadata.

    Speakers & Moderators
    avatar for Denise Hills

    Denise Hills

    Director, Energy Investigations, Geological Survey of Alabama
    Long tail data, data preservation, connecting physical samples to digital information, geoscience policy, science communication
    avatar for Nancy Hoebelheinrich

    Nancy Hoebelheinrich

    Principal, Knowledge Motifs LLC
    See my LinkedIn profile at: https://www.linkedin.com/in/nancy-hoebelheinrich-0576ba3
    avatar for Shelley Stall

    Shelley Stall

    Senior Director, Data Leadership, American Geophysical Union
    Shelley Stall is the Senior Director of Data Leadership at the American Geophysical Union. Shelley has more than two decades of experience working in high-volume, complex data management environments. She has helped organizations in not-for-profit, commercial, defense, and federal... Read More →


    Friday July 20, 2018 9:30am - 11:00am
    Madera

    9:30am

    Preparing Three Dimensional Data for Virtual and Augmented Reality
    Many scientists and researchers have been inspired by the VR and AR demos that they have seen at ESIP, AGU and elsewhere. A common question that surfaces is, “how do I get my data into VR?” In this session, a 3D data expert will share information about how to prepare data for immersive data visualization and ideas about automating the ingest of data into immersive visualization platforms.



    Speakers & Moderators
    avatar for Nicholas Hedley

    Nicholas Hedley

    Professor / Director - Spatial Interface Research Lab, Simon Fraser University
    I design, build, and deploy 3D interfaces for spatial applications. My primary technologies are: augmented reality, mixed reality, virtual reality, augmented virtuality. I am passionate about interfaces that deliver powerful experiences, and support elegant high-bandwidth interactions... Read More →
    avatar for Shayna Skolnik

    Shayna Skolnik

    Co-founder / CEO, Navteca
    Virtual reality, data visualization, science storytelling in VR, cloud computing, entrepreneurship, NASA ESTO Discover AQ project, | creativity + technology = awesome


    Friday July 20, 2018 9:30am - 11:00am
    Pima
    • Subject Jump In
    • Remote Participation Link https://global.gotomeeting.com/join/752150301
    • Remote Participation Access Code 752-150-301
    • Remote Participation Phone # (646) 749-3129 More phone numbers Australia: +61 2 9087 3604 Austria: +43 7 2081 5427 Belgium: +32 28 93 7018 Canada: +1 (647) 497-9391 Denmark: +45 32 72 03 82 Finland: +358 942 72 1060 France: +33 170 950 594 Germany: +49 692 5736 7317 Ireland: +353 16 572 651 Italy: +39 0 247 92 13 01 Netherlands: +31 202 251 017 New Zealand: +64 9 280 6302 Norway: +47 21 93 37 51 Spain: +34 932 75 2004 Sweden: +46 853 527 827 Switzerland: +41 435 5015 61 United Kingdom: +44 20 3713 5028
    • Tags Cloud Computing, Discovery, Education, Information Quality, Science Communication, Science Software, VR/AR

    9:30am

    Metadata Times, They Are Changing - New Capabilities and Applications
    We will cover new developments in metadata standards from a variety of communities: ISO, DataCite, DataOne, NASA.

    Speakers & Moderators
    avatar for Ted Habermann

    Ted Habermann

    Metadata 2020
    I am interested in all facets of metadata needed to discover, access, use, and understand data of any kind. Also evaluation and improvement of metadata collections, translation proofing. Ask me about the Metadata Game.
    avatar for Matt Jones

    Matt Jones

    Director of Informatics, UC Santa Barbara
    Data Federation | Open Science | Provenance and Semantics
    avatar for Tyler Stevens

    Tyler Stevens

    CMR Metadata Quality Team, NASA EED-2 / SGT



    Friday July 20, 2018 9:30am - 11:00am
    Sabino

    9:30am

    Community Ontology Repository Systems Administration Working Session

    Speakers & Moderators
    avatar for Annie Burgess

    Annie Burgess

    ESIP Lab Director, ESIP
    avatar for John Graybeal

    John Graybeal

    Technical Program Manager, CEDAR and BioPortal, Stanford University
    Metadata, semantics, and cool repositories for metadata and semantics. | Cool Earth Science (or biomedical) projects that will change the world. | Or at least, change the way we manage metadata about the world.
    BH

    Beth Huffer

    Lingua Logica


    Friday July 20, 2018 9:30am - 11:00am
    Ventana

    11:00am

    11:30am

    Data Model Cluster Meeting
    First meeting of the new Data Model cluster

    Speakers & Moderators
    avatar for Ethan Davis

    Ethan Davis

    UCAR Unidata


    Friday July 20, 2018 11:30am - 1:00pm
    Canyon B

    11:30am

    Data Stewardship Committee Meeting
    This session will present results from ongoing initiatives within the Data Stewardship Committee, and stimulate discussion on new project ideas. The agenda will include: 1) Discussion of the future update to the ESIP data citation guidelines (following from a Thursday unconference session), 2) Discussion of the "data risk factor categorization" project (following from a Tuesday working session), and 3) Discussion of the Data Stewardship Committee roadmap & goals

    Speakers & Moderators
    SH

    Sophie Hou

    Data Curation and Stewardship Coordinator, National Center for Atmospheric Research
    data management/curation/stewardship: including but not limited to data life cycle, policies, sustainability, education and training, data quality, usability.
    avatar for Matthew Mayernik

    Matthew Mayernik

    Project Scientist and Research Data Services Specialist, NCAR/UCAR Data Library
    Matt is a Project Scientist and Research Data Services Specialist in the NCAR/UCAR Library. His work is focused on research and service development related to research data curation. His research interests include metadata practices and standards, data curation education, data citation... Read More →


    Friday July 20, 2018 11:30am - 1:00pm
    Madera

    11:30am

    Information management code registry for earth and environmental sciences
    Earth and environmental scientists and data managers write significant amounts of code each year for large scale data manipulation. However, publishing and sharing this code is not a common practice and is hampered by the lack of thematic code registries that are designed to make code easily discoverable and reusable. In an exploratory session at last year’s ESIP summer meeting we discussed community practices, existing repositories, challenges, and recommendations for what might be termed ‘information management’ code publication, i.e., code developed to prepare data for a specific research question.

    This information management code registry has been implemented using OntoSoft (http://imcr.ontosoft.org/# ) and an initial hackathon conducted. It’s focus is on making code more discoverable that addresses common procedures that earth and environmental information managers encounter when organizing, cleaning, manipulating, documenting, and archiving data sets. It will live in the niche between scientific analysis code and short code snippets as found in Stack Overflow. It will be a community maintained resource containing everything from example information management code to programs with multiple functions that are generalized to be easily reused. In this session we aim to build a community and discuss best practices for publishing code, code metadata, and repository governance/maintenance. After a short introduction to the registry in OntoSoft and a report on lessons learned from the hackathon we will launch into discussion of best practices, governance and a wish-list of code priorities.

    Agenda (bit.ly/imcragenda).

    Session notes (bit.ly/imcrnotes).

    Speakers & Moderators
    avatar for Colin Smith

    Colin Smith

    Data manager, Environmental Data Initiative (EDI)
    I work on accelerating the archive and reuse of data in ecological science. My interests are in software development and data harmonization.


    Agenda docx

    Friday July 20, 2018 11:30am - 1:00pm
    Pima
    • Subject Skim the Surface, Jump In
    • Remote Participation Link https://global.gotomeeting.com/join/752150301
    • Remote Participation Access Code 752-150-301
    • Remote Participation Phone # (646) 749-3129 More phone numbers Australia: +61 2 9087 3604 Austria: +43 7 2081 5427 Belgium: +32 28 93 7018 Canada: +1 (647) 497-9391 Denmark: +45 32 72 03 82 Finland: +358 942 72 1060 France: +33 170 950 594 Germany: +49 692 5736 7317 Ireland: +353 16 572 651 Italy: +39 0 247 92 13 01 Netherlands: +31 202 251 017 New Zealand: +64 9 280 6302 Norway: +47 21 93 37 51 Spain: +34 932 75 2004 Sweden: +46 853 527 827 Switzerland: +41 435 5015 61 United Kingdom: +44 20 3713 5028
    • Tags Discovery, Documentation, Science Software, Sustainable Data Management

    11:30am

    HDF Townhall
    Data in HDF continues to play an important role for Earth Scientists in the U.S. and around the world. The HDF Group will update ESIP members on interesting projects that have come to fruition during the last year, including the TerraFusion project which brings the entire history of Terra as well as recent releases of HDF5. We will also demonstrate how HDF tools support HDF-EOS data from product design to production and standards compliance testing to user support.

    Suggestion to include:

    Potential benefits of shuffle
    Third-party compression filters

    Speakers & Moderators

    Friday July 20, 2018 11:30am - 1:00pm
    Sabino

    11:30am

    ESIP Semantic Technology Committee Business Meeting
    We will use this session to plan activities for the rest of the year, and to plan for the 3rd annual Geosemantics Symposium.

    https://github.com/ESIPFed/cor

    Speakers & Moderators

    Friday July 20, 2018 11:30am - 1:00pm
    Ventana

    1:00pm

    Lunch
    Friday July 20, 2018 1:00pm - 2:00pm
    Grand Canyon Ballroom