The National Ecological Observatory Network (NEON) is hosting a 3-day lesson-building hackathon to develop a suite of NEON/ Data Carpentry data tutorials and corresponding assessment instruments. The tutorials and assessment instruments will be used to teach fundamental big data skills needed to work efficiently with large spatio-temporal data using open tools, such as R, Python and postgres SQL.
A hackathon to develop spatio-temporal data analysis tutorials
The goal of the hackathon is to develop lessons that teach methods for spatio-temporal data analysis. The group will identify teaching needs and address technological gaps by building tutorials and supporting materials. Materials developed during this hackathon will be openly licensed under Creative Commons and may be taught in multi-day NEON/ Data Carpentry workshops, adapted for classroom use and repurposed for self-paced online tutorials. Topics to be developed may include:
- Automating big data workflows to increase efficiency
- Use of data formats such as HDF5 that support large, heterogeneous data
- Working with metadata to support automated workflows
- Key tools, libraries and concepts needed to automate NEON data and spatial data workflows in platforms like Rstats and python
- Tools and methods for big data visualization
Assembling scientists, educators and developers
The NEON hackathon will assemble a diverse and interdisciplinary group of scientists, educators and developers with various levels of data analysis, visualization and management experience and expertise, including applicants with expertise in bioinformatics. The organizing committee includes:
- Leah Wasser, NEON
- Tracy Teal, Data Carpentry
- Mike Smorul, SESYNC
- Jason Williams, iPlant
The advantage of big data skills for science
NEON is a continental-scale observatory network that provides standardized, integrated, high-quality ecological data to support ecological understanding and forecasting. However, working with spatio-temporal NEON data requires a unique set of skills that are not typically taught in science curricula. Advantages of data skill training include:
- Learning to work with spatio-temporal datasets that contribute to large-scale ecological research and understanding
- Efficient and automated research workflows
- Improved accuracy of analysis conclusions
Limited funding is available for selected applicants.
We hope to attract participants with a range of expertise in the areas of lesson development, especially related to data informatics, management and visualization. Some experience with developing lessons related to teaching data informatics, analysis and management is required to attend this event. Women and underrepresented minorities are especially encouraged to apply.
This hackathon resulted in four lesson series that are used for workshops provided by NEON and by Data Carpentry. The lessons are hosted on both the NEON Data Skills site and the Data Carpentry site. .
NEON Data Skills Tutorials
- Introduction to Working With Spatio-Temporal Data and Data Management
- Introduction to Working with Vector Data in R
- Introduction to Working with Raster Data in R
- Introduction to Working with Time Series Data in Text format in R
For the overview of the Data Carpentry Geospatial Data Workshop click here.
The hackathon also resulted in several lesson outlines that have not yet been completed. These are in rough draft form and contributions are welcomed.
The Spatio-Temporal Data
The data used for the lesson building includes NEON and other complementary data.