Skip to main content
NSF NEON, Operated by Battelle

Main navigation

  • About Us
    • Overview
      • Spatial and Temporal Design
      • History
    • Vision and Management
    • Advisory Groups
      • Science, Technology & Education Advisory Committee
      • Technical Working Groups (TWGs)
    • FAQ
    • Contact Us
      • Contact NEON Biorepository
      • Field Offices
    • User Accounts
    • Staff
    • Code of Conduct

    About Us

  • Data & Samples
    • Data Portal
      • Explore Data Products
      • Data Availability Charts
      • Spatial Data & Maps
      • Document Library
      • API & GraphQL
      • Prototype Data
      • External Lab Data Ingest (restricted)
    • Data Themes
      • Biogeochemistry
      • Ecohydrology
      • Land Cover and Processes
      • Organisms, Populations, and Communities
    • Samples & Specimens
      • Discover and Use NEON Samples
        • Sample Types
        • Sample Repositories
        • Sample Explorer
        • Megapit and Distributed Initial Characterization Soil Archives
      • Sample Processing
      • Sample Quality
      • Taxonomic Lists
    • Collection Methods
      • Protocols & Standardized Methods
      • Airborne Remote Sensing
        • Flight Box Design
        • Flight Schedules and Coverage
        • Daily Flight Reports
          • AOP Flight Report Sign Up
        • Camera
        • Imaging Spectrometer
        • Lidar
      • Automated Instruments
        • Site Level Sampling Design
        • Sensor Collection Frequency
        • Instrumented Collection Types
          • Meteorology
          • Phenocams
          • Soil Sensors
          • Ground Water
          • Surface Water
      • Observational Sampling
        • Site Level Sampling Design
        • Sampling Schedules
        • Observation Types
          • Aquatic Organisms
            • Aquatic Microbes
            • Fish
            • Macroinvertebrates & Zooplankton
            • Periphyton, Phytoplankton, and Aquatic Plants
          • Terrestrial Organisms
            • Birds
            • Ground Beetles
            • Mosquitoes
            • Small Mammals
            • Soil Microbes
            • Terrestrial Plants
            • Ticks
          • Hydrology & Geomorphology
            • Discharge
            • Geomorphology
          • Biogeochemistry
          • DNA Sequences
          • Pathogens
          • Sediments
          • Soils
            • Soil Descriptions
        • Optimizing the Observational Sampling Designs
    • Data Notifications
    • Data Guidelines and Policies
      • Acknowledging and Citing NEON
      • Publishing Research Outputs
      • Usage Policies
    • Data Management
      • Data Availability
      • Data Formats and Conventions
      • Data Processing
      • Data Quality
      • Data Product Bundles
      • Data Product Revisions and Releases
        • Release 2021
        • Release 2022
        • Release 2023
        • Release 2024
        • Release-2025
      • NEON and Google
      • Externally Hosted Data

    Data & Samples

  • Field Sites
    • About Field Sites and Domains
    • Explore Field Sites
    • Site Management Data Product

    Field Sites

  • Impact
    • Observatory Blog
    • Case Studies
    • Papers & Publications
    • Newsroom
      • NEON in the News
      • Newsletter Archive
      • Newsletter Sign Up

    Impact

  • Resources
    • Getting Started with NEON Data & Resources
    • Documents and Communication Resources
      • Papers & Publications
      • Document Library
      • Outreach Materials
    • Code Hub
      • Code Resources Guidelines
      • Code Resources Submission
      • NEON's GitHub Organization Homepage
    • Learning Hub
      • Science Videos
      • Tutorials
      • Workshops & Courses
      • Teaching Modules
    • Research Support Services
      • Field Site Coordination
      • Letters of Support
      • Mobile Deployment Platforms
      • Permits and Permissions
      • AOP Flight Campaigns
      • Research Support FAQs
      • Research Support Projects
    • Funding Opportunities

    Resources

  • Get Involved
    • Advisory Groups
      • Science, Technology & Education Advisory Committee
      • Technical Working Groups
    • Upcoming Events
    • NEON Ambassador Program
      • Exploring NEON-Derived Data Products Workshop Series
    • Research and Collaborations
      • Environmental Data Science Innovation and Inclusion Lab
      • Collaboration with DOE BER User Facilities and Programs
      • EFI-NEON Ecological Forecasting Challenge
      • NEON Great Lakes User Group
      • NEON Science Summit
      • NCAR-NEON-Community Collaborations
        • NCAR-NEON Community Steering Committee
    • Community Engagement
      • How Community Feedback Impacts NEON Operations
    • Science Seminars and Data Skills Webinars
      • Past Years
    • Work Opportunities
      • Careers
      • Seasonal Fieldwork
      • Internships
        • Intern Alumni
    • Partners

    Get Involved

  • My Account
  • Search

Search

Impact

  • Observatory Blog
  • Case Studies
  • Papers & Publications
  • Newsroom

Breadcrumb

  1. Impact
  2. Observatory Blog
  3. Answer big ecological questions using big data skills

Answer big ecological questions using big data skills

May 13, 2015

NEON workshop

Interested in learning how to work with big data in R?

Sign up for A Hands-On Primer for Working with Big Data in R: Introduction to Hierarchical Data Formats, LiDAR Data & Efficient Data Visualization on Sunday August 9, 2015 at the ESA Centennial Annual Meeting in Baltimore, MD. 

Big ecological questions require big data

To ask and answer ecological questions about changes in diverse environments over large areas and long periods of time requires big data. Big data generally refer to datasets so large and complex that traditional processing applications are inadequate; however, big data present unique challenges and opportunities to the scientific research community. For example, to effectively determine the effects of development on stream water quality across a region, standardized and integrated data are needed to characterize land cover and population changes, among other things.

“Big data generally refer to massive volumes of data not readily handled by the usual data tools and practices and present unprecedented opportunities for advancing science and informing resource management through data-intensive approaches.” -Hampton et al. (2013) Big data and the future of ecology. 

Working with big data require specific skills

Working with big data in an efficient way requires a set of skills that are new to many scientists. Data formats designed to handle larger datasets, such as the hierarchical data format (HDF5):

  • Provide more efficient ways to store large datasets that might contain thousands to millions of records or hypercubes of images;
  • Allow users to store multi-dimensional and heterogeneous datasets needed to answer cross-cutting ecological questions; and
  • Provide tools to compress and/or parse data for analysis.

Data with spatial attributes

While big data formats maximize data analysis efficiency, using them involves specific sets of skills and libraries for commonly used tools like R and Python. Other data types, like remote sensing data, which include lidar and hyperspectral imagery, are necessary for measuring changes in land cover and other attributes over broad areas and through time. Working with these data that have spatial attributes requires understanding of:

  • Unique spatial and hierarchical data formats;
  • Tools and libraries required to work with data - many of which are free and open source; and
  • Metadata associated with the data, to ensure that analysis outcomes are scaled and located properly.

Automated and reproducible workflows

Working with big data require automated and reproducible workflows. Crunching through thousands or even millions of data points by hand may take weeks, months or years; repeating this type of manual analysis is difficult and time-consuming. Developing automated workflows that process data using coding tools like R or Python facilitates efficient, reproducible workflows. In addition, many journals now require submission of both data and code prior to publication.

NEON Data Skills at ESA 2015

NEON scientists Leah Wasser, Natalie Robinson, Claire Lunch, Christine Laney, Kate Thibault and Sarah Elmendorf have been building, testing, delivering and improving upon a suite of data tutorials that cover big data topics including:

  • Working with time series and spatial data stored in the HDF5 format in R
  • Learn about the HDF5 file format using a free HDF5 viewer
  • Learn key commands and libraries needed to create and work with HDF5 files in R
  • Visualization of time series data stored in HDF5 format in R
  • Working with LiDAR-derived raster data in R
  • Working with hyperspectral imagery in R

NEON is delivering this content in collaboration with SESYNC and Data Carpentry as a full-day pre-conference workshop at the 2015 ESA Annual Meeting. Learn about the half-day workshop at ESA 2014 that prompted 2015’s full-day workshop.

NEON scientists are also hosting a free lunchtime Going 'On the Grid' Spatial Data workshop on Thursday August 13, 2015. This workshop covers issues of uncertainty when converting vector point data to raster or gridded formats. As scientists, many of the observation data we work with are for specific point locations on the ground. However, we often want to interpolate our observations continuously across larger areas, a process sometimes called “gridding”. NEON will lead a discussion and live demonstration that explains how different gridding methods can yield different results in the output rasters--and more importantly, how that might impact the results of your data analysis.

 

Share

Related Posts:

A Chance for Fire Research and Recovery at a NEON Desert Site

January 24, 2025

Burned cactus

New versions of neonUtilities in both R and Python

December 18, 2024

G7 Summit: NEON/Battelle Elevate U.S. Ecology and Infrastructure

November 25, 2024

U.S. Attendees to G7 LRI conference
NSF NEON, Operated by Battelle

Follow Us:

Join Our Newsletter

Get updates on events, opportunities, and how NEON is being used today.

Subscribe Now

Footer

  • About Us
  • Newsroom
  • Contact Us
  • Terms & Conditions
  • Careers
  • Code of Conduct

Copyright © Battelle, 2025

The National Ecological Observatory Network is a major facility fully funded by the U.S. National Science Foundation.

Any opinions, findings and conclusions or recommendations expressed in this material do not necessarily reflect the views of the U.S. National Science Foundation.