Dataset Overview | National Centers for Environmental Information (NCEI)

Metagenomic, metatranscriptomics and 16S rRNA gene sequence data from diel sampling at Groves Creek Marsh, Skidaway Island, GA during July 2014 (NCEI Accession 0278832)

This dataset contains chemical and physical data collected at Groves Creek - SkIO during deployment Groves_Creek_2013-2015 from 2014-07-16 to 2014-07-17. These data include Total Dissolved Nitrogen, density, depth, dissolved organic Carbon, salinity calculated from CTD primary sensors, and water temperature. The instruments used to collect these data include Liquid Scintillation Counter, Niskin bottle, Nutrient Autoanalyzer, Shimadzu TOC-V Analyzer, Spectrophotometer, YSI Professional Plus Multi-Parameter Probe, and YSI Sonde 6-Series. These data were collected by Alison Buchan of University of Tennessee as part of the "Collaborative Research: Marine priming effect - molecular mechanisms for the biomineralization of terrigenous dissolved organic matter in the ocean (Marine priming effect)" project. The Biological and Chemical Oceanography Data Management Office (BCO-DMO) submitted these data to NCEI on 2019-03-19.

The following is the text of the dataset description provided by BCO-DMO:

Metagenomic, metatranscriptomics and 16S rRNA gene sequence data from Groves Creek diel sampling

Dataset Description:
Groves Creek Marsh (31.972° N, 81.028° W), a temperate salt marsh fringing Skidaway Island, GA, served as the field site for this study. During July 16-17, 2014, samples were collected every two hours and four minutes to sample evenly across two tidal cycles and one diurnal cycle.
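
The choice of a 2 h 4 min interval can be checked with a little arithmetic. The sketch below is illustrative only: the start time and the number of intervals (12) are assumptions, since the description does not state them, and the 12.42 h tidal period assumes a tide dominated by the principal lunar semidiurnal (M2) constituent.

```python
from datetime import datetime, timedelta

# Interval stated in the dataset description.
SAMPLING_INTERVAL = timedelta(hours=2, minutes=4)
# Assumption: M2-dominated semidiurnal tide with a period of about 12.42 h.
M2_TIDAL_PERIOD_HOURS = 12.42


def sampling_schedule(start, n_intervals):
    """Return n_intervals + 1 timestamps spaced SAMPLING_INTERVAL apart."""
    return [start + i * SAMPLING_INTERVAL for i in range(n_intervals + 1)]


# Hypothetical start time; the actual time of the first sample is not given here.
start = datetime(2014, 7, 16, 6, 0)
schedule = sampling_schedule(start, n_intervals=12)

# Total time covered by the schedule, in hours.
span_hours = (schedule[-1] - schedule[0]).total_seconds() / 3600
two_tidal_cycles_hours = 2 * M2_TIDAL_PERIOD_HOURS

print(f"{len(schedule)} samples spanning {span_hours:.2f} h")
print(f"two M2 tidal cycles: {two_tidal_cycles_hours:.2f} h")
```

Under these assumptions, twelve intervals cover 24.8 h, which is within a few minutes of two M2 tidal cycles (about 24.84 h) and slightly longer than one 24 h diurnal cycle.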

This dataset contains the environmental observations related to the genetic data found at the Joint Genome Institute (JGI) at the following url: https://genome.jgi.doe.gov/portal/Geometdatstreams/Geometdatstreams.info.html?core=genome&query=Geometdatstreams&searchIn=Anything&searchType=Proposals&showAll=false&externallySequenced=true&sortBy=displayNameStr&showRestricted=true&showOnlyPublished=false&showSuperseded=true&sortOrder=asc&rawQuery=false&showFungalOnly=false&activateHighlights=false&programName=all&programYear=all&superkingdom=--any--&status=--any--&scientificProgram=--any--&productCategory=--any--&start=0&rows=50&page=1 (last visited: 2019-03-18)
  • Cite as: Buchan, Alison (2023). Metagenomic, metatranscriptomics and 16S rRNA gene sequence data from diel sampling at Groves Creek Marsh, Skidaway Island, GA during July 2014 (NCEI Accession 0278832). [indicate subset used]. NOAA National Centers for Environmental Information. Dataset. https://www.ncei.noaa.gov/archive/accession/0278832. Accessed [date].
gov.noaa.nodc:0278832
Download Data
  • HTTPS (download)
    Navigate directly to the URL for data access and direct download.
  • FTP (download)
    These data are available through the File Transfer Protocol (FTP). FTP is no longer supported by most internet browsers. You may copy and paste the FTP link to the data into an FTP client (e.g., FileZilla or WinSCP).
Distribution Formats
  • TSV
Ordering Instructions Contact NCEI for other distribution options and instructions.
Distributor NOAA National Centers for Environmental Information
+1-301-713-3277
NCEI.Info@noaa.gov
Dataset Point of Contact NOAA National Centers for Environmental Information
ncei.info@noaa.gov
Time Period 2014-07-16 to 2014-07-17
Spatial Bounding Box Coordinates
West: -81.028
East: -81.028
South: 31.972
North: 31.972
Spatial Coverage Map
General Documentation
Associated Resources
  • Biological, chemical, physical, biogeochemical, ecological, environmental and other data collected from around the world during historical and contemporary periods of biological and chemical oceanographic exploration and research managed and submitted by the Biological and Chemical Oceanography Data Management Office (BCO-DMO)
    • NCEI Collection
      Navigate directly to the URL for data access and direct download.
  • Buchan, A. (2019) Metagenomic, metatranscriptomics and 16S rRNA gene sequence data from diel sampling at Groves Creek Marsh, Skidaway Island, GA during July 2014. Biological and Chemical Oceanography Data Management Office (BCO-DMO). Dataset version 2019-03-19. https://doi.org/10.1575/1912/bco-dmo.762443.1
  • Parent ID (indicates this dataset is related to other data):
    • gov.noaa.nodc:BCO-DMO
Publication Dates
  • publication: 2023-05-27
Data Presentation Form Digital table - digital representation of facts or figures systematically displayed, especially in columns
Dataset Progress Status Complete - production of the data has been completed
Historical archive - data has been stored in an offline storage facility
Data Update Frequency As needed
Supplemental Information
Acquisition Description:
Sample collection
Surface water samples were collected from approximately 1 m depth using a hand-deployed Niskin bottle. Samples for dissolved constituents (dissolved organic carbon, colored dissolved organic matter, and nutrient analyses) were filtered on site through 0.2 μm Polycap filters within minutes of collection and then transported to the laboratory for further processing. For cell counts by flow cytometry, samples were transported to the laboratory and fixed using 25% glutaraldehyde. Samples for additional microbial analyses (microbial biomass collection for DNA and RNA extractions and bacterial production) were returned to the laboratory, which was within a 10-minute drive of the field site. Salinity was measured for discrete samples collected in the field using a handheld multiparameter probe (YSI, Pro2030). Depth was recorded using a YSI 600OMS V2 Optical Monitoring Sonde deployed on the creek bed.

Microbial community analysis sample collection
Planktonic microbial cells from surface water samples were collected by filtration. Water was pre-filtered through a GF/D glass fiber filter (~2.7 μm pore size, Whatman, GE Healthcare Life Sciences, Marlborough, MA); 500 mL of the filtrate was passed through a 0.22 μm pore size, 47 mm diameter filter (Millipore, Burlington, MA). Filtration was completed within 30 min of sample collection. After filtration, all filters were placed in cryovials and flash frozen in liquid nitrogen. The samples were stored at -80 °C until processing.

Microbial community analysis sample processing
Nucleic acids were extracted from samples following standard methodology. Briefly, for DNA the filters were thawed and placed in a 2 mL tube with 0.3 g glass and zirconia beads (0.2 g glass and 0.1 g zirconia), 0.75 mL CTAB extraction buffer, 0.75 mL phenol:chloroform:isoamyl alcohol (25:24:1, pH 8), internal standards, proteinase K, 10% SDS, and lysozyme for DNA extractions. Samples were vortexed for 10 min to lyse the cells. For RNA extraction, sample tubes were centrifuged for 10 min at 10,000 rpm and 4 °C. The lysates were transferred to a sterile 1.5 mL microcentrifuge tube and mixed with 0.75 mL chloroform:isoamyl alcohol (24:1). The aqueous phase was added to a sterile 1.5 mL microcentrifuge tube with MgCl2, sodium acetate, and isopropanol. This solution was incubated at -80 °C for 1.5 hours and then centrifuged at 4 °C for 45 min at 10,000 rpm. The supernatant was discarded, and the RNA was washed twice with 70% EtOH. Following RNA extraction, Turbo DNase was used to remove residual DNA. For metagenomic samples, the lysate was centrifuged at 5,000 rpm for 5 min and washed twice with 0.5 mL of chloroform:isoamyl alcohol by centrifugation at 15,000 rpm for 5 min. The upper aqueous phase was incubated with isopropanol at room temperature for 2 hours. The DNA was precipitated by centrifugation at 10,000 rpm for an hour and washed twice with 70% EtOH.

All sequencing, assembly, and annotation were performed by the DOE Joint Genome Institute (JGI). JGI generated 16S rRNA libraries, metagenomes, and metatranscriptomes. Plate-based DNA library preparation for Illumina sequencing was performed on the PerkinElmer Sciclone NGS robotic liquid handling system using a Kapa Biosystems library preparation kit. DNA was sheared to 300 base pairs (bp) using the Covaris LE220 focused-ultrasonicator and size selected using SPRI beads (Beckman Coulter). The fragments were treated with end-repair, A-tailing, and ligation of Illumina-compatible adapters (IDT, Inc) containing a unique molecular index barcode for each sample library. qPCR was used to determine the concentration of the libraries, which were then sequenced on the Illumina HiSeq-2500 to yield 150 bp paired-end reads at the DOE Joint Genome Institute. Quality-filtered metagenomic sequences for each sample were assembled with metaSPAdes (version 3.10.1), and all contigs >200 bp were uploaded and annotated by the Integrated Microbial Genomes (IMG) pipeline. For metatranscriptomes, plate-based RNA sample preparation was performed on the PerkinElmer Sciclone NGS robotic liquid handling system using the Illumina Ribo-Zero rRNA Removal Kit (bacteria) and the TruSeq Stranded Total RNA HT sample prep kit following the protocol outlined by Illumina. Total RNA starting material was 100 ng per sample, and library amplification used 10 cycles of PCR. Illumina sequencing was performed as described for metagenome samples.

Quality-filtered metatranscriptomic sequences for each sample were assembled with Megahit (version 1.10.6), and all contigs >200 bp were annotated as described for the metagenome samples. Datasets whose assembly N50 was more than three standard deviations from the mean were not included in further analyses (Supplemental Tables 1 and 2). Resultant assemblies were combined with coding sequences (CDS) using bedtools2 (version 2.27.0) to generate an assembly with CDS embedded. Quality-controlled raw reads were mapped to the assembly with gene features using bowtie2 (version 2.2.9). Coverage information on the number of reads mapping to each contig was generated using pileup in the BBmap suite of tools. The coverage information was used to normalize read counts to account for the length of reads and the length of CDS. Read counts within KEGG ortholog groups (KO) were summed and normalized as read counts per million mapped to KO-annotated contigs (genes per million [GPM] for metagenomes, transcripts per million [TPM] for metatranscriptomes). GPM and TPM were also used in taxonomic analyses.
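
The normalization described above can be sketched roughly as follows. This is a minimal illustration of a standard TPM-style calculation, not the authors' actual pipeline: the function name, the input tuple layout, and the exact formula (reads multiplied by read length, divided by CDS length, summed per KO, then scaled to one million) are assumptions. The 150 bp read length comes from the sequencing description.

```python
from collections import defaultdict


def ko_per_million(cds_records, read_length=150):
    """Hypothetical helper: length-normalized abundance per KO, scaled to 1e6.

    cds_records: iterable of (ko_id, cds_length_bp, mapped_read_count).
    CDS without a KO annotation (ko_id is None) are skipped, so the result
    is expressed per million mapped to KO-annotated features.
    """
    ko_rate = defaultdict(float)
    for ko_id, cds_length, read_count in cds_records:
        if ko_id is None or cds_length <= 0:
            continue
        # Approximate per-base coverage: mapped bases divided by CDS length.
        ko_rate[ko_id] += read_count * read_length / cds_length

    total = sum(ko_rate.values())
    if total == 0:
        return {}
    # Scale so that values across all KOs sum to one million.
    return {ko: rate / total * 1_000_000 for ko, rate in ko_rate.items()}


# Made-up example values for illustration only.
records = [
    ("K00001", 1500, 300),
    ("K00001", 750, 150),
    ("K00002", 3000, 300),
    (None, 900, 500),  # unannotated CDS, ignored
]
result = ko_per_million(records)
for ko, value in sorted(result.items()):
    print(ko, round(value))
```

With a constant read length the read-length factor cancels in the final ratio; it is kept here only to mirror the description of accounting for both read length and CDS length. The same function would apply to metagenome counts (GPM) and metatranscriptome counts (TPM).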
Purpose This dataset is available to the public for a wide variety of uses including scientific research and analysis.
Use Limitations
  • accessLevel: Public
  • Distribution liability: NOAA and NCEI make no warranty, expressed or implied, regarding these data, nor does the fact of distribution constitute such a warranty. NOAA and NCEI cannot assume liability for any damages caused by any errors or omissions in these data. If appropriate, NCEI can only certify that the data it distributes are an authentic copy of the records that were accepted for inclusion in the NCEI archives.
Theme keywords NODC DATA TYPES THESAURUS NODC OBSERVATION TYPES THESAURUS WMO_CategoryCode
  • oceanography
BCO-DMO Standard Parameters Global Change Master Directory (GCMD) Science Keywords Originator Parameter Names
Data Center keywords NODC COLLECTING INSTITUTION NAMES THESAURUS NODC SUBMITTING INSTITUTION NAMES THESAURUS Global Change Master Directory (GCMD) Data Center Keywords
Platform keywords BCO-DMO Platform Names
Instrument keywords NODC INSTRUMENT TYPES THESAURUS BCO-DMO Standard Instruments Global Change Master Directory (GCMD) Instrument Keywords Originator Instrument Names
Place keywords Provider Place Names
Project keywords BCO-DMO Standard Projects Provider Deployment IDs Provider Funding Award Information
Keywords NCEI ACCESSION NUMBER
Use Constraints
  • Cite as: Buchan, Alison (2023). Metagenomic, metatranscriptomics and 16S rRNA gene sequence data from diel sampling at Groves Creek Marsh, Skidaway Island, GA during July 2014 (NCEI Accession 0278832). [indicate subset used]. NOAA National Centers for Environmental Information. Dataset. https://www.ncei.noaa.gov/archive/accession/0278832. Accessed [date].
Data License
Access Constraints
  • Use liability: NOAA and NCEI cannot provide any warranty as to the accuracy, reliability, or completeness of furnished data. Users assume responsibility to determine the usability of these data. The user is responsible for the results of any application of this data for other than its intended purpose.
Fees
  • In most cases, electronic downloads of the data are free. However, fees may apply for custom orders, data certifications, copies of analog materials, and data distribution on physical media.
Lineage information for: dataset
Processing Steps
  • 2023-05-27T05:11:30Z - NCEI Accession 0278832 v1.1 was published.
Output Datasets
Acquisition Information (collection)
Instrument
  • multi-parameter water quality sensor
  • Niskin bottle
  • nutrient autoanalyzer
  • scintillation counter
  • spectrophotometer
Last Modified: 2024-05-31T15:15:28Z
For questions about the information on this page, please email: ncei.info@noaa.gov