Skip to main content
Dataset Overview | National Centers for Environmental Information (NCEI)

Water quality data, gene copy number counts, and 16S analyses from Rhode River samples collected at SERC during 2017 and 2018 (NCEI Accession 0288317)

browse graphicGraphic not available.
This dataset contains biological, chemical, optical, and physical data collected from 2017-05-17 to 2018-12-27. These data include chlorophyll a, conductivity, depth, dissolved Oxygen, fluorescence, pH, salinity calculated from CTD primary sensors, turbidity, water pressure, and water temperature. The instruments used to collect these data include Automated DNA Sequencer, YSI Professional Plus Multi-Parameter Probe, and qPCR Thermal Cycler. These data were collected by Sarah Preheim of Johns Hopkins University as part of the "EAGER: High-throughput, culture-independent technique identifying cyanobacteria infections to improve understanding of carbon biogeochemical cycling (identify cyano infections)" project. The Biological and Chemical Oceanography Data Management Office (BCO-DMO) submitted these data to NCEI on 2020-12-07.

The following is the text of the dataset description provided by BCO-DMO:

Water quality data, gene copy number counts, and 16S analyses from Rhode River samples collected at SERC

Dataset Description:
Water quality data, gene copy number counts, and 16S analyses from water samples collected from the Rhode River at the Smithsonian Environmental Research Center (SERC) in Edgewater, MD during 2017 and 2018.
  • Cite as: Preheim, Sarah (2024). Water quality data, gene copy number counts, and 16S analyses from Rhode River samples collected at SERC during 2017 and 2018 (NCEI Accession 0288317). [indicate subset used]. NOAA National Centers for Environmental Information. Dataset. https://www.ncei.noaa.gov/archive/accession/0288317. Accessed [date].
gov.noaa.nodc:0288317
Download Data
  • HTTPS (download)
    Navigate directly to the URL for data access and direct download.
  • FTP (download)
    These data are available through the File Transfer Protocol (FTP). FTP is no longer supported by most internet browsers. You may copy and paste the FTP link to the data into an FTP client (e.g., FileZilla or WinSCP).
Distribution Formats
  • TSV
Ordering Instructions Contact NCEI for other distribution options and instructions.
Distributor NOAA National Centers for Environmental Information
+1-301-713-3277
NCEI.Info@noaa.gov
Dataset Point of Contact NOAA National Centers for Environmental Information
ncei.info@noaa.gov
Time Period 2017-05-17 to 2018-12-27
Spatial Bounding Box Coordinates
West: -76.5418
East: -76.5402
South: 38.8869
North: 38.8869
Spatial Coverage Map
General Documentation
Associated Resources
  • Biological, chemical, physical, biogeochemical, ecological, environmental and other data collected from around the world during historical and contemporary periods of biological and chemical oceanographic exploration and research managed and submitted by the Biological and Chemical Oceanography Data Management Office (BCO-DMO)
    • NCEI Collection
      Navigate directly to the URL for data access and direct download.
  • Preheim, S. (2020) Water quality data, gene copy number counts, and 16S analyses from Rhode River samples collected at SERC during 2017 and 2018. Biological and Chemical Oceanography Data Management Office (BCO-DMO). (Version 2) Version Date 2020-12-03. https://doi.org/10.26008/1912/bco-dmo.757405.2
  • Parent ID (indicates this dataset is related to other data):
    • gov.noaa.nodc:BCO-DMO
Publication Dates
  • publication: 2024-02-02
Data Presentation Form Digital table - digital representation of facts or figures systematically displayed, especially in columns
Dataset Progress Status Complete - production of the data has been completed
Historical archive - data has been stored in an offline storage facility
Data Update Frequency As needed
Supplemental Information
Acquisition Description:
Surface water samples were collected for subsequent microbial community analyses via PCR and shotgun metagenomics. A YSI EXO2 water quality sonde operated by the Smithsonian Environmental Research Center was used to collect physicochemical conditions at time of sample collection.

Shotgun metagenomic DNA and 16S rRNA microbial genes were amplified as described in Sakowski et al. (in revision). Metagenomic DNA was prepped using the Nextera Flex library prep kit. 16S rRNA libraries were prepped by V3-V4 PCR amplification. Both shotgun metagenome and 16S libraries were sequenced on an Illumina MiSeq.

qPCR of 16S rRNA genes and Viral Ribonucleotide Reductase Genes:
Bacterial and viral abundances were estimated by quantitative PCR. For Ribonucleotide Reductase (RNR) quantification, an environmental RNR amplicon was first cloned into chemically competent Escherichia coli cells using the Zero Blunt PCR Cloning Kit (Thermo Scientific) following the manufacturer's protocol. Environmental samples were quantitated for 16S rRNA and RNR gene copy numbers by comparing to a serial dilution of the cloned E. coli cells at known concentrations. All standards and environmental samples were run in triplicate. Three microliters of sample were combined with UltraPure molecular grade water (Thermo, Inc.), SsoAdvanced Universal SYBR Green Supermix (1x final concentration, Bio-Rad Laboratories, Inc.), Forward primer (0.3 mM final concentration), and Reverse primer (0.3 mM final concentration) to a final volume of 25 mL. Samples were amplified on a CFX96 Real-Time PCR Detection System (Bio-Rad Laboratories, Inc.) with the following conditions: denaturing at 98°C for 10 minutes; 45 cycles of denaturing at 98°C for 10 seconds, annealing at 52°C for 30 seconds, and extension at 72°C for 45 seconds; and a final extension of 72°C for 5 minutes.

16S rRNA amplification and sequencing:
16S rRNA genes were amplified from surface water samples in a 25 mL PCR reaction with the following conditions: three microliters of column-purified DNA were combined with UltraPure molecular grade water (Thermo, Inc.), 10X buffer (1x final concentration), dNTPs (0.1mM each final concentration), 16S forward primer 27F (0.3 mM final concentration), 16S reverse primer PE_16S_V4_E786_R (0.3 mM final concentration), bovine serum albumin (0.02 mg/mL final concentration), and Phusion High-Fidelity DNA Polymerase (0.5U; New England BioLabs, Inc. PCR reactions were combined with 150 mL of 4% UMIL EM90 oil (4% UMIL EM90 oil, 0.05% TritonX-100 v/v in mineral oil; Universal Preserv-A-Chem, Inc.) and emulsified by vortexing at max speed (~2,700 rpm) for one minute on a Vortex Genie 2 (MoBio). Emulsions were loaded as 50 mL aliquots and amplified with the following conditions: denaturation at 94°C for 3 minutes; 33 cycles of denaturation at 94°C for 10 seconds, annealing at 54°C for 30 seconds, and extension at 72°C for 45 seconds; and a final extension of 72°C for 5 minutes (C1000, BioRad Labs., Inc.).

PCR oil emulsions were broken with isobutanol. Briefly, PCR aliquots were pooled in a 1.5mL microcentrifuge tube and combined with 100 uL of sterile 5M NaCl solution and 1 mL of isobutanol. Samples were vortexed briefly to mix and centrifuged at 16,000 x g for 1 minute. The bottom aqueous layer was retained, and DNA was purified by spin column purification (Zymo, Inc.). DNA was eluted in 20 uL of Tris-HCl.

Purified DNA was run on a 1.5% agarose gel (UltraPure Agarose, ThermoFisher Scientific). 16S rRNA gene bands were visualized under blue light excitation, extracted, and gel purified (Zymo, Inc.) Purified DNA was eluted into 20 mL of Tris-HCl. Barcodes and Illumina adapters were added to 16S rRNA gene amplicon products in two subsequent limited PCR steps. Barcodes were added as follows: two microliters of purified DNA were combined with UltraPure molecular grade water (Thermo, Inc.), 10X buffer (1x final concentration), dNTPs (0.1mM each final concentration), 16S forward primer PE_16S_V4_U515F (0.3 mM final concentration), 16S rRNA gene reverse primer with 8-mer barcodes PE_IV_XXX (0.3 mM final concentration), and Phusion High-Fidelity DNA Polymerase (0.5U; New England BioLabs, Inc.). Samples were amplified with the following conditions: denaturing at 98°C for 30 seconds; 8 cycles of denaturing at 98°C for 10 seconds, annealing at 54°C for 30 seconds, and extension at 72°C for 45 seconds; and a final extension of 72°C for 5 minutes. DNA was purified by spin column purification (Zymo, Inc.) and eluted into 20 mL Tris-HCl. Illumina adapters were then added as above with the following primers: Illumina adapter forward primer PE-III-PCR-F (0.3 mM final concentration) and Illumina adapter reverse primer Barcode_Rev (0.3 mM final concentration). Samples were amplified with the following conditions: denaturing at 98°C for 30 seconds; 5 cycles of denaturing at 98°C for 10 seconds, annealing at 54°C for 30 seconds, and extension at 72°C for 45 seconds; and a final extension of 72°C for 5 minutes. DNA was purified by spin column purification (Zymo, Inc.) and eluted into 20 mL Tris-HCl. 16S rRNA gene amplicon libraries were sequenced on an Illumina MiSeq (2 x 300 bp) at the Genetic Core Research Facility at Johns Hopkins University.

Shotgun metagenomic library preparation and sequencing:
Surface water samples were filtered through a 0.2µm PES filter. DNA was extracted from filters for shotgun sequencing from water samples with the DNeasy PowerWater kit (Qiagen) following the manufacturer’s protocol with the following amendment: 20 µL of proteinase K was combined with 1 mL of solution PW1 in the bead tube. The bead tube was incubated at 65°C for ten minutes prior to bead beating. Libraries were prepared with the Nextera DNA Flex Library Prep kit (Illumina, Inc.) following the manufacture's protocol and sequenced on an Illumina MiSeq (2 x 300 bp) at the Genetic Core Research Facility at Johns Hopkins University.

epicPCR of environmental samples:
Surface water glycerol samples (25% v/v) were thawed on ice and one mL was added to three replicate 1.5 mL microcentrifuge tubes per sample. Samples were centrifuged at 25,000 x g for 10 minutes and resuspended after supernatant removal to reduce free viral particles. Thirty microliters of each sample was combined with UltraPure molecular grade water (Thermo, Inc.), 10X buffer (1x final concentration), dNTPs (0.1mM each final concentration), Viral Forward primer (1.0 mM final concentration), Viral Reverse_519R fusion primers (R1 and R2 combined, 0.01 mM each final concentration), 16S Reverseeverse primer (1.0 mM final concentration), bovine serum albumin (0.02 mg/mL final concentration), Tween-20 (0.8% v/v final concentration), and Phusion High-Fidelity DNA Polymerase (1.5U; New England BioLabs, Inc.) to a final volume of 75 mL. PCR reactions were combined with 450 mL of 4% UMIL EM90 oil (4% UMIL EM90 oil, 0.05% TritonX-100 v/v in mineral oil; Universal Preserv-A-Chem, Inc.) and emulsified by vortexing at max speed (~2,700 rpm) for one minute. Emulsions were loaded as 50 mL aliquots and amplified with the following conditions: denaturation at 94°C for 3 minutes; 33 cycles of denaturation at 94°C for 10 seconds, annealing at 54°C for 30 seconds, and extension at 72°C for 45 seconds; and a final extension of 72°C for 5 minutes (C1000, BioRad Labs., Inc.).

PCR oil emulsions were broken with isobutanol. Briefly, PCR aliquots were pooled in a 1.5mL microcentrifuge tube and combined with 100 uL of sterile 5M NaCl solution and 1 mL of isobutanol. Samples were vortexed briefly to mix and centrifuged at 16,000 x g for 1 minute. The bottom aqueous layer was retained, and DNA was purified by spin column purification (Zymo, Inc.). DNA was eluted in 20 uL of Tris-HCl.

Amplicons were enriched by nested PCR and subsequently sequenced on a PacBio Sequel with Sequel v3 chemistry (University of Maryland Institute for Genome Sciences). Circular consensus sequences were obtained from raw reads with the following parameters: minimum signal-to-noise ratio (SNR): 3, minimum length: 500bp, minimum passes: 10, minimum read score: 0.75, minimum predicted accuracy: 0.90.
Purpose This dataset is available to the public for a wide variety of uses including scientific research and analysis.
Use Limitations
  • accessLevel: Public
  • Distribution liability: NOAA and NCEI make no warranty, expressed or implied, regarding these data, nor does the fact of distribution constitute such a warranty. NOAA and NCEI cannot assume liability for any damages caused by any errors or omissions in these data. If appropriate, NCEI can only certify that the data it distributes are an authentic copy of the records that were accepted for inclusion in the NCEI archives.
Dataset Citation
  • Cite as: Preheim, Sarah (2024). Water quality data, gene copy number counts, and 16S analyses from Rhode River samples collected at SERC during 2017 and 2018 (NCEI Accession 0288317). [indicate subset used]. NOAA National Centers for Environmental Information. Dataset. https://www.ncei.noaa.gov/archive/accession/0288317. Accessed [date].
Cited Authors
Principal Investigators
Contributors
Resource Providers
Points of Contact
Publishers
Acknowledgments
Theme keywords NODC DATA TYPES THESAURUS NODC OBSERVATION TYPES THESAURUS WMO_CategoryCode
  • oceanography
BCO-DMO Standard Parameters Global Change Master Directory (GCMD) Science Keywords Originator Parameter Names
Data Center keywords NODC COLLECTING INSTITUTION NAMES THESAURUS NODC SUBMITTING INSTITUTION NAMES THESAURUS Global Change Master Directory (GCMD) Data Center Keywords
Instrument keywords NODC INSTRUMENT TYPES THESAURUS BCO-DMO Standard Instruments Originator Instrument Names
Project keywords BCO-DMO Standard Projects Provider Funding Award Information
Keywords NCEI ACCESSION NUMBER
Use Constraints
  • Cite as: Preheim, Sarah (2024). Water quality data, gene copy number counts, and 16S analyses from Rhode River samples collected at SERC during 2017 and 2018 (NCEI Accession 0288317). [indicate subset used]. NOAA National Centers for Environmental Information. Dataset. https://www.ncei.noaa.gov/archive/accession/0288317. Accessed [date].
Data License
Access Constraints
  • Use liability: NOAA and NCEI cannot provide any warranty as to the accuracy, reliability, or completeness of furnished data. Users assume responsibility to determine the usability of these data. The user is responsible for the results of any application of this data for other than its intended purpose.
Fees
  • In most cases, electronic downloads of the data are free. However, fees may apply for custom orders, data certifications, copies of analog materials, and data distribution on physical media.
Lineage information for: dataset
Processing Steps
  • 2024-02-02T15:30:17Z - NCEI Accession 0288317 v1.1 was published.
Output Datasets
Acquisition Information (collection)
Instrument
  • multi-parameter water quality sensor
  • PCR machine
Last Modified: 2024-05-31T15:15:28Z
For questions about the information on this page, please email: ncei.info@noaa.gov