Operational taxonomic unit (OTU) table for 18S rRNA gene tag sequences from DNA and RNA from samples collected in coastal California in 2013 and 2014 (NCEI Accession 0277250)
This dataset contains biological data collected on R/V Yellowfin during cruise SPOT_Yellowfin_Cruises from 2013-04-24 to 2014-01-15. These data include taxon and taxon_code. The instruments used to collect these data include Automated DNA Sequencer, CTD Sea-Bird SBE 911plus, and Niskin bottle. These data were collected by David Caron and Sarah K Hu of University of Southern California as part of the "Protistan, prokaryotic, and viral processes at the San Pedro Ocean Time-series (SPOT)" project. The Biological and Chemical Oceanography Data Management Office (BCO-DMO) submitted these data to NCEI on 2023-01-23.
The following is the text of the dataset description provided by BCO-DMO:
Dataset Description: This dataset is a raw output operational taxonomic unit (OTU) table generated by processing and clustering raw 18S rRNA gene tag sequences from extracted DNA and RNA. Columns represent samples, including month sampled, material (either extracted RNA or DNA), and depth (in meters); thus values in each column represent the number of sequences in that sample that belong to a given OTU (OTUs by row). Each row represents a single OTU. The last column lists the taxonomic identifier assigned to each OTU. The raw sequence data can be found in the NCBI SRA database under accession number SRP070577 with the associated BioProject PRJNA311248.
Metadata for these sequences can be found in the dataset: ”18S rRNA gene tag sequences from DNA and RNA": https://www.bco-dmo.org/dataset/745527
The following is the text of the dataset description provided by BCO-DMO:
Dataset Description: This dataset is a raw output operational taxonomic unit (OTU) table generated by processing and clustering raw 18S rRNA gene tag sequences from extracted DNA and RNA. Columns represent samples, including month sampled, material (either extracted RNA or DNA), and depth (in meters); thus values in each column represent the number of sequences in that sample that belong to a given OTU (OTUs by row). Each row represents a single OTU. The last column lists the taxonomic identifier assigned to each OTU. The raw sequence data can be found in the NCBI SRA database under accession number SRP070577 with the associated BioProject PRJNA311248.
Metadata for these sequences can be found in the dataset: ”18S rRNA gene tag sequences from DNA and RNA": https://www.bco-dmo.org/dataset/745527
Dataset Citation
- Cite as: Caron, David; Hu, Sarah K. (2023). Operational taxonomic unit (OTU) table for 18S rRNA gene tag sequences from DNA and RNA from samples collected in coastal California in 2013 and 2014 (NCEI Accession 0277250). [indicate subset used]. NOAA National Centers for Environmental Information. Dataset. https://www.ncei.noaa.gov/archive/accession/0277250. Accessed [date].
Dataset Identifiers
ISO 19115-2 Metadata
gov.noaa.nodc:0277250
Download Data |
|
Distribution Formats |
|
Ordering Instructions | Contact NCEI for other distribution options and instructions. |
Distributor |
NOAA National Centers for Environmental Information +1-301-713-3277 NCEI.Info@noaa.gov |
Dataset Point of Contact |
NOAA National Centers for Environmental Information ncei.info@noaa.gov |
Time Period | 2013-04-24 to 2014-01-15 |
Spatial Bounding Box Coordinates |
West: -118.475167
East: -118.259167
South: 33.452833
North: 33.7125
|
Spatial Coverage Map |
General Documentation |
|
Associated Resources |
|
Publication Dates |
|
Data Presentation Form | Digital table - digital representation of facts or figures systematically displayed, especially in columns |
Dataset Progress Status | Complete - production of the data has been completed Historical archive - data has been stored in an offline storage facility |
Data Update Frequency | As needed |
Supplemental Information | Acquisition Description: These data were published in Hu et al., 2016. This dataset is a raw output operational taxonomic unit (OTU) table generated by processing and clustering raw 18S rRNA gene tag sequences from DNA and RNA. The numbers in each column represent the number of sequences from that sample belonging to a given OTU (row), with the last column listing the taxonomic ID assigned to each OTU. The raw sequence data can be found in the NCBI SRA database under accession number with the associated BioProject . Metadata for these sequences can be found in the dataset: ”18S rRNA gene tag sequences from DNA and RNA": https://www.bco-dmo.org/dataset/745527 Nucleotide bases with a Q score lower than 20 for the last 30 bp of each sequence were trimmed. Paired-end sequences were merged using FLASh (Magoc and Salzberg 2011) with a minimum of 10 bp and maximum of 150 bp overlap between each sequence pair. Sequences shorter than 350 bp, longer than 460 bp, or which had an average quality score lower than 25 were discarded using QIIME v1.8 (Caporaso et al . 2010). Chimeric sequences were identified and removed, by either de novo or reference-based chimera checking (identify chimeric seqs.py in QIIME, intersection method). The code release v2 associated with this version of the dataset can be downloaded as a .zip file from the Supplemental Documents section of this page. Future code updates will be accessible from the GitHub repository . |
Purpose | This dataset is available to the public for a wide variety of uses including scientific research and analysis. |
Use Limitations |
|
Dataset Citation |
|
Cited Authors | |
Principal Investigators | |
Contributors | |
Resource Providers | |
Points of Contact | |
Publishers | |
Acknowledgments |
Theme keywords |
NODC DATA TYPES THESAURUS
NODC OBSERVATION TYPES THESAURUS
WMO_CategoryCode
|
Data Center keywords | NODC COLLECTING INSTITUTION NAMES THESAURUS NODC SUBMITTING INSTITUTION NAMES THESAURUS Global Change Master Directory (GCMD) Data Center Keywords |
Platform keywords | NODC PLATFORM NAMES THESAURUS BCO-DMO Platform Names Global Change Master Directory (GCMD) Platform Keywords ICES/SeaDataNet Ship Codes |
Instrument keywords | NODC INSTRUMENT TYPES THESAURUS BCO-DMO Standard Instruments Global Change Master Directory (GCMD) Instrument Keywords Originator Instrument Names |
Place keywords | Provider Place Names |
Project keywords | NODC PROJECT NAMES THESAURUS BCO-DMO Standard Projects Provider Cruise IDs Provider Funding Award Information |
Keywords | NCEI ACCESSION NUMBER |
Use Constraints |
|
Data License | |
Access Constraints |
|
Fees |
|
Lineage information for: dataset | |
---|---|
Processing Steps |
|
Output Datasets |
|
Acquisition Information (collection) | |
---|---|
Instrument |
|
Platform |
|
Last Modified: 2024-05-31T15:15:28Z
For questions about the information on this page, please email: ncei.info@noaa.gov
For questions about the information on this page, please email: ncei.info@noaa.gov