Dataset Overview | National Centers for Environmental Information (NCEI)

Voucher summary of invertebrates and barcoded OTU's with field identifications, collected from Palau marine lakes from 2014-07-11 to 2015-07-27 (NCEI Accession 0277940)

This dataset contains biological and survey - biological data collected on Small boats - CRRF during cruise Palau_lakes from 2014-07-11 to 2015-07-27. These data include family, genus, order, phylum, and taxon. The instruments used to collect these data include Automated DNA Sequencer and PCR Thermal Cycler. These data were collected by Michael N Dawson of University of California-Merced as part of the "Do Parallel Patterns Arise from Parallel Processes? (PaPaPro)" project and "Dimensions of Biodiversity (Dimensions of Biodiversity)" program. The Biological and Chemical Oceanography Data Management Office (BCO-DMO) submitted these data to NCEI on 2019-07-08.

The following is the text of the dataset description provided by BCO-DMO:

Palau lakes: voucher summary

Dataset Description:
Voucher summary of invertebrates and barcoded OTU's with field identifications, collected from Palau marine lakes. Summaries by major taxon (e.g. phylum) are provided.

* NOTE: The PIs are using this dataset to write papers. Please contact them before using these data to make sure you are not duplicating efforts.
  • Cite as: Dawson, Michael N. (2023). Voucher summary of invertebrates and barcoded OTU's with field identifications, collected from Palau marine lakes from 2014-07-11 to 2015-07-27 (NCEI Accession 0277940). [indicate subset used]. NOAA National Centers for Environmental Information. Dataset. https://www.ncei.noaa.gov/archive/accession/0277940. Accessed [date].
gov.noaa.nodc:0277940
Download Data
  • HTTPS (download)
    Navigate directly to the URL for data access and direct download.
  • FTP (download)
    These data are available through the File Transfer Protocol (FTP). FTP is no longer supported by most internet browsers. You may copy and paste the FTP link to the data into an FTP client (e.g., FileZilla or WinSCP).
Distribution Formats
  • TSV
Ordering Instructions Contact NCEI for other distribution options and instructions.
Distributor NOAA National Centers for Environmental Information
+1-301-713-3277
NCEI.Info@noaa.gov
Dataset Point of Contact NOAA National Centers for Environmental Information
ncei.info@noaa.gov
Time Period 2014-07-11 to 2015-07-27
Spatial Bounding Box Coordinates
West: 134.3447
East: 134.5089
South: 7.1506
North: 7.3237
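As a quick illustration, the bounding box above can be used to check whether a sampling location falls inside the dataset's stated extent. This is only a sketch: the `BoundingBox` class and the two test points are hypothetical, and the coordinates are assumed to be decimal degrees (east longitude, north latitude), which the page does not state explicitly.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BoundingBox:
    """Axis-aligned box; values assumed to be decimal degrees."""
    west: float
    east: float
    south: float
    north: float

    def contains(self, lon: float, lat: float) -> bool:
        # Inclusive comparison on all four edges.
        return self.west <= lon <= self.east and self.south <= lat <= self.north


# Coordinates copied from the metadata record above.
PALAU_LAKES = BoundingBox(west=134.3447, east=134.5089, south=7.1506, north=7.3237)

# Arbitrary test points chosen for illustration, not real sampling sites.
inside = PALAU_LAKES.contains(134.40, 7.25)
outside = PALAU_LAKES.contains(134.60, 7.25)
print(inside, outside)
```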
General Documentation
Associated Resources
  • Biological, chemical, physical, biogeochemical, ecological, environmental and other data collected from around the world during historical and contemporary periods of biological and chemical oceanographic exploration and research managed and submitted by the Biological and Chemical Oceanography Data Management Office (BCO-DMO)
  • Dawson, M. N. (2019) Voucher summary of invertebrates and barcoded OTU's with field identifications, collected from Palau marine lakes. Biological and Chemical Oceanography Data Management Office (BCO-DMO). Dataset version 2019-05-13. https://doi.org/10.1575/1912/bco-dmo.768196.1
  • Parent ID (indicates this dataset is related to other data):
    • gov.noaa.nodc:BCO-DMO
Publication Dates
  • publication: 2023-05-06
Data Presentation Form Digital table - digital representation of facts or figures systematically displayed, especially in columns
Dataset Progress Status Complete - production of the data has been completed
Historical archive - data has been stored in an offline storage facility
Data Update Frequency As needed
Supplemental Information
Acquisition Description:
After completion of fieldwork, a subset of specimens from the transect surveys was chosen for DNA barcoding to confirm or amend field identifications. These specimens included (i) at least one specimen from each field-ID (except obvious species such as Mastigias papua) and (ii) several specimens representing the range of phenotypic variation of field-IDs that showed considerable variation or were challenging to distinguish (e.g. small sponge specimens of similar color and texture). Additionally, specimens from a previously collected voucher collection (indicated with “V_” in the prefix of the sequence ID) were barcoded and identified by taxonomic experts. Specimens from population genetic collections (indicated with “PG_” in the prefix of the sequence ID) were also barcoded.
DNA was purified using a modified phenol-chloroform CTAB extraction protocol (1) or the AcroPrep PALL 5053 glass fiber plate procedure (2, 3). We amplified the cytochrome c oxidase subunit I (COI) barcode locus using 0.5 µL of purified DNA in a 25-µL polymerase chain reaction (PCR) with 0.05 µL AMPLITAQ (Applied Biosystems, Foster City, California, USA), 2.5 µL 10x buffer (Applied Biosystems), 0.63 µL of 20 µM primers (Operon Biotechnologies Inc., Huntsville, Alabama, USA), 2.5 µL of 25 mM MgCl2 (Applied Biosystems), 0.5 µL of 10 mg/mL bovine serum albumin (BSA) and 0.5 µL of 10 mM dNTPs. Several primer sets were used (Table 1).
Amplicons were sequenced at the University of California Berkeley DNA Sequencing Facility (Berkeley, California, USA). Base calls in electropherograms were visually checked and manually corrected for errors, and forward and reverse reads were assembled in Sequencher 4.8 (GeneCodes, Ann Arbor, Michigan, USA).
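The PCR recipe above lists stock concentrations and added volumes; the final in-reaction concentrations follow from C1·V1 = C2·V2. The sketch below works through that arithmetic. It rests on two assumptions not stated in the text: that the 0.63 µL of primer is counted once (the text does not say whether this is per primer), and that the volume not accounted for by the listed reagents is made up with water.

```python
# Final concentration of each reagent in the 25-µL reaction, via C1*V1 = C2*V2.
REACTION_VOLUME_UL = 25.0

# name -> (stock concentration, unit, volume added in µL), from the methods text.
reagents = {
    "buffer": (10.0, "x", 2.5),
    "primers": (20.0, "µM", 0.63),   # assumption: counted once, not per primer
    "MgCl2": (25.0, "mM", 2.5),
    "BSA": (10.0, "mg/mL", 0.5),
    "dNTPs": (10.0, "mM", 0.5),
}


def final_concentration(stock, volume_ul, total_ul=REACTION_VOLUME_UL):
    """Dilute a stock of the given concentration into the total volume."""
    return stock * volume_ul / total_ul


final = {
    name: (final_concentration(stock, vol), unit)
    for name, (stock, unit, vol) in reagents.items()
}

# Template DNA (0.5 µL) and polymerase (0.05 µL) add volume but have no
# stock concentration given in the text.
listed_volume = sum(vol for _, _, vol in reagents.values()) + 0.5 + 0.05
# Assumption: the remainder of the 25 µL is water.
water = REACTION_VOLUME_UL - listed_volume

for name, (conc, unit) in final.items():
    print(f"{name}: {conc:.3f} {unit}")
print(f"remaining volume (assumed water): {water:.2f} µL")
```

Under these assumptions the listed reagents occupy 7.18 µL of the 25 µL, so most of the reaction volume is the unlisted diluent.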
We used the Basic Local Alignment Search Tool (BLASTn) to determine the higher-level taxonomic assignment for each sequence, which we used to process batches of similar sequences: ascidians, bivalves, bryozoans, cnidarians, crustaceans, echinoderms, gastropods, polychaetes, and poriferans. Sequences organized by these broad groups were then aligned using Muscle v3.8.425 (4). For each group, alignments were manually adjusted and trimmed to the same length in Mesquite v3.5 (5) to balance total individuals retained and sequence length. The resulting alignment lengths were: ascidians 395 bp, bivalves 567 bp, bryozoans 622 bp, cnidarians 612 bp, crustaceans 299 bp, echinoderms 357 bp, gastropods 562 bp, polychaetes 509 bp, and poriferans 688 bp. Sequences were translated to amino acid sequence to confirm an open reading frame. Short sequences were excluded from further analysis, but percent pairwise identity with the closest match was recorded for each based on the shortest sequence.
Pairwise sequence distance was calculated using dist.dna with Kimura’s 2-parameter distance model of evolution (6) in the ape package v4.1 (7) in R (8). OTUs, or clusters of sequences, similar at 97% were identified using tclust in the spider package v1.5.0 (9) in R (8) for each taxonomic group, except for poriferans, which were clustered at 99% sequence similarity given their slow sequence evolution (10).
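To make the distance and clustering step concrete, here is a minimal Python sketch of the Kimura 2-parameter distance, d = -½ ln[(1 - 2P - Q)·√(1 - 2Q)], where P and Q are the proportions of transitions and transversions, followed by a simple threshold clustering. It is not the authors' workflow, which used dist.dna (ape) and tclust (spider) in R: the single-linkage grouping here is a stand-in that may differ from tclust, gaps are skipped pairwise, and reading "similar at 97%" as a distance threshold of 0.03 is an assumption. The three sequences are invented toy data.

```python
import math
from itertools import combinations

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq_a, seq_b):
    """Kimura 2-parameter distance between two aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = transitions = transversions = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # skip gaps and ambiguity codes for this pair
        compared += 1
        if a == b:
            continue
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1      # A<->G or C<->T
        else:
            transversions += 1    # purine <-> pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))


def cluster_by_threshold(seqs, threshold=0.03):
    """Single-linkage grouping: link any pair closer than the threshold."""
    names = list(seqs)
    parent = {n: n for n in names}

    def find(n):
        while parent[n] != n:
            parent[n] = parent[parent[n]]  # path compression
            n = parent[n]
        return n

    for a, b in combinations(names, 2):
        if k2p_distance(seqs[a], seqs[b]) < threshold:
            parent[find(a)] = find(b)

    clusters = {}
    for n in names:
        clusters.setdefault(find(n), []).append(n)
    return sorted(clusters.values())


# Toy 40-bp sequences (invented for illustration only).
base = "ACGT" * 10
toy = {
    "a": base,
    "b": "G" + base[1:],  # one transition relative to "a"
    "c": "".join("C" if i in (0, 4, 8, 12) else ch for i, ch in enumerate(base)),
}
otus = cluster_by_threshold(toy, threshold=0.03)
print(otus)
```

For poriferans, the 99% criterion in the text would correspond, under the same assumption, to calling `cluster_by_threshold` with `threshold=0.01`.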

1. Dawson MN, Raskoff KA, Jacobs DK (1998) Field preservation of marine invertebrate tissue for DNA analyses. Mol Mar Biol Biotechnol 7(2):145–52.

2. Ivanova NV, deWaard JR, Hebert PDN (2006) An inexpensive, automation-friendly protocol for recovering high-quality DNA. Mol Ecol Notes 6(4):998–1002.

3. Schiebelhut LM, Abboud SS, Gómez Daglio LE, Swift HF, Dawson MN (2017) A comparison of DNA extraction methods for high-throughput DNA analyses. Mol Ecol Resour 17(4):721–729.

4. Edgar RC (2004) MUSCLE: Multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32(5):1792–1797.

5. Maddison WP, Maddison DR (2018) Mesquite: a modular system for evolutionary analysis.

6. Kimura M (1980) A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. J Mol Evol 16(2):111–120.

7. Paradis E, Claude J, Strimmer K (2004) APE: Analyses of phylogenetics and evolution in R language. Bioinformatics 20(2):289–290.

8. R Core Team (2018) R: A language and environment for statistical computing (R Foundation for Statistical Computing, Vienna, Austria).

9. Brown SDJ, et al. (2012) Spider: An R package for the analysis of species identity and evolution, with particular reference to DNA barcoding. Mol Ecol Resour 12(3):562–565.

10. Huang D, Meier R, Todd PA, Chou LM (2008) Slow mitochondrial COI sequence evolution at the base of the metazoan tree and its implications for DNA barcoding. J Mol Evol 66(2):167–174.

See Table 1 (Primers and thermocycle conditions used for PCR of macroinvertebrates by taxonomic group) in Supplemental Documents, below.

For the sequence alignment files (.fas) mentioned in the methods above, see the Supplemental Files section below.
Purpose This dataset is available to the public for a wide variety of uses including scientific research and analysis.
Use Limitations
  • accessLevel: Public
  • Distribution liability: NOAA and NCEI make no warranty, expressed or implied, regarding these data, nor does the fact of distribution constitute such a warranty. NOAA and NCEI cannot assume liability for any damages caused by any errors or omissions in these data. If appropriate, NCEI can only certify that the data it distributes are an authentic copy of the records that were accepted for inclusion in the NCEI archives.
Theme keywords NODC DATA TYPES THESAURUS NODC OBSERVATION TYPES THESAURUS WMO_CategoryCode
  • oceanography
BCO-DMO Standard Parameters Originator Parameter Names
Data Center keywords NODC SUBMITTING INSTITUTION NAMES THESAURUS Global Change Master Directory (GCMD) Data Center Keywords
Platform keywords BCO-DMO Platform Names Global Change Master Directory (GCMD) Platform Keywords
Instrument keywords NODC INSTRUMENT TYPES THESAURUS BCO-DMO Standard Instruments Originator Instrument Names
Place keywords Provider Place Names
Project keywords BCO-DMO Standard Programs BCO-DMO Standard Projects Provider Cruise IDs Provider Funding Award Information
Keywords NCEI ACCESSION NUMBER
Data License
Access Constraints
  • Use liability: NOAA and NCEI cannot provide any warranty as to the accuracy, reliability, or completeness of furnished data. Users assume responsibility to determine the usability of these data. The user is responsible for the results of any application of this data for other than its intended purpose.
Fees
  • In most cases, electronic downloads of the data are free. However, fees may apply for custom orders, data certifications, copies of analog materials, and data distribution on physical media.
Lineage information for: dataset
Processing Steps
  • 2023-05-06T04:18:44Z - NCEI Accession 0277940 v1.1 was published.
Output Datasets
Acquisition Information (collection)
Instrument
  • PCR machine
Last Modified: 2024-09-16T21:36:20Z
For questions about the information on this page, please email: ncei.info@noaa.gov