說明
This dataset represents fish specimen collection records compiled from field surveys and curated for use in databases such as GBIF and OBIS. The original data was sourced from CSV and Excel files and processed using R for the purpose of cleaning, reformatting, standardizing taxonomic and temporal information, and preparing catalog numbers for integration into biodiversity repositories.
資料紀錄
此資源出現紀錄的資料已發佈為達爾文核心集檔案(DwC-A),其以一或多組資料表構成分享生物多樣性資料的標準格式。 核心資料表包含 394 筆紀錄。
此 IPT 存放資料以提供資料儲存庫服務。資料與資源的詮釋資料可由「下載」單元下載。「版本」表格列出此資源的其它公開版本,以便利追蹤其隨時間的變更。
版本
以下的表格只顯示可公開存取資源的已發布版本。
如何引用
研究者應依照以下指示引用此資源。:
Avery T, Oni E (2025). AUW Ichthyology. Version 2.0. Acadia University. Occurrence dataset. https://doi.org/10.5886/yhgsrk
權利
研究者應尊重以下權利聲明。:
此資料的發布者及權利單位為 Acadia University。 To the extent possible under law, the publisher has waived all rights to these data and has dedicated them to the Public Domain (CC0 1.0). Users may copy, modify, distribute and use the work, including for commercial purposes, without restriction.
GBIF 註冊
此資源已向GBIF註冊,並指定以下之GBIF UUID: 3c326b89-69fb-442d-a4a1-1bd54d633cac。 Acadia University 發佈此資源,並經由Canada Biodiversity Information Facility同意向GBIF註冊成為資料發佈者。
關鍵字
Occurrence; Specimen
聯絡資訊
- 元數據提供者
- 編輯
- Data manager
地理涵蓋範圍
Canada | Bahamas
界定座標範圍 | 緯度南界 經度西界 [-90, -180], 緯度北界 經度東界 [90, 180] |
---|
分類群涵蓋範圍
無相關描述
Order | Gadiformes, Percopsiformes, Gasterosteiformes, Scleralcyonacea, Saccopharyngiformes, Tetraodontiformes, Stomiiformes, Acipenseriformes, Salmoniformes, Acanthuriformes, Mugiliformes, Aulopiformes, Myliobatiformes, Myxiniformes, Osmeriformes, Squaliformes, Syngnathiformes, Beloniformes, Scorpaeniformes, Perciformes, Esociformes, Lamniformes, Rajiformes, Siluriformes, Cypriniformes, Anguilliformes, Petromyzontiformes, Pleuronectiformes, Clupeiformes, Myctophiformes |
---|---|
Family | Liparidae, Psychrolutidae, Eurypharyngidae, Mugilidae, Agonidae, Rajidae, Sphyraenidae, Percopsidae, Carangidae, Myctophidae, Ictaluridae, Mopseidae, Synaphobranchidae, Anguillidae, Sternoptychidae, Gerreidae, Cyclopteridae, Triglidae, Pomacentridae, Monacanthidae, Lotidae, Nemichthyidae, Squalidae, Percidae, Stomiidae, Scophthalmidae, Chlorophthalmidae, Phycidae, Exocoetidae, Polynemidae, Howellidae, Esocidae, Etmopteridae, Catostomidae, Blenniidae, Bothidae, Gonostomatidae, Cetorhinidae, Ostraciidae, Paralepididae, Osmeridae, Scaridae, Salmonidae, Tetraodontidae, Pleuronectidae, Cyprinidae, Achiridae, Cichlidae, Sciaenidae, Macrouridae, Alosidae, Stromateidae, Ophichthidae, Zoarcidae, Ammodytidae, Sebastidae, Acipenseridae, Polyodontidae, Cynoglossidae, Muraenidae, Petromyzontidae, Stichaeidae, Pholidae, Paralichthyidae, Gempylidae, Serrivomeridae, Bramidae, Cottidae, Sparidae, Urotrygonidae, Acanthuridae, Labridae, Chaetodontidae, Gasterosteidae, Myxinidae, Lutjanidae, Syngnathidae, Gadidae, Hemitripteridae, Cryptacanthodidae, Centrarchidae, Merlucciidae |
時間涵蓋範圍
起始日期 / 結束日期 | 1836-01-01 / 2019-07-23 |
---|
額外的詮釋資料
Metadata Document: Fish Collection Data Cleaning and Transformation Project Title: Data Cleaning and Standardization of Fish Collection Records Prepared By: Eniola Oni Last Updated: August 14, 2025 ________________________________________ 1. Description This dataset represents fish specimen collection records compiled from field surveys and curated for use in databases such as GBIF and OBIS. The original data was sourced from CSV and Excel files and processed using R for the purpose of cleaning, reformatting, standardizing taxonomic and temporal information, and preparing catalog numbers for integration into biodiversity repositories. ________________________________________ 2. File Descriptions File Name Description New_fish_database(Sheet1).csv Raw dataset initially imported and cleaned. Fish_Database_Masterb.csv Intermediate cleaned version with standardized column names. FISH_DATABASE_CORRECTED.xlsx Corrected version with collection and determined dates separated into day/month/year. FISH_DATABASE_FIXED.xlsx Final cleaned version with catalog numbers, genus-species combinations, and standardized month values. FISH_DATABASE_FIXEDdd.xlsx Exported version ready for integration or sharing. ________________________________________ 3. Software & Libraries Used • R version ≥ 4.0.0 • Libraries: readr, readxl, writexl, dplyr, tidyr, stringr ________________________________________ 4. Data Cleaning & Transformation Summary a. Column Renaming Standardized field names for clarity and consistency: • authority → authority_agent_first_name • date_collection → start_date • depth → Bottom_distance • gear → Collection_object_citation • End_date → determined_date_1 • Determined_by → determiner_first_name_1 • number → Count_amount • time → Start_time • notes → Remarks • collector → Collector_first_name b. Species Separation The binomial species name was split into: • genus • species Combined again for scientificName used in GBIF format: FISH_DATABASE_FIXED$names <- paste(FISH_DATABASE_FIXED$`genus 1`, FISH_DATABASE_FIXED$`species 1`) c. Date Handling • collection_date and determined_date fields were separated into day, month, and year. • Month names (e.g., "Jun", "Dec") were converted to numeric format using str_replace_all. d. Catalog Number Standardization Rules for catalog number formatting: • If year missing → prefix with 2000 • If number missing → suffix with F0000(0001,0002, etc) • Hyphens normalized with spaces: - → - • For catalog that specify considered invalid still, add 0 in front of F (e.g 2012 - F03023) e. NA Handling Empty cells in date components were replaced with blank strings to avoid issues with database ingestion: replace_na(list(determined_day = "", ...) ________________________________________ 5. Important Notes for Database Integration (e.g., Specify) Variable Notes Count_amount Text 6 under Collection Object Attribute gear Text 7 under Collection Object Attribute authority Authority First Name 1 depth Text 9 under Collection Object Attribute collection date Text 11 under Collection Object Attribute determined date Text 12 under Collection Object Attribute collection_day, collection_month, collection_year Number 1, 2, integer 1 under Collection Object determined_day, determined_month, determined_year Number 11, 12, 13 under Collection Object Property ________________________________________ 6. Data Dictionary Variable Unit Description verified Internal flag for data review check Internal flag for manual checks obis Yes/No Indicates if uploaded to OBIS num Index to help users follow catalog numbers cat_no Museum catalog/identification number order Taxonomic order family Taxonomic family species Species name (binomial) authority Name of the person who described the species locality Collection location (site name) latitude Degrees Geographic latitude longitude Degrees Geographic longitude date_collection Date When the specimen was collected collector Person who collected the specimen depth meters Collection depth in the water column gear Equipment used to collect the specimen determined Identifier of the specimen date_determined Date Date when specimen was identified number Count Number of specimens in the lot station number Rarely used, mostly irrelevant time HH:MM Time of specimen collection cruise Identifying term for sampling trip notes Miscellaneous notes ________________________________________ 7. GBIF PUBLISHING COLUMN TITLE Darwin core mapping Remarks Remarks Institution name Institution ID Count amount Individual count Gear Preparations Authority Identifiedby Species SpecificEpithet Collector Recorded by
替代的識別碼 | 10.5886/yhgsrk |
---|---|
3c326b89-69fb-442d-a4a1-1bd54d633cac | |
https://data.canadensys.net/ipt/resource?r=auw-ichthyology |