TY  - JOUR
AU  - Navarro, Fábio C. P.
AU  - Mohsen, Hussein
AU  - Yan, Chengfei
AU  - Li, Shantao
AU  - Gu, Mengting
AU  - Meyerson, William
AU  - Gerstein, Mark
PY  - 2019
DA  - 2019/05/29
TI  - Genomics and data science: an application within an umbrella
JO  - Genome Biology
SP  - 109
VL  - 20
IS  - 1
AB  - Data science allows the extraction of practical insights from large-scale data. Here, we contextualize it as an umbrella term, encompassing several disparate subdomains. We focus on how genomics fits as a specific application subdomain, in terms of well-known 3 V data and 4 M process frameworks (volume-velocity-variety and measurement-mining-modeling-manipulation, respectively). We further analyze the technical and cultural “exports” and “imports” between genomics and other data-science subdomains (e.g., astronomy). Finally, we discuss how data value, privacy, and ownership are pressing issues for data science applications, in general, and are especially relevant to genomics, due to the persistent nature of DNA.
SN  - 1474-760X
UR  - https://doi.org/10.1186/s13059-019-1724-1
DO  - 10.1186/s13059-019-1724-1
ID  - Navarro2019
ER  - 