Engineering population scale Genome-Wide Association Studies with Apache Spark, Delta Lake, and MLflow

Posted in AI, Apache Spark, Company Blog, Customers, Data and ML Industry Use Case, Data Engineering, Data Science and Machine Learning, Delta Lake, Education, Engineering Blog, genome sequencing, GWAS, Managed MLflow, MLflow

Try this notebook series in Databricks.

The advent of genome-wide association studies (GWAS) in the late 2000s enabled scientists to begin to understand the causes of complex diseases such as diabetes and Crohn’s disease at their most fundamental level. However, academic bioinformatics tools to perform GWAS have not kept pace with the growth of genomic […]