Simplifying Genomics Pipelines at Scale with Databricks Delta

Posted in Apache Spark, data pipeline, Engineering Blog, Genomics, HLS, Machine Learning, Streaming

This is the first post in our “Genomics Analysis at Scale” series. In this series, we will demonstrate how the Databricks UAP4Genomics enables customers to analyze population-scale genomic data. Starting from the output of our genomics pipeline, this series will provide a tutorial on using Databricks to run sample […]

Loan Risk Analysis with XGBoost and Databricks Runtime for Machine Learning

Posted in Apache Spark, Company Blog, data pipeline, Data Visualization, Ecosystem, Education, Engineering Blog, financial, Machine Learning, MLlib, Platform, Product, XGBoost

For companies that make money off of interest on loans held by their customers, it’s always about increasing the bottom line. Being able to assess the risk of loan applications can save a lender the cost of holding too many risky assets. It is the data scientist’s job to […]