Using AutoML Toolkit’s FamilyRunner Pipeline APIs to Simplify and Automate Loan Default Predictions

Posted in Apache Spark, AutoML, data-science, Engineering Blog, Machine Learning, Pipeline API

Try this Loan Risk with AutoML Pipeline API Notebook in Databricks. Introduction: In the post Using AutoML Toolkit to Automate Loan Default Predictions, we showed how the Databricks Labs’ AutoML Toolkit simplified Machine Learning model feature engineering and model building optimization (MBO). It also improved the area-under-the-curve (AUC) from 0.6732 (handmade XGBoost model) […]

A Guide to Training Sessions at Spark + AI Summit, Europe

Posted in Apache Spark, Company Blog, Data and ML Industry Use Case, Data and ML Research, Data Engineering, Data Science, Data Science and Machine Learning, data-science, Delta Lake, Education, Events, Keras, MLflow, Productionizing Machine Learning, PyTorch, Spark + AI Summit, Spark SQL, Structured Streaming, TensorFlow, training

Education and the pursuit of knowledge are lifelong journeys: they are never complete. There is always something new to learn, a new professional certification to add to your credit, a knowledge gap to fill. Training at Spark + AI Summit, Europe is not only about becoming an Apache Spark expert. Nor is it only about being […]