Productionizing Machine Learning: From Deployment to Drift Detection

Posted Leave a commentPosted in AI, Company Blog, Data and ML Industry Use Case, Data Science and Machine Learning, Machine Learning, Machine Learning Life Cycle, MLflow, Model Drift, Product, Tutorials

Try this notebook to reproduce the steps outlined below and watch our on-demand webinar to learn more. In many literature and blogs, a machine learning workflow starts with data prep and ends with deploying a model to production. But in reality, that’s just the beginning of the lifecycle of a machine learning model. As they say, […]

Guest Blog: How Virgin Hyperloop One reduced processing time from hours to minutes with Koalas

Posted Leave a commentPosted in Apache Spark, Company Blog, Customers, Data Science, Data Science and Machine Learning, Developer, Ecosystem, Engineering Blog, Koalas, Machine Learning, Pandas, PySpark, python, Tutorials

At Virgin Hyperloop One, we work on making Hyperloop a reality, so we can move passengers and cargo at airline speeds but at a fraction of the cost of air travel. In order to build a commercially viable system, we collect and analyze a large, diverse quantity of data, including Devloop Test Track runs, numerous […]

What is Flafka? How to use it with Flume for data ingestion [Tutorial]

Posted Leave a commentPosted in Data Science, Tutorials

Apache Kafka is an open-source distributed stream-processing queuing platform, written in Scala and Java. Apache Kafka is used to publishing and subscribe messages in sequential order in the queue. Since Kafka is a fast, scalable, durable, and fault-tolerant publish-subscribe messaging system with higher throughput, reliability and replication characteristics. In the Apache Kafka Distributed Platform, the […]