6 Data And Analytics Trends To Prepare For In 2020

Posted Leave a commentPosted in #GDPR, Analytics, analytics trends, Big Data, Business Intelligence, Cloud Computing, data analytics trends, Data Science, data trends, Machine Learning, machine learning skills, Predictive Analytics, SmartData Collective Exclusive

We’re well past the point of realization that big data and advanced analytics solutions are valuable — just about everyone knows this by now. In fact, there’s no escaping the increasing reliance on such technologies. Big data alone has become a modern staple of nearly every industry from retail to manufacturing, and for good reason. […]

6 Ways Artificial Intelligence is Transforming Marketing Forever

Posted Leave a commentPosted in AI, Artificial Intelligence, artificial intelligence in marketing, Machine Learning, Marketing, SmartData Collective Exclusive

If you’re a marketer or business owner in today’s competitive marketplace, you’ve probably tried just about everything you can think of to maximize your success. You’ve dabbled in digital marketing, visited trade shows, paid for print advertising, and incentivized customer testimonials. It’s probably resulted in lots of stress, sleepless nights, and CBD oil drops to […]

Detecting Financial Fraud at Scale with Decision Trees and MLflow on  Databricks

Posted Leave a commentPosted in Apache Spark, Company Blog, Decision tree, Education, Engineering Blog, financial, Financial Markets, Financial Services, Fraud, Fraud Detection, Machine Leanring, Machine Learning, Platform

Try this notebook in Databricks Detecting fraudulent patterns at scale is a challenge, no matter the use case. The massive amounts of data to sift through, the complexity of the constantly evolving techniques, and the very small number of actual examples of fraudulent behavior are comparable to finding a needle in a haystack while not […]

Understanding Dynamic Time Warping – The Databricks Blog

Posted Leave a commentPosted in Apache Spark, Company Blog, Dynamic Time Warping, Education, Engineering Blog, Machine Learning, Platform

Try this notebook in Databricks This blog is part 1 of our two-part series Using Dynamic Time Warping and MLflow to Detect Sales Trends. To go to part 2, go to Using Dynamic Time Warping and MLflow to Detect Sales Trends. The phrase “dynamic time warping,” at first read, might evoke images of Marty McFly […]

Using Dynamic Time Warping and MLflow to Detect Sales Trends

Posted Leave a commentPosted in Apache Spark, Company Blog, Dynamic Time Warping, Education, Engineering Blog, Machine Learning, MLflow, Platform

Try this notebook series in Databricks This blog is part 2 of our two-part series Using Dynamic Time Warping and MLflow to Detect Sales Trends.  The phrase “dynamic time warping,” at first read, might evoke images of Marty McFly driving his DeLorean at 88 MPH in the Back to the Future series. Alas, dynamic time warping does […]

Introducing MLflow Run Sidebar in Databricks Notebooks

Posted Leave a commentPosted in Announcements, Company Blog, Engineering Blog, Machine Learning, Managed MLflow, MLflow, Platform, Sidebar

At Spark+AI Summit 2019, we announced the GA of Managed MLflow on Databricks in which we take the latest and greatest of open source MLflow and make it easily accessible to all users of Databricks. In that blog post, we promised to build features which bridge Databricks and MLflow concepts to create a seamless integration […]

Announcing General Availability of Managed MLflow on Databricks

Posted Leave a commentPosted in Announcements, Company Blog, Ecosystem, Engineering Blog, Machine Learning, Managed MLflow, MLflow, Platform, Product

Try this tutorial in Databricks MLflow is an open source platform to help manage the complete machine learning lifecycle. With MLflow, data scientists can track and share experiments locally or in the cloud, package and share models across frameworks, and deploy models virtually anywhere. Today at the Spark + AI Summit, we announced the General […]

Koalas: Easy Transition from pandas to Apache Spark

Posted Leave a commentPosted in Announcements, Apache Spark, Company Blog, Data Science, Ecosystem, Education, Engineering Blog, Machine Learning, Open Source, Pandas, python

Today at Spark + AI Summit, we announced Koalas, a new open source project that augments PySpark’s DataFrame API to make it compatible with pandas. Python data science has exploded over the past few years and pandas has emerged as the lynchpin of the ecosystem. When data scientists get their hands on a data set, […]

A Guide to MLflow Talks at Spark + AI Summit 2019

Posted Leave a commentPosted in Company Blog, Events, Machine Learning, MLflow, Open Source, Product, Spark + AI Summit

In less than a year, MLflow has reached almost 500K monthly downloads, and gathered over 80 code contributors and 40 contributing organizations, confirming the need for an open source approach to help standardize the machine learning lifecycle across tools, teams, and processes. We are thrilled to host some of our key contributors and customers next […]