Leveraging Existing Data To Penetrate Saturated Markets

Posted in Big Data, business, Business Intelligence, data, Data Science, market, Marketing, metrics, penetration, sales, SmartData Collective Exclusive

Ever since big data became a buzzword, companies have leveraged the data they’ve collected to improve market penetration. This strategy has worked for the better part of the last ten years. Unfortunately, traditional online and social marketing platforms have become so saturated that it’s nearly impossible to penetrate certain demographics at this point. […]

New Features in MLflow v0.6.0

Posted in Data Science, Engineering Blog, Machine Learning, MLflow, Model Management, Platform, Spark ML

Today, we’re excited to announce MLflow v0.6.0, released early in the week with new features. It is now available on PyPI and Maven, and the docs are updated. You can install the latest release with pip install mlflow, as described in the MLflow quickstart guide. MLflow v0.6.0 introduces a number of major features: A Java client API, available […]

MLflow On-Demand Webinar and FAQ Now Available!

Posted in Data Science, Deep Learning, Ecosystem, Engineering Blog, Machine Learning, MLflow, Model Management, Platform, Product, Unified Analytics Platform

On August 30th, our team hosted a live webinar—Introducing MLflow: Infrastructure for a complete Machine Learning lifecycle—with Matei Zaharia, Co-Founder and Chief Technologist at Databricks. In this webinar, we walked you through MLflow, a new open source project from Databricks that aims to design an open ML platform where organizations can use any ML library […]

A Guide to Apache Spark Use Cases, Streaming, and Research Talks at Spark + AI Summit Europe

Posted in Apache Spark, Company Blog, Data Science, Events, Genomics, Machine Learning, PySpark, Spark + AI Summit Europe, Structured Streaming, Unified Analytics Platform

For much of Apache Spark’s history, its capacity to process data at scale and its ability to unify disparate workloads have led Spark developers to tackle new use cases. Through innovation and extension of its ecosystem, developers combine data and AI to develop new applications. So it befits developers to come to this summit not just […]

The Low-Down On Using Data Science And Statistics In Advertising

Posted in advertising, Big Data, Data Science, Marketing, statistics

The use of data science and statistics in advertising has grown substantially in recent years, especially as innovations in the field of big data analytics make it easier than ever before to collect and use data on a large scale. Despite the increasing digitization of the field of advertising, however, many professionals don’t know where […]

Here Are The Top 4 Data Analytics Jobs To Look Out For

Posted in Big Data, big data scientists, careers, data analytics, data analytics jobs, Data Science, Jobs, SmartData Collective Exclusive

What is the job of a data engineer? How is it different from the job of a data analyst? What does a data scientist do? Confused? Let us sort the different aspects of data analytics jobs into buckets and make it all easier for you. If you are looking to migrate to data analytics […]

How to Use MLflow to Experiment a Keras Network Model: Binary Classification for Movie Reviews

Posted in Apache Spark, Data Science, Engineering Blog, Machine Learning, MLflow, Model Management, Platform, python, Unified Analytics Platform

In the last blog post, we demonstrated the ease with which you can get started with MLflow, an open-source platform to manage the machine learning lifecycle. In particular, we illustrated a simple Keras/TensorFlow model using MLflow and PyCharm. This time we explore a binary classification Keras network model. Using MLflow’s Tracking APIs, we will track metrics: accuracy […]