What’s new with MLflow? On-Demand Webinar and FAQs now available!

Posted Leave a commentPosted in Data Science, Engineering Blog, Machine Learning, Managed MLflow, MLflow, Model Management, Open Source

On June 6th, our team hosted a live webinar—Managing the Complete Machine Learning Lifecycle: What’s new with MLflow—with Clemens Mewald, Director of Product Management at Databricks. Machine learning development brings many new complexities beyond the traditional software development lifecycle. Unlike in traditional software development, ML developers want to try multiple algorithms, tools and parameters to […]

Scikit-Learn For Machine Learning Application Development In Python

Posted Leave a commentPosted in app development, Machine Learning, Open Source, scikit-learn, SmartData Collective Exclusive, Software

Python is arguably the best programming language for machine learning. However, many aspiring machine learning developers don’t know where to start. They should look into the scikit-learn library, which is one of the best for developing machine learning applications. It is free and relatively easy to install and learn. Why machine learning programmers should be […]

Machine Learning Pioneers a New Generation of Technical Writing Solutions

Posted Leave a commentPosted in Artificial Intelligence, Machine Learning

  My colleagues and I at Smart Data Collective have written extensively about the benefits of big data in fields like marketing, hospitality and cybersecurity.  We sometimes realize that we need to discuss the implications of big data for other fields as well.  Technical writing is one field that is highly affected by advances in […]

Detecting Bias with SHAP – The Databricks Blog

Posted Leave a commentPosted in Apache Spark, Bias, Deep Learning, Education, Engineering Blog, Machine Learning, MLflow, SHAP, Stack Overflow

StackOverflow’s annual developer survey concluded earlier this year, and they have graciously published the (anonymized) 2019 results for analysis. They’re a rich view into the experience of software developers around the world — what’s their favorite editor? how many years of experience? tabs or spaces? and crucially, salary. Software engineers’ salaries are good, and sometimes […]

Machine Learning Delivers Cutting-Edge POS Software For Online Stores

Posted Leave a commentPosted in Machine Learning, machine learning software, POS, SmartData Collective Exclusive, Software

Online retailers are using machine learning solutions to deliver the highest level of service to their customers. They have found that big data makes it easier to personalize services and offer the highest value for the lowest cost. One of the best ways machine learning is helping online retailers is with new POS software applications. […]

A Guide To Machine Learning Foundations of Task Management Software

Posted Leave a commentPosted in Machine Learning, SmartData Collective Exclusive, Software, task management, task management software

Task management applications are changing the way we manage teams. Here are some of the primary benefits of these task management applications: Task management tools improve team productivity Task management tools make sure that teams operate more efficiently Task management tools minimize worker stress Task management tools help with monitoring trends Machine learning is playing […]

Hyperparameter Tuning with MLflow, Apache Spark MLlib and Hyperopt

Posted Leave a commentPosted in Apache Spark, AutoML, Data Science, Databricks Runtime 5.4 ML, Deep Learning, Ecosystem, Engineering Blog, Hyperopt, Hyperparameter Tuning, Machine Learning, MLflow, MLlib

Hyperparameter tuning is a common technique to optimize machine learning models based on hyperparameters, or configurations that are not learned during model training.  Tuning these configurations can dramatically improve model performance. However, hyperparameter tuning can be computationally expensive, slow, and unintuitive even for experts. Databricks Runtime 5.4 and 5.4 ML (Azure | AWS) introduce new […]

Announcing the MLflow 1.0 Release

Posted Leave a commentPosted in Announcements, Company Blog, Data Science, Ecosystem, Engineering Blog, Lifecycle, Machine Learning, MLflow, Model Management, Product

MLflow is an open source platform to help manage the complete machine learning lifecycle. With MLflow, data scientists can track and share experiments locally (on a laptop) or remotely (in the cloud), package and share models across frameworks, and deploy models virtually anywhere. Today we are excited to announce the release of MLflow 1.0. Since […]

Enhanced Hyperparameter Tuning and Optimized AWS Storage with Databricks Runtime 5.4 ML

Posted Leave a commentPosted in Announcements, AutoML, Company Blog, Data Science, Databricks Runtime 5.4 ML, Deep Learning, Ecosystem, Engineering Blog, Hyperopt, Hyperparameter Tuning, Machine Learning, MLflow, MLlib, Platform, Product

We are excited to announce the release of Databricks Runtime 5.4 ML (Azure | AWS). This release includes two Public Preview features to improve data science productivity, optimized storage in AWS for developing distributed applications, and a number of Python library upgrades. To get started, you simply select the Databricks Runtime 5.4 ML from the […]

Introducing Databricks Runtime 5.4 with Conda (Beta)

Posted Leave a commentPosted in Announcements, Company Blog, Data Science, Databricks Runtime, Deep Learning, Ecosystem, Engineering Blog, Machine Learning, Product

We are excited to introduce a new runtime: Databricks Runtime 5.4 with Conda (Beta). This runtime uses Conda to manage Python libraries and environments. Many of our Python users prefer to manage their Python environments and libraries with Conda, which quickly is emerging as a standard. Conda takes a holistic approach to package management by […]