MLflow v0.7.0 Features New R API by RStudio

Posted in Announcements, Apache Spark, Company Blog, Deep Learning, Ecosystem, Education, Engineering Blog, GPyOpt, Hyperopt, Java, Keras, Machine Learning, MLflow, multistep workflow, Partners, python, R, RStudio

Today, we’re excited to announce MLflow v0.7.0, released with new features, including a new MLflow R client API contributed by RStudio. A testament to MLflow’s design goal of an open platform with adoption in the community, RStudio’s contribution extends the MLflow platform to a larger R community of data scientists who use RStudio and R […]

100x Faster Bridge between Apache Spark and R with User-Defined Functions on Databricks

Posted in Apache Spark, Databricks, Engineering Blog, Machine Learning, R, SparkR, Unified Analytics Platform

The SparkR User-Defined Function (UDF) API opens up opportunities for big data workloads running on Apache Spark to embrace R’s rich package ecosystem. Some of our customers that have R experts on board use the SparkR UDF API to blend R’s sophisticated packages into their ETL pipeline, applying transformations that go beyond Spark’s built-in functions on the […]

rquery: Practical Big Data Transforms for R-Spark Users

Posted in Apache Spark, Big Data, Data Science, Engineering Blog, Machine Learning, R, Spark SQL, SparkR

This is a guest community blog from Nina Zumel and John Mount, data scientists and consultants at Win-Vector. They share how to use rquery with Apache Spark on Databricks. Introduction: In this blog, we will introduce rquery, a powerful query tool that allows R users to implement powerful data transformations […]

Data Science Survey: The Results Are In!

Posted in Announcements, Data Science, data science survey, emerging technologies, python, R

Last week we ran a Data Science survey asking our community four simple questions. In this post, I’ll show you the results of our survey and provide you with a Jupyter notebook, just in case you want to play with the data yourself. Disclaimer: 2,233 people participated in the survey. This is a statistically significant […]