Valohai blog

Insights from the deep learning industry.

All Posts

New Release: Managing Your Experiments Just Got Easier

Recreating experiments inside Valohai could be a whole lot easier and we’ve heard your cries!

With the latest release, live today, whenever you copy an old experiment and want to re-run it Valohai now copies the tags and title over from the previous experiment. Tags are also now automatically propagated down to individual executions when you create a task with several ones e.g. during a hyperparameter sweep.

We also improved the creation of new experiments. When creating a new execution you now have a dropdown for selecting your Docker image. We pre-fill the filterable box with our list of recommended Docker images but you can naturally point to any custom made Docker image as well. And naturally, if you have defined a default one in your valohai.yaml file that will be the default one selected from the get-go.

image-selection
We have also fixed several bugs and made a handful of smaller fixes in the UI and the API that you can read more about in the patch notes.

About Valohai
Valohai is a Deep Learning Management Platform that automates your deep learning infrastructure so you can concentrate on data science. Scale your model to hundreds of CPUs, GPUs and TPUs at the click of a button. Create an audit trail and reproduce any previous run with built-in version control for input data, hyperparameters, training algorithms and environments. Manage your entire ML pipeline with automatic coordination from feature extraction and training to inference.

Ruksi
Ruksi
Machine Learning Engineer

Related Posts

From Zero to Hero with Valohai CLI, Part 2

Part 2: Tips and tricks for running your deep learning executions on Valohai CLI Valohai executions can be triggered directly from the CLI and let you roll up your sleeves and fine-tune your options a bit more hands-on than our web-based UI. In part one, I showed you how to install and get started with Valohai’s command-line interface (CLI). Now, it’s time to take a deeper dive and power up with features that’ll take your daily productivity to new heights.

Machine learning infrastructure lessons from Netflix

Ville Tuulos, machine learning infrastructure architect, was the first to publicly dissect Netflix’s Machine Learning infrastructure at QCon in November 2018 in San Francisco. If you haven’t seen the talk yet, read the summary of his talk here! All the pictures used here, are from Ville's presentation. The full talk is 49 minutes long and you can watch it in its entirety on YouTube. From a scattered toolset to a coherent machine learning platform Ville starts by comparing Machine Learning Infrastructure to an online store and how building one was truly a technical problem twenty years ago. Back then you needed to build the whole online shop yourself starting from setting up the servers because the cloud did not exist. New platforms and technologies have since emerged that allow basically anyone to build up an online store and nowadays it is more about knowing the customers than setting up the webshop.

Building Machine Learning Infrastructure at Netflix

In our series of machine learning infrastructure blog posts, we recently featured Uber’s Michelangelo. Today we’re happy to be interviewing Ville Tuulos from Netflix. Ville is a machine learning infrastructure architect at Netflix’s Los Gatos, CA office.