Building The Open Data Ecosystem For Music And More At Metabrainz

00:00:00
/
00:48:06

August 17th, 2020

48 mins 6 secs

Your Hosts

About this Episode

Summary

The Musicbrainz project was an early entry in the movement to build an open data ecosystem. In recent years, the Metabrainz Foundation has fostered a growing ecosystem of projects to support the contribution of, and access to, metadata, listening habits, and review of music. The majority of those projects are written in Python, and in this episode Param Singh explains how they are built, how they fit together, and how they support the goals of the Metabrains Foundation. This was an interesting exporation of the work involved in building an ecosystem of open data, the challenges of making it sustainable, and the benefits of building for the long term rather than trying to achieve a quick win.

Announcements

  • Hello and welcome to Podcast.__init__, the podcast about Python and the people who make it great.
  • When you’re ready to launch your next app or want to try a project you hear about on the show, you’ll need somewhere to deploy it, so take a look at our friends over at Linode. With the launch of their managed Kubernetes platform it’s easy to get started with the next generation of deployment and scaling, powered by the battle tested Linode platform, including simple pricing, node balancers, 40Gbit networking, dedicated CPU and GPU instances, and worldwide data centers. Go to pythonpodcast.com/linode and get a $60 credit to try out a Kubernetes cluster of your own. And don’t forget to thank them for their continued support of this show!
  • Before you put your code into production you need to make sure that it passes all of the tests, that it has been packaged with all of the dependencies, and that you haven’t introduced any security issues. Instead of running all of that on your laptop, let Codefresh handle it automatically with their continuous integration and continuous delivery platform. Built for the modern era of cloud-native computing, they make publishing to Kubernetes, serverless platforms, and virtual machines fast and seamless. With a growing library of pre-made steps, a flexible pipeline definition, and unlimited scale Codefresh lets you ship faster and safer than ever. Go to pythonpodcast.com/codefresh today to get unlimited builds on your free account.
  • You listen to this show to learn and stay up to date with the ways that Python is being used, including the latest in machine learning and data analysis. For more opportunities to stay up to date, gain new skills, and learn from your peers there are a growing number of virtual events that you can attend from the comfort and safety of your home. Go to pythonpodcast.com/conferences to check out the upcoming events being offered by our partners and get registered today!
  • Your host as usual is Tobias Macey and today I’m interviewing Param Singh about the ways that Python is being used across the various Metabrainz projects

Interview

  • Introductions
  • How did you get introduced to Python?
  • Can you start by giving an overview of what the Metabrainz organization is and the various projects that it encompasses?
    • What are the motivations for creating those projects and some of the origin story for Metabrainz?
  • The Musicbrainz server is the longest running project and is written in Perl. What was the reason for switching to Python for all of the other *brainz projects?
  • How does the MetaBrainz Foundation sustain itself? Where do the funds come from?
    • How do you determine where and how to allocate the funding that you receive?
  • Which of the *brainz projects is the most complex or challenging to build, whether due to technical or sociological reasons?
  • How do you source and manage the information that powers all of the Metabrainz projects?
  • How is development of the various projects organized?
    • How does that influence the amount of code sharing that is possible between them?
  • Of the projects that you have been involved in, how are they architected?
    • What are the main ways that the projects differ in how they are implemented?
  • What are some of the ways that you are using Python in support of the various projects that you work on?
  • What are some of the most interesting, innovative, or unexpected ways that you have seen the projects or data built by Metabrainz being used?
  • What are some of the most interesting, unexpected, or challenging lessons that you have learned while working as a contributor and maintainer of the Metabrainz projects?
  • What is in store for the future of the existing Metabrainz projects?
  • What are the next domains that are being considered for building a Metabrainz platform for?

Keep In Touch

Picks

Links

The intro and outro music is from Requiem for a Fish The Freak Fandango Orchestra / CC BY-SA