Machine Learning

Machine Learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
198 episodes

Practical AI #217

Accelerated data science with a Kaggle grandmaster

2023-04-04T20:00:00Z #ai +3 🎧 26,706

Daniel and Chris explore the intersection of Kaggle and real-world data science in this illuminating conversation with Christof Henkel, Senior Deep Learning Data Scientist at NVIDIA and Kaggle Grandmaster. Christof offers a lucid explanation of how participation in Kaggle can positively impact a data scientist’s skills and career aspirations. He also shares some of his insights and his approach to maximizing AI productivity using GPU-accelerated tools like RAPIDS and DALI.

Practical AI #215

AI search at You.com

2023-03-15T19:15:00Z #ai +1 🎧 25,546

Neural search and chat-based search are all the rage right now. However, You.com has been innovating in these areas since long before ChatGPT. In this episode, Bryan McCann from You.com shares insights related to our mental model of Large Language Model (LLM) interactions and practical tips for integrating LLMs into production systems.

Practical AI #214

End-to-end cloud compute for AI/ML

2023-03-07T20:00:00Z #ai +2 🎧 24,172

We’ve all experienced pain moving from local development, to testing, and then on to production. This cycle can be long and tedious, especially as AI models and datasets are integrated. Modal is trying to make this loop of development as seamless as possible for AI practitioners, and their platform is pretty incredible!

Erik from Modal joins us in this episode to help us understand how we can run or deploy machine learning models, massively parallel compute jobs, task queues, web apps, and much more, without our own infrastructure.

Practical AI #212

Applied NLP solutions & AI education

2023-02-22T15:15:00Z #ai +2 🎧 24,263

We’re super excited to welcome Jay Alammar to the show. Jay is a well-known AI educator, applied NLP practitioner at co:here, and author of the popular blog, “The Illustrated Transformer.” In this episode, he shares his ideas on creating applied NLP solutions, working with large language models, and creating educational resources for state-of-the-art AI.

Practical AI #211

Serverless GPUs

2023-02-14T21:30:00Z #ai +2 🎧 23,655

We’ve been hearing about “serverless” CPUs for some time, but it’s taken a while to get to serverless GPUs. In this episode, Erik from Banana explains why it’s taken so long, and he helps us understand how these new workflows are unlocking state-of-the-art AI for application developers. Forget about servers, but don’t forget to listen to this one!

Practical AI #210

MLOps is alive and well

2023-02-07T21:00:00Z #ai +2 🎧 22,879

Worlds are colliding! This week we join forces with the hosts of the MLOps.Community podcast to discuss all things machine learning operations. We talk about how the recent explosion of foundation models and generative models is influencing the world of MLOps, and we discuss related tooling, workflows, perceptions, etc.

Practical AI #207

Machine learning at small organizations

2023-01-17T20:15:00Z #ai +1 🎧 26,680

Why is ML so poorly adopted in small organizations (hint: it’s not because they don’t have enough data)? In this episode, Kirsten Lum from Storytellers shares the patterns she has seen in small orgs that lead to a successful ML practice. We discuss how the job of an ML Engineer/Data Scientist is different in that environment and how end-to-end project management is key to adoption.

Practical AI #205

NLP research by & for local communities

2023-01-03T20:15:00Z #ai +2 🎧 19,811

While at EMNLP 2022, Daniel got a chance to sit down with an amazing group of researchers creating NLP technology that actually works for their local language communities. Just Zwennicker (Universiteit van Amsterdam) discusses his work on a machine translation system for Sranan Tongo, a creole language that is spoken in Suriname. Andiswa Bukula (SADiLaR), Rooweither Mabuya (SADiLaR), and Bonaventure Dossou (Lanfrica, Mila) discuss their work with Masakhane to strengthen and spur NLP research in African languages, for Africans, by Africans.

The group emphasized the need for more linguistically diverse NLP systems that work in scenarios of data scarcity, non-Latin scripts, rich morphology, etc. You don’t want to miss this one!

Practical AI #203

AI competitions & cloud resources

2022-12-07T21:00:00Z #ai +2 🎧 20,437

In this special episode, we interview some of the sponsors and teams from a recent case competition organized by Purdue University, Microsoft, INFORMS, and SIL International. 170+ teams from across the US and Canada participated in the competition, which challenged students to create AI-driven systems to caption images in three languages (Thai, Kyrgyz, and Hausa).

Practical AI #202

Copilot lawsuits & Galactica "science"

2022-11-29T20:45:00Z #fully-connected +2 🎧 19,861

There are some big AI-related controversies swirling, and it’s time we talk about them. A lawsuit has been filed against GitHub, Microsoft, and OpenAI related to Copilot code suggestions, and many people have been disturbed by the output of Meta AI’s Galactica model. Does Copilot violate open source licenses? Does Galactica output dangerous science-related content? In this episode, we dive into the controversies and risks, and we discuss the benefits of these technologies.

Practical AI #201

Protecting us with the Database of Evil

2022-11-16T17:20:00Z #ai +3 🎧 20,500

Online platforms and their users are susceptible to a barrage of threats, from disinformation to extremism to terror. Daniel and Chris chat with Matar Haller, VP of Data at ActiveFence, a leader in identifying online harm. ActiveFence uses a combination of AI technology and leading subject matter experts to provide Trust & Safety teams with precise, real-time data, in-depth intelligence, and automated tools to protect users and ensure safe online experiences.

Practical AI #196

What's up, DocQuery?

2022-10-12T15:00:00Z #ai +3 🎧 19,411

Chris sits down with Ankur Goyal to talk about DocQuery, Impira’s new open source ML model. DocQuery lets you ask questions about semi-structured data (like invoices) and unstructured documents (like contracts) using Large Language Models (LLMs). Ankur illustrates many of the ways DocQuery can help people tame documents, and references Chris’s real-life tasks as a non-profit director to demonstrate that DocQuery is indeed practical AI.

Practical AI #195

Production data labeling workflows

2022-09-27T19:40:00Z #ai +2 🎧 21,007

It’s one thing to gather some labels for your data. It’s another thing to integrate data labeling into your workflows and infrastructure in a scalable, secure, and useful way. Mark from Xelex joins us to talk through some of what he has learned after helping companies scale their data annotation efforts. We get into workflow management, labeling instructions, team dynamics, and quality assessment. This is a super practical episode!

Practical AI #194

Evaluating models without test data

2022-09-20T19:20:00Z #ai +2 🎧 20,078

WeightWatcher, created by Charles Martin, is an open source diagnostic tool for analyzing Neural Networks without training or even test data! Charles joins us in this episode to discuss the tool and how it fills certain gaps in current model evaluation workflows. Along the way, we discuss statistical methods from physics and a variety of practical ways to modify your training runs.

Changelog Interviews #506

Stable Diffusion breaks the internet

2022-09-16T20:45:00Z #oss +2 🎧 36,359

This week on The Changelog we’re talking about Stable Diffusion, DALL-E, and the impact of AI generated art. We invited our good friend Simon Willison on the show today because he wrote a very thorough blog post titled, “Stable Diffusion is a really big deal.”

You may know Simon from his extensive contributions to open source software. Simon is a co-creator of the Django Web framework (which we don’t talk about at all on this show), and he’s the creator of Datasette, a multi-tool for exploring and publishing data (which we do talk about on this show). Most of all, Simon is a very insightful thinker, which he puts on display in this episode. We cover the topic from all angles: the technical, the innovation, the future and possibilities, the ethical and the moral. We get into it all. The question is, will this era be known as the initial push back to the machine?

Practical AI #193

Stable Diffusion

2022-09-13T22:20:00Z #ai +1 🎧 22,975

The new Stable Diffusion model is everywhere! Of course you can use this model to quickly and easily create amazing, dream-like images to post on Twitter, Reddit, Discord, etc., but this technology is also poised to be used in very pragmatic ways across industry. In this episode, Chris and Daniel take a deep dive into all things Stable Diffusion. They discuss the motivations for the work, the model architecture, and the differences between this model and other related releases (e.g., DALL·E 2).

(Image from stability.ai)

Practical AI #192

Licensing & automating creativity

2022-09-06T21:45:00Z #fully-connected +2 🎧 18,253

AI is increasingly being applied in creative and artistic ways, especially with recent tools integrating models like Stable Diffusion. This is making some artists mad. How should we be thinking about these trends more generally, and how can we as practitioners release and license models anticipating human impacts? We explore this along with other topics (like AI models detecting swimming pools 😊) in this fully connected episode.

Practical AI #191

Privacy in the age of AI

2022-08-30T19:20:00Z #privacy +4 🎧 18,858

In this Fully-Connected episode, Daniel and Chris discuss concerns of privacy in the face of ever-improving AI / ML technologies. Evaluating AI’s impact on privacy from various angles, they note that ethical AI practitioners and data scientists have an enormous burden, given that much of the general population may not understand the implications of the data privacy decisions of everyday life.

This intentionally thought-provoking conversation advocates consideration and action from each listener when it comes to evaluating how their own activities either protect or violate the privacy of those whom they impact.

Practical AI #190

Practical, positive uses for deep fakes

2022-08-24T15:00:00Z #ai +2 🎧 18,351

Differentiating between what is real and what is fake on the internet can be challenging. Historically, AI deepfakes have only added to the confusion and chaos, but when labeled and intended for good, deepfakes can be extremely helpful. With all of the misinformation surrounding deepfakes, though, it can be hard to see the benefits they bring. Lior Hakim, CTO at Hour One, joins Chris and Daniel to shed some light on the practical uses of deepfakes. He addresses the AI technology behind deepfakes, describes positive uses such as breaking down communication barriers, and shares how Hour One specializes in the development of virtual humans for use in professional video communications.

Practical AI #188

AlphaFold is revolutionizing biology

2022-08-09T19:45:00Z #ai +2 🎧 19,570

AlphaFold is an AI system developed by DeepMind that predicts a protein’s 3D structure from its amino acid sequence. It regularly achieves accuracy competitive with experiment, and is accelerating research in nearly every field of biology. Daniel and Chris delve into protein folding, and explore the implications of this revolutionary and hugely impactful application of AI.

Practical AI #187

AI IRL & Mozilla's Internet Health Report

2022-08-02T20:30:00Z #ai +3 🎧 18,183

Every year Mozilla releases an Internet Health Report that combines research and stories exploring what it means for the internet to be healthy. This year’s report is focused on AI. In this episode, Solana and Bridget from Mozilla join us to discuss the power dynamics of AI and the current state of AI worldwide. They highlight concerning trends in the application of this transformational technology along with positive signs of change.
