Linear Digressions

Channel Details

Linear Digressions

Linear Digressions

Creator: Ben Jaffe and Katie Malone

Podcast by Ben Jaffe and Katie Malone

EN United States Technology

Recent Episodes

291 episodes
So long, and thanks for all the fish

So long, and thanks for all the fish

All good things must come to an end, including this podcast. This is the last episode we plan to release, and it doesn’t cover data science—it’s mostl...

2020-07-26 10:32:44 00:35:44
A Reality Check on AI-Driven Medical Assistants

A Reality Check on AI-Driven Medical Assistants

The data science and artificial intelligence community has made amazing strides in the past few years to algorithmically automate portions of the heal...

2020-07-19 10:51:31 00:14:00
A Data Science Take on Open Policing Data

A Data Science Take on Open Policing Data

A few weeks ago, we put out a call for data scientists interested in issues of race and racism, or people studying how those topics can be studied wit...

2020-07-12 13:02:39 00:23:44
Procella: YouTube's super-system for analytics data storage

Procella: YouTube's super-system for analytics data storage

This is a re-release of an episode that originally ran in October 2019.

If you’re trying to manage a project that serves up analytics dat...

2020-07-05 13:29:24 00:29:48
The Data Science Open Source Ecosystem

The Data Science Open Source Ecosystem

Open source software is ubiquitous throughout data science, and enables the work of nearly every data scientist in some way or another. Open source pr...

2020-06-28 13:34:48 00:23:06
Rock the ROC Curve

Rock the ROC Curve

This is a re-release of an episode that first ran on January 29, 2017.

This week: everybody's favorite WWII-era classifier metric! But...

2020-06-21 10:34:29 00:15:52
Criminology and Data Science

Criminology and Data Science

This episode features Zach Drake, a working data scientist and PhD candidate in the Criminology, Law and Society program at George Mason University. Z...

2020-06-14 12:26:26 00:30:57
Racism, the criminal justice system, and data science

Racism, the criminal justice system, and data science

As protests sweep across the United States in the wake of the killing of George Floyd by a Minneapolis police officer, we take a moment to dig into on...

2020-06-07 10:33:53 00:31:36
An interstitial word from Ben

An interstitial word from Ben

A message from Ben around algorithmic bias, and how our models are sometimes reflections of ourselves.

2020-06-04 12:38:43 00:05:59
Convolutional Neural Networks

Convolutional Neural Networks

This is a re-release of an episode that originally aired on April 1, 2018

If you've done image recognition or computer vision tasks with...

2020-05-31 08:46:31 00:21:55
Stein's Paradox

Stein's Paradox

This is a re-release of an episode that was originally released on February 26, 2017.

When you're estimating something about some object...

2020-05-24 09:21:28 00:27:02
Protecting Individual-Level Census Data with Differential Privacy

Protecting Individual-Level Census Data with Differential Privacy

The power of finely-grained, individual-level data comes with a drawback: it compromises the privacy of potentially anyone and everyone in the dataset...

2020-05-17 12:49:22 00:21:19
Causal Trees

Causal Trees

What do you get when you combine the causal inference needs of econometrics with the data-driven methodology of machine learning? Usually these two do...

2020-05-10 12:34:33 00:15:27
The Grammar Of Graphics

The Grammar Of Graphics

You may not realize it consciously, but beautiful visualizations have rules. The rules are often implict and manifest themselves as expectations about...

2020-05-03 12:12:53 00:35:38
Gaussian Processes

Gaussian Processes

It’s pretty common to fit a function to a dataset when you’re a data scientist. But in many cases, it’s not clear what kind of function might be most...

2020-04-26 12:33:43 00:20:55
Keeping ourselves honest when we work with observational healthcare data

Keeping ourselves honest when we work with observational healthcare data

The abundance of data in healthcare, and the value we could capture from structuring and analyzing that data, is a huge opportunity. It also presents...

2020-04-19 13:43:37 00:19:08
Changing our formulation of AI to avoid runaway risks: Interview with Prof. Stuart Russell

Changing our formulation of AI to avoid runaway risks: Interview with Prof. Stuart Russell

AI is evolving incredibly quickly, and thinking now about where it might go next (and how we as a species and a society should be prepared) is critica...

2020-04-12 12:55:01 00:28:58
Putting machine learning into a database

Putting machine learning into a database

Most data scientists bounce back and forth regularly between doing analysis in databases using SQL and building and deploying machine learning pipelin...

2020-04-05 12:51:56 00:24:22
The work-from-home episode

The work-from-home episode

Many of us have the privilege of working from home right now, in an effort to keep ourselves and our family safe and slow the transmission of covid-19...

2020-03-29 09:23:42 00:29:06
Understanding Covid-19 transmission: what the data suggests about how the disease spreads

Understanding Covid-19 transmission: what the data suggests about how the disease spreads

Covid-19 is turning the world upside down right now. One thing that’s extremely important to understand, in order to fight it as effectively as possib...

2020-03-22 12:03:34 00:25:25
Network effects re-release: when the power of a public health measure lies in widespread adoption

Network effects re-release: when the power of a public health measure lies in widespread adoption

This week’s episode is a re-release of a recent episode, which we don’t usually do but it seems important for understanding what we can all do to slow...

2020-03-15 09:43:38 00:26:40
Causal inference when you can't experiment: difference-in-differences and synthetic controls

Causal inference when you can't experiment: difference-in-differences and synthetic controls

When you need to untangle cause and effect, but you can’t run an experiment, it’s time to get creative. This episode covers difference in differences...

2020-03-08 12:39:19 00:20:48
Better know a distribution: the Poisson distribution

Better know a distribution: the Poisson distribution

This is a re-release of an episode that originally ran on October 21, 2018.

The Poisson distribution is a probability distribution functi...

2020-03-01 12:55:28 00:31:51
The Lottery Ticket Hypothesis

The Lottery Ticket Hypothesis

Recent research into neural networks reveals that sometimes, not all parts of the neural net are equally responsible for the performance of the netwo...

2020-02-23 09:03:25 00:19:45
Interesting technical issues prompted by GDPR and data privacy concerns

Interesting technical issues prompted by GDPR and data privacy concerns

Data privacy is a huge issue right now, after years of consumers and users gaining awareness of just how much of their personal data is out there and...

2020-02-16 11:50:20 00:20:26
Thinking of data science initiatives as innovation initiatives

Thinking of data science initiatives as innovation initiatives

Put yourself in the shoes of an executive at a big legacy company for a moment, operating in virtually any market vertical: you’re constantly hearing...

2020-02-09 11:10:21 00:17:27
Building a curriculum for educating data scientists: Interview with Prof. Xiao-Li Meng

Building a curriculum for educating data scientists: Interview with Prof. Xiao-Li Meng

As demand for data scientists grows, and it remains as relevant as ever that practicing data scientists have a solid methodological and technical foun...

2020-02-02 09:36:23 00:31:36
Running experiments when there are network effects

Running experiments when there are network effects

Traditional A/B tests assume that whether or not one person got a treatment has no effect on the experiment outcome for another person. But that’s not...

2020-01-26 10:13:52 00:24:45
Zeroing in on what makes adversarial examples possible

Zeroing in on what makes adversarial examples possible

Adversarial examples are really, really weird: pictures of penguins that get classified with high certainty by machine learning algorithms as drumsets...

2020-01-19 12:41:20 00:22:51
Unsupervised Dimensionality Reduction: UMAP vs t-SNE

Unsupervised Dimensionality Reduction: UMAP vs t-SNE

Dimensionality reduction redux: this episode covers UMAP, an unsupervised algorithm designed to make high-dimensional data easier to visualize, cluste...

2020-01-12 10:53:19 00:29:34
Data scientists: beware of simple metrics

Data scientists: beware of simple metrics

Picking a metric for a problem means defining how you’ll measure success in solving that problem. Which sounds important, because it is, but oftentime...

2020-01-05 08:54:57 00:24:47
Communicating data science, from academia to industry

Communicating data science, from academia to industry

For something as multifaceted and ill-defined as data science, communication and sharing best practices across the field can be extremely valuable but...

2019-12-29 11:53:14 00:26:15
Optimizing for the short-term vs. the long-term

Optimizing for the short-term vs. the long-term

When data scientists run experiments, like A/B tests, it’s really easy to plan on a period of a few days to a few weeks for collecting data. The thing...

2019-12-22 12:50:53 00:19:24
Interview with Prof. Andrew Lo, on using data science to inform complex business decisions

Interview with Prof. Andrew Lo, on using data science to inform complex business decisions

This episode features Prof. Andrew Lo, the author of a paper that we discussed recently on Linear Digressions, in which Prof. Lo uses data to predict...

2019-12-15 13:15:09 00:27:46
Using machine learning to predict drug approvals

Using machine learning to predict drug approvals

One of the hottest areas in data science and machine learning right now is healthcare: the size of the healthcare industry, the amount of data it gene...

2019-12-08 08:56:05 00:25:00
Facial recognition, society, and the law

Facial recognition, society, and the law

Facial recognition being used in everyday life seemed far-off not too long ago. Increasingly, it’s being used and advanced widely and with increasing...

2019-12-01 13:14:14 00:43:09
Lessons learned from doing data science, at scale, in industry

Lessons learned from doing data science, at scale, in industry

If you’ve taken a machine learning class, or read up on A/B tests, you likely have a decent grounding in the theoretical pillars of data science. But...

2019-11-24 10:45:42 00:28:00
Varsity A/B Testing

Varsity A/B Testing

When you want to understand if doing something causes something else to happen, like if a change to a website causes and dip or rise in downstream con...

2019-11-17 12:09:46 00:36:00
The Care and Feeding of Data Scientists: Growing Careers

The Care and Feeding of Data Scientists: Growing Careers

In the third and final installment of a conversation with Michelangelo D’Agostino, VP of Data Science and Engineering at Shoprunner, about growing and...

2019-11-10 13:44:18 00:25:19
The Care and Feeding of Data Scientists: Recruiting and Hiring Data Scientists

The Care and Feeding of Data Scientists: Recruiting and Hiring Data Scientists

This week’s episode is the second in a three-part interview series with Michelangelo D’Agostino, VP of Data Science at Shoprunner. This discussion cen...

2019-11-03 10:21:56 00:20:16
0:00
0:00
Episode
No title available
No channel info