James Le

July 28, 2019

Datacast

Datacast Episode 18: NLP at Crisis Text Line with Ankit Gupta

James Le

July 28, 2019

Datacast

Ankit Gupta is a data scientist at Crisis Text Line, a New York-based not-for-profit tech startup providing 24/7 free text-based crisis support to individuals. At Crisis Text Line, Ankit developed an AI-driven triage system that detects signals of suicidality within the first few messages sent by texters. Using the triage system, the counselors can serve 93% of at-risk texters in under five minutes.

James Le

July 16, 2019

Datacast

Datacast Episode 17: Computer Vision Research with Genevieve Patterson

James Le

July 16, 2019

Datacast

Genevieve Patterson is the chief scientist at TRASH, a startup that is developing computational filmmaking tools for mobile iphono-graphers. Before that, she was a Postdoctoral Researcher at Microsoft Research New England. Her work is about creating dialog between AI and people. Her interests include video understanding, visual attribute discovery, human-in-the-loop systems, fine-grained object recognition, medical image understanding, and active learning. Genevieve received her Ph.D. from Brown University in 2016 under the direction of James Hays.

James Le

July 9, 2019

Datacast

Datacast Episode 16: Bayesian Probabilistic Programming with Peadar Coyle

James Le

July 9, 2019

Datacast

Peadar Coyle is a Data Scientist and Engineer based in London. He regularly speaks at conferences and has written a book consisting of numerous interviews with Data Scientists throughout the world. He is also a passionate Open-Source evangelist, himself a supporter who has contributed to PyMC3. Most recently, he founded a stealth startup that is working on hyper-personalized audio.

James Le

June 29, 2019

Datacast

Datacast Episode 15: Thoughts on Data Science from the perspective of a behavioral scientist with Nick Gaylord

James Le

June 29, 2019

Datacast

Nick Gaylord has worked as a data scientist in the Bay Area for about the last 5 years. Currently, he’s a member of the Johnson & Johnson Health Technology team, and prior to that, he has worked in different fields ranging from small business revenue analytics to enterprise machine-learning-as-a-service platforms. Like many data scientists, he started out as an academic before transitioning to industry, in his case earning a Ph.D. in Psycholinguistics from the University of Texas at Austin in 2013.

James Le

June 4, 2019

Datacast

Datacast Episode 14: Overcoming Impostor Syndrome with Conor Dewey

James Le

June 4, 2019

Datacast

Conor Dewey is a data scientist at Squarespace who spends his time thinking about growth and engagement. He frequently shares insights from his experience interviewing at top companies. More generally, he offers resources and advice on Medium and Github in the hopes of ‘open sourcing’ his journey to data science. Conor also manages a weekly data science newsletter with 1000+ subscribers.

James Le

May 4, 2019

Datacast

Datacast Episode 13: Transition From Academia to Data Science with Martina Pugliese

James Le

May 4, 2019

Datacast

Martina is a physicist and works as Data Science Lead at Mallzee, based in Edinburgh (Scotland). She loves looking at data regardless of the topic and area and believes the most enjoyable thing in Data Science is analyzing your data, finding the one you need for your question, and producing facts out of it. Throughout her education and job experience, she worked with data in epidemic dynamics, linguistics, and fashion. She also loves producing hand-crafted data visualizations and keeps studying and improving, whether it’s about Machine Learning or leadership topics.

James Le

April 30, 2019

Computer Science

An Introduction to Big Data: Distributed Data Processing

James Le

April 30, 2019

Computer Science

An Introduction to Big Data: Distributed Data Processing

This semester, I’m taking a graduate course called Introduction to Big Data. It provides a broad introduction to the exploration and management of large datasets being generated and used in the modern world. In an effort to open-source this knowledge to the wider data science community, I will recap the materials I will learn from the class in Medium. Having a solid understanding of the basic concepts, policies, and mechanisms for big data exploration and data mining is crucial if you want to build end-to-end data science projects.

James Le

April 26, 2019

Computer Science

An Introduction to Big Data: Decision Trees

James Le

April 26, 2019

Computer Science

An Introduction to Big Data: Decision Trees

This semester, I’m taking a graduate course called Introduction to Big Data. It provides a broad introduction to the exploration and management of large datasets being generated and used in the modern world. In an effort to open-source this knowledge to the wider data science community, I will recap the materials I will learn from the class in Medium. Having a solid understanding of the basic concepts, policies, and mechanisms for big data exploration and data mining is crucial if you want to build end-to-end data science projects.

James Le

April 11, 2019

Computer Science

An Introduction to Big Data: Clustering

James Le

April 11, 2019

Computer Science

This semester, I’m taking a graduate course called Introduction to Big Data. It provides a broad introduction to the exploration and management of large datasets being generated and used in the modern world. In an effort to open-source this knowledge to the wider data science community, I will recap the materials I will learn from the class in Medium. Having a solid understanding of the basic concepts, policies, and mechanisms for big data exploration and data mining is crucial if you want to build end-to-end data science projects.

James Le

April 3, 2019

Computer Science

An Introduction to Big Data: Itemset Mining

James Le

April 3, 2019

Computer Science

An Introduction to Big Data: Itemset Mining

This semester, I’m taking a graduate course called Introduction to Big Data. It provides a broad introduction to the exploration and management of large datasets being generated and used in the modern world. In an effort to open-source this knowledge to the wider data science community, I will recap the materials I will learn from the class in Medium. Having a solid understanding of the basic concepts, policies, and mechanisms for big data exploration and data mining is crucial if you want to build end-to-end data science projects.