Datacast Episode 124: The Open-Source Cloud Playbook, The Modular Future of Data and AI Infrastructure, and Meta-Learning as a VC with Casber Wang

Datacast Episode 124: The Open-Source Cloud Playbook, The Modular Future of Data and AI Infrastructure, and Meta-Learning as a VC with Casber Wang

Casber Wang is Partner at Sapphire Ventures. He focuses primarily on security, enterprise infrastructure, and data analytics.

He is on the boards of Huntress, JumpCloud, StarTree, Tetrate, Upytcs, and Zesty. For his work, Insider listed Casber as an Enterprise VC Rising Star Investor and as an emerging investor charting the industry’s future on the 2022 EVC List.

Prior to Sapphire, he was part of the technology investment banking group at Bank of America Merrill Lynch, where he worked on a number of high-profile IPO and M&A transactions. He also spent time at Wish, a leading mobile commerce platform in North America and Europe.

Datacast Episode 123: AI Monitoring, Product-Oriented Data Science, and the Israeli ML Community with Itai Bar Sinai

Datacast Episode 123: AI Monitoring, Product-Oriented Data Science, and the Israeli ML Community with Itai Bar Sinai

With over 10 years of experience (Google, AI-focused startups) with big data and as the Chief Product Officer and co-founder at Mona, the leading AI monitoring intelligence company, Itai has a unique view of the AI industry. Working closely with data science and ML teams applying dozens of solutions in over 10 industries, Itai encounters a wide variety of business use cases, organizational structures and cultures, and technologies used in today’s AI world.

Datacast Episode 122: The Evolution of Data Visualization, Scaling Data Culture, and The Future of Data Transformation with Gabi Steele

Datacast Episode 122: The Evolution of Data Visualization, Scaling Data Culture, and The Future of Data Transformation with Gabi Steele

Gabi Steele is the co-founder and co-CEO of Preql, a no-code data transformation tool that empowers business users to model and manage their own metrics. She is also the co-founder of Data Culture, a data engineering and visualization consultancy. Previously Gabi led data visualization engineering at WeWork and worked on data storytelling at the Washington Post.

Datacast Episode 121: High-Performance Processing Engine, Modern Data Streaming, and Propelling Minority in Tech with Alex Gallego

Datacast Episode 121: High-Performance Processing Engine, Modern Data Streaming, and Propelling Minority in Tech with Alex Gallego

Alexander Gallego is the founder and CEO of Redpanda Data, a high-performance, Apache Kafka-compatible data streaming platform for mission-critical workloads. He has spent his career immersed in deeply technical environments and is passionate about finding and building solutions to the challenges of modern data streaming.

Before Redpanda, Alex was a principal engineer at Akamai and the co-founder and CTO of Concord.io, a high-performance stream-processing engine acquired by Akamai in 2016. He has also engineered software at Factset Research Systems, Forex Capital Markets, and Yieldmo; and holds a bachelor’s degree in computer science and cryptography from NYU.

Datacast Episode 120: Next-Generation Experimentation, Statistics Engineering, and The Modern Growth Stack with Chetan Sharma

Datacast Episode 120: Next-Generation Experimentation, Statistics Engineering, and The Modern Growth Stack with Chetan Sharma

Chetan Sharma is the Founder & CEO of Eppo, a next-gen A/B experimentation platform that is designed to spur entrepreneurial culture.

As the 4th data scientist at Airbnb and an early data scientist at companies like Webflow, Chetan has been focused on the maturity curve of growth-stage companies and how to establish data as a central stakeholder in decision-making. He previously led the team that developed Airbnb's knowledge repo and has led data teams focused on production machine learning and instrumentation integrity.

Datacast Episode 119: Experimentation Culture, Immutable Data Warehouse, The Data Collaboration Problem, and The Rise of Data Contracts with Chad Sanderson

Datacast Episode 119: Experimentation Culture, Immutable Data Warehouse, The Data Collaboration Problem, and The Rise of Data Contracts with Chad Sanderson

Chad Sanderson was the Product Lead for Convoy's Data Platform team, which includes the data warehouse, streaming, BI & visualization, experimentation, machine learning, and data discovery.

Previously he worked on Microsoft's AI Platform team and led Data initiatives at SEPHORA and Subway. He has built everything from feature stores, experimentation platforms, metrics layers, streaming platforms, analytics tools, data discovery systems, and workflow development platforms.

His love of the data space has also allowed him to implement open-source and SaaS products (early and late-stage) and build cutting-edge technology from the ground up.

Datacast Episode 118: Overcoming Hardships, Confident Learning, Dataset Improvement, and The Ph.D. Rapper with Curtis Northcutt

Datacast Episode 118: Overcoming Hardships, Confident Learning, Dataset Improvement, and The Ph.D. Rapper with Curtis Northcutt

Curtis Northcutt is an American computer scientist and entrepreneur focusing on AI to empower people. He is the CEO and Co-Founder of Cleanlab, building next-generation data-centric AI and open-source technologies that enable AI to work with real-world, messy data.

He completed his Ph.D. at MIT, where he invented confident learning to automatically find label issues in any dataset. Curtis received the MIT thesis award, NSF Fellowship, and Goldwater Scholarship for his work. Before Cleanlab, he worked in AI research teams at Google, Oculus, Amazon, Facebook, Microsoft, and NASA.

Datacast Episode 117: Vector Databases, The Embeddings Revolution, and Working in China with Frank Liu

Datacast Episode 117: Vector Databases, The Embeddings Revolution, and Working in China with Frank Liu

Frank Liu is the Director of Operations at Zilliz with nearly a decade of industry experience in machine learning and hardware engineering. Prior to joining Zilliz, Frank co-founded an IoT startup based in Shanghai and worked as an ML Software Engineer at Yahoo in San Francisco. He presents at major industry events such as Open Source Summit and writes tech content for leading publications such as Towards Data Science and DZone. Frank holds MS and BS degrees in Electrical Engineering from Stanford University.

Datacast Episode 116: Distributed Databases, Open-Source Standards, and Streaming Data Lakehouse with Vinoth Chandar

Datacast Episode 116: Distributed Databases, Open-Source Standards, and Streaming Data Lakehouse with Vinoth Chandar

Vinoth Chandar is the creator and PMC chair of the Apache Hudi project, a seasoned distributed systems/database engineer, and a dedicated entrepreneur. He has deep experience with databases, distributed systems, and data systems at the planet scale, strengthened through his work at Oracle, Linkedin, Uber, and Confluent.

During his time at Uber, he created Hudi, which pioneered transactional data lakes as we know them today, to solve unique speed and scale needs for Uber’s massive data platform. Most recently, Vinoth founded Onehouse - a cloud-native managed lakehouse to make data lakes easier, faster, and cheaper.

Datacast Episode 115: Product-Led Sales, Community-Led Category Creation, and Unlocking Revenue Data with Alexa Grabell

Datacast Episode 115: Product-Led Sales, Community-Led Category Creation, and Unlocking Revenue Data with Alexa Grabell

Alexa Grabell is the co-founder and CEO of Pocus, a Revenue Data platform that is purpose-built for GTM teams to analyze, visualize, and action data about their prospects and customers without needing engineers. 

Alexa’s passion for Product-Led Sales started when she led sales strategy & operations at Dataminr, where she built internal solutions to equip sales teams with data. She studied engineering at Vanderbilt University and received her MBA from Stanford University.