Data Integration

Datacast Episode 103: Computational Economics, Statistical Arbitrage, and Adaptable Data Consolidation with Eric Daimler

Dr. Eric Daimler is an authority in Artificial Intelligence with over 20 years of experience in the field as an entrepreneur, executive, investor, technologist, and policy advisor. Eric has co-founded six technology companies that have done pioneering work in areas ranging from software systems to statistical arbitrage.

As a Presidential Innovation Fellow during the Obama Administration, Eric helped drive the agenda for U.S. leadership in research, commercialization, and public adoption of AI. He has also served as the Assistant Dean and an Assistant Professor of Software Engineering at Carnegie Mellon’s School of Computer Science. He specializes in public policy and economics, helped launch Carnegie Mellon’s Silicon Valley Campus, and founded its Entrepreneurial Management program. His academic research focuses on the intersection of Machine Learning, Computational Linguistics, and Network Science.

As a frequent keynote speaker, Eric has presented at venues including the engineering schools of MIT, Stanford, and Harvard. He studied at Stanford University, the University of Washington-Seattle, and Carnegie Mellon University, where he earned his Ph.D. in its School of Computer Science.

Datacast Episode 95: Open-Source DataOps, Building In Public, and Remote Work Culture with Douwe Maan

Douwe Maan is the founder and CEO of Meltano, an open-source DataOps platform. Before joining Meltano, he was hired as the tenth employee at GitLab, later becoming an Engineering Manager. While at GitLab, he spent six months traveling the world, visiting colleagues in 14 different countries. In 2019, he joined the internal Meltano project at GitLab and quickly became its General Manager. In early 2021, Douwe led Meltano in spinning out of GitLab to become an independent startup, raising seed funding from investors led by Alphabet's GV.

Datacast Episode 75: Commoditizing Data Integration Pipelines with Michel Tricot

Michel Tricot has been working in data engineering for 15 years. Originally from France, Michel came to the US in 2011 to join a small startup named LiveRamp. As the company grew, he became the Head of Integrations and Director of Engineering, where his team built and scaled over 1,000 data ingestion and distribution connectors to replicate hundreds of terabytes of data every day.

After LiveRamp’s acquisition and later IPO (NYSE:RAMP), he wanted to return to an early-stage startup. So he joined rideOS as Director of Engineering, again deep in data engineering. While there, he realized that companies kept solving the same problem over and over, a problem that should be solved once and for all.

This was when he decided to start a new company, and Airbyte was born.

An Introduction to Big Data: Data Integration

This semester, I’m taking a graduate course called Introduction to Big Data. It provides a broad introduction to the exploration and management of the large datasets being generated and used in the modern world. In an effort to open-source this knowledge to the wider data science community, I will recap the material I learn in the class on Medium. Having a solid understanding of the basic concepts, policies, and mechanisms for big data exploration and data mining is crucial if you want to build end-to-end data science projects.