Big Data
Blog Posts tagged Big Data#
Resources tagged Big Data#
How Open Source, Python and AI Are Shaping the Data Future with Wes McKinney
The future of analytics isn’t just about bigger models — it’s about building smarter, more interoperable data systems. Wes McKinney, Principal Architect of Posit PBC, Chief Scientist of Voltron Data and a General Partner at Composed Ventures, joins us to explore how the modern data stack is evolving and what it means for the future of analytics. Wes reflects on his journey building pandas and Apache Arrow, sharing how open-source ecosystems grow, transform and shape the way organizations work with data today. Wes also highlights the rising importance of semantic layers, agentic workflows and defensive coding practices as teams embrace AI-driven development.
Key Takeaways:
00:00 Introduction. 02:32 Wes didn’t expect pandas to drive AI but he recognized Python’s unrealized potential. 05:09 A lucky convergence helped Python’s tools snowball into the AI standard. 10:40 Early big data focused on essentials, not the interoperable stacks we rely on today. 15:44 The composable data stack grew through bottom-up, grassroots open-source momentum. 21:56 Many “data science” roles ultimately became business intelligence and dashboard work. 25:24 Complex statistical work still depends on human judgment, not fully autonomous agents. 30:27 Frontier models retrieve table data reliably, while smaller models fail dramatically. 35:16 Better models and coding agents shifted Wes from an AI skeptic to an adopter. 40:07 AI-driven code demands stronger testing and review to avoid costly failures. 45:14 An AI-built finance project ballooned, revealing how agents inflate codebases.
Resources Mentioned:
Wes McKinney https://www.linkedin.com/in/wesmckinn/
Posit PBC | LinkedIn https://www.linkedin.com/company/posit-software/
Posit PBC | Website https://posit.co/
Voltron Data | LinkedIn https://www.linkedin.com/company/voltrondata/
Voltron Data | Website https://voltrondata.com/
Composed Ventures | LinkedIn https://www.linkedin.com/company/composedvc/
Composed Ventures | Website https://composed.vc/
pandas https://pandas.pydata.org/
Apache Arrow https://arrow.apache.org/
DuckDB https://duckdb.org/
DataFusion https://datafusion.apache.org/
Jupyter Notebook https://jupyter.org/
Parquet https://parquet.apache.org/
Iceberg https://iceberg.apache.org/
Delta Lake https://delta.io/
Thanks for listening to the “Data Masters Podcast.” If you enjoyed this episode, be sure to subscribe so you never miss our latest discussions and insights into the ever-changing world of data.
#DataStrategy #DataManagement #DataMastersPodcast
