INSIGHTS
DRILL TO DETAIL PODCAST
Mark Rittman is joined each episode by a special guest from the world of business intelligence, analytics and big data.
Months
- September 2016 2
- October 2016 4
- November 2016 5
- December 2016 3
- January 2017 2
- February 2017 3
- March 2017 4
- April 2017 1
- May 2017 4
- June 2017 4
- July 2017 4
- August 2017 1
- September 2017 1
- October 2017 3
- November 2017 2
- December 2017 4
- January 2018 1
- February 2018 2
- March 2018 1
- April 2018 2
- May 2018 2
- June 2018 2
- July 2018 1
- October 2018 1
- March 2019 2
- April 2019 3
- May 2019 2
- June 2019 3
- July 2019 2
- August 2019 1
- October 2019 1
- November 2019 2
- December 2019 1
- January 2020 1
- April 2020 2
- May 2020 2
- June 2020 1
- July 2020 1
- December 2020 1
- March 2021 3
- April 2021 2
- May 2021 1
- June 2021 2
- March 2022 2
- April 2022 2
- May 2022 2
- February 2023 1
- March 2023 2
- April 2023 2
- May 2023 1
- June 2023 2
- July 2023 2
- August 2023 2
- September 2023 1
- September 2024 1
- October 2024 2
- November 2024 2
Tags
- AI 3
- Airbnb 1
- Airflow 3
- Amazon Athena 2
- Amazon Web Services 1
- Analytics in Startups 5
- Apache Arrow 2
- Apache Drill 2
- Apache Kafka 2
- Apache Kud 1
- Apache Kudu 2
- Artificial Intelligence 5
- Automated Analytics 3
- Bi-Modal Analytics 5
- BiModal IT 2
- Cloudera 2
- Confluent 1
- Consulting 3
- Cube 4
- Customer Data Platform 4
- Dagster 2
- Data Capital 2
- Data Catalogs 1
- Data Discovery 1
- Data Engineering 6
- Data Fabric 1
- Data Governance 6
- Data Lineage 1
- Data Modelling 5
- Data Pipelines 5
- Data Platform Engineers 1
- Data Prep 3
- Data Quality 4
- Data Robot 1
- Data Teams 1
- Data Warehousing 4
- Databricks 1
- Devops 1
- DuckDB 1
- ETL 2
- Elasticsearch 1
- Embedded Analytics 1
- Evaluex 1
- Firebolt 1
- Fishtown Analytics 1
- FiveTran 2
- GA4 1
- GDPR 2
- Gartner 5
- Gluent 2
Drill to Detail Ep.44 'Pandas, Apache Arrow and In-Memory Analytics' With Special Guest Wes McKinney
Mark is joined in this episode of Drill to Detail by Wes McKinney, to talk about the origins of the Python Pandas open-source package for data analysis and his subsequent work as a contributor to the Kudu (incubating) and Parquet projects within the Apache Software Foundation and Arrow, an in-memory data structure specification for use by engineers building data systems and the de-facto standard for columnar in-memory processing and interchange.
Mark is joined in this episode of Drill to Detail by Wes McKinney, to talk about the origins of the Python Pandas open-source package for data analysis and his subsequent work as a contributor to the Kudu (incubating) and Parquet projects within the Apache Software Foundation and Arrow, an in-memory data structure specification for use by engineers building data systems and the de-facto standard for columnar in-memory processing and interchange.
- Python Data Analysis Library
- "Ibis on Impala: Python at Scale for Data Science"
- Drill To Detail Ep.3 'Apache Kudu And Cloudera's Analytic Platform' With Special Guest Mike Percy
- Apache Arrow homepage
- "Apache Arrow and the "10 Things I Hate About pandas"
- "Apache Arrow vs. Parquet and ORC: Do we really need a third Apache project for columnar data representation?"
- "Some comments to Daniel Abadi's blog about Apache Arrow"
- Wes McKinney homepage
Drill to Detail Ep.13 ‘Apache Drill, MapR + Bringing Data Discovery to Hadoop & NoSQL’ with Special Guest Neeraja Rentachintala
Mark Rittman is joined by MapR's Neeraja Rentachintala to talk about Apache Drill, Apache Arrow, MapR-DB, extending Hadoop-based data discovery to self-describing file formats and NoSQL databases, and why MapR backed Drill as their strategic SQL-on-Hadoop platform technology.