INSIGHTS
DRILL TO DETAIL PODCAST
Mark Rittman is joined each episode by a special guest from the world of business intelligence, analytics and big data.
Drill to Detail Ep.88 'Superset, Preset and the Future of Business Intelligence' with Special Guest Maxime Beauchemin
Maxime Beauchemin returns to the Drill to Detail Podcast and joins Mark Rittman to talk about what's new in Apache Airflow 2.0, the origin story of Apache Superset and now Preset.io, and why the future of business intelligence is open source. They also share news on Marquez, a WeWork-sponsored reference implementation of the OpenLineage open-source metadata service for the collection, aggregation, and visualization of a data ecosystem’s metadata.
Drill to Detail Ep.82 'Looker Development, Automated Testing and Spectacles' with Special Guest Josh Temple
Mark Rittman is joined in this episode by Josh Temple, Analytics Engineer at Spotify, to talk about analytics development, automated testing and Spectacles, an open-source tool and SaaS service that automatically tests your LookML to ensure Looker always runs smoothly for your users.
Drill to Detail Ep.81 'Meltano, Singer Taps and Open-Source Data Pipelines' with Special Guest Douwe Maan
Mark Rittman is joined in this episode by GitLab.com Lead Developer Douwe Maan to discuss the history of the open-source Meltano project and its recent refocus on becoming the "glue" that creates open-source data pipelines, using plugin technologies such as Singer Taps and dbt transformations.
Drill to Detail Ep.48 'Mondrian OLAP, Apache Calcite and Database Dis-Aggregation' With Special Guest Julian Hyde
Drill to Detail returns after the New Year break with Special Guest Julian Hyde from Hortonworks to talk about bitmap indexes and CASE tools, Mondrian and open-source OLAP analysis, and Apache Calcite's mission to bring sanity, cost-based optimisers and support for OLAP workloads to today's dis-aggregated, distributed new-world database engines.
- Oracle Designer page on Oracle.com
- Bitmap Index page on Wikipedia
- Mondrian project page on Github
- Mondrian OLAP Server page on Wikipedia
- MultiDimensional eXpressions (MDX) page on Wikipedia
- Julian Hyde blog
- Apache Calcite project homepage
- Apache Calcite Introduction and Overview deck
- Streaming SQL presentation at Apex Big Data World 2017, Mountain View, California
Drill to Detail Ep.44 'Pandas, Apache Arrow and In-Memory Analytics' With Special Guest Wes McKinney
Mark is joined in this episode of Drill to Detail by Wes McKinney to talk about the origins of the Python Pandas open-source package for data analysis, his subsequent work as a contributor to the Kudu (incubating) and Parquet projects within the Apache Software Foundation, and Arrow, an in-memory data structure specification for use by engineers building data systems and the de-facto standard for columnar in-memory processing and interchange.
- Python Data Analysis Library
- "Ibis on Impala: Python at Scale for Data Science"
- Drill To Detail Ep.3 'Apache Kudu And Cloudera's Analytic Platform' With Special Guest Mike Percy
- Apache Arrow homepage
- "Apache Arrow and the '10 Things I Hate About pandas'"
- "Apache Arrow vs. Parquet and ORC: Do we really need a third Apache project for columnar data representation?"
- "Some comments to Daniel Abadi's blog about Apache Arrow"
- Wes McKinney homepage