DEF CON Insights is a data analysis project that transforms DEF CON conference data into clean, structured formats for deeper exploration and visualization.
This repository currently provides scripts to fetch and process structured content (events, tags, speakers, locations) from DEF CONβs Firebase backend. A Quarto site with published statistics, trends, and visual insights is planned.
- Fetch raw data from DEF CONβs Firebase/Firestore
- Normalize and flatten nested fields into analysis-ready tables
- Combine multiple years for cross-year comparisons
scripts/
βββ 01_fetch_firebase_data.py # Downloads raw JSON data for a given DEF CON year
βββ 02_process_defcon_data.py # Transforms verbose Firestore exports into flat CSV/JSON
βββ 03_combine_all_years.py # Merges all processed files into a single dataset
run_pipeline.sh # Orchestrates the full data pipeline for selected DEF CON yearsYou can run the full pipeline for DEF CON 32 and 33 using:
bash run_pipeline.shThis will:
- Fetch raw data into
data/raw/ - Process cleaned files into
data/processed/ - Merge all processed years into
data/combined/defcon_all_events.csv
This project leverages:
- Polars for fast, multi-threaded DataFrame transformations
- Requests to fetch raw JSON from Firebase
- Quarto (coming) for analysis notebooks and site publishing
- Plotnine (coming) for grammar-of-graphics plotting and visual insights
This project is MIT-licensed. See LICENSE for details.