
Contributed to the dataforgoodfr/13_reveler_inegalites_cinema project by establishing a foundational database schema to support scalable data ingestion and analytics for film festival data. Designed and implemented tables for films, directors, producers, distributors, and festivals, including junction tables and initial seeding with international festival data. Enhanced data quality by introducing enrichment processes, such as new tables for chef positions and Python-based name matching using Levenshtein and Jaro-Winkler algorithms. Improved data ingestion workflows by refactoring parsing logic and strengthening extraction notebooks. Leveraged Python, SQL, and SQLAlchemy to ensure traceable, reproducible changes and enable robust cross-dataset matching for data-driven analysis.
In February 2025, delivered a solid data foundation for the dataforgoodfr/13_reveler_inegalites_cinema project, enabling scalable data ingestion, improved data quality, and reliable analytics for festival coverage and film data. Key outcomes include schema initialization, data enrichment, and robust ingestion notebooks, all designed to support cross-dataset matching and data-driven decision making for inequalities in cinema access.
In February 2025, delivered a solid data foundation for the dataforgoodfr/13_reveler_inegalites_cinema project, enabling scalable data ingestion, improved data quality, and reliable analytics for festival coverage and film data. Key outcomes include schema initialization, data enrichment, and robust ingestion notebooks, all designed to support cross-dataset matching and data-driven decision making for inequalities in cinema access.

Overview of all repositories you've contributed to across your timeline