
Sangphet developed data cleaning and analysis workflows for the BU-Spark/ds-bcc-liz-breadon-accountability repository, focusing on building and property violations from 2016 onward. Using Python, Pandas, and Jupyter Notebooks, Sangphet created reproducible pipelines to clean, merge, and enrich violation datasets with student housing addresses, exporting results as CSVs for downstream analytics. The work included mapping violation descriptions to severity levels and preparing data structures for visualization and landlord analytics in Looker Studio. By emphasizing traceable commits and scalable workflows, Sangphet improved data quality and enabled faster dashboard setup, supporting data-driven compliance monitoring and communication with landlords and stakeholders.

December 2024 monthly summary for BU-Spark/ds-bcc-liz-breadon-accountability: Delivered a new Building Violations Analysis Notebook that maps violation descriptions to severity levels, exports processed data to CSV, and prepares data for visualization and landlord analytics in Looker Studio. The work enhances risk visibility, enables proactive compliance insights, and lays the foundation for data-driven landlord communications.
December 2024 monthly summary for BU-Spark/ds-bcc-liz-breadon-accountability: Delivered a new Building Violations Analysis Notebook that maps violation descriptions to severity levels, exports processed data to CSV, and prepares data for visualization and landlord analytics in Looker Studio. The work enhances risk visibility, enables proactive compliance insights, and lays the foundation for data-driven landlord communications.
October 2024: Delivered a data cleaning and Looker Studio integration feature in BU-Spark/ds-bcc-liz-breadon-accountability. Key outcomes include a cleaned 2016+ violations dataset with refined addresses and contact info, exported as CSV, plus a Looker Studio-ready workflow to merge violations with student housing addresses for dashboard analysis (pie chart of top violations). Major bugs fixed: None reported this month. Overall impact and accomplishments: Improved data quality and accessibility of violation analytics, enabling faster dashboard readiness and data-driven decision-making for housing and compliance initiatives. The reproducible notebooks and CSV exports lay groundwork for scalable analytics and ongoing monitoring of top violation types. Technologies/skills demonstrated: Python data wrangling, Jupyter notebooks, CSV I/O, data merging/enrichment, Looker Studio integration, data visualization preparation, and emphasis on reproducible workflows with commit traceability.
October 2024: Delivered a data cleaning and Looker Studio integration feature in BU-Spark/ds-bcc-liz-breadon-accountability. Key outcomes include a cleaned 2016+ violations dataset with refined addresses and contact info, exported as CSV, plus a Looker Studio-ready workflow to merge violations with student housing addresses for dashboard analysis (pie chart of top violations). Major bugs fixed: None reported this month. Overall impact and accomplishments: Improved data quality and accessibility of violation analytics, enabling faster dashboard readiness and data-driven decision-making for housing and compliance initiatives. The reproducible notebooks and CSV exports lay groundwork for scalable analytics and ongoing monitoring of top violation types. Technologies/skills demonstrated: Python data wrangling, Jupyter notebooks, CSV I/O, data merging/enrichment, Looker Studio integration, data visualization preparation, and emphasis on reproducible workflows with commit traceability.
Overview of all repositories you've contributed to across your timeline