π¬ Predoctoral Researcher, Ellison Institute of Technology Oxford, Limited
π BSc Statistics, University of British Columbia (UBC)
π§ Applied Econometrics β’ Statistical Modelling β’ Data Systems
πΌ LinkedIn β’ π Research β’ π CV β’ βοΈ Email
I work at the intersection of economics, data science, and research engineering, designing end-to-end systems for empirical analysis.
I focus on problems where measurement is difficult, data is imperfect, and decisions depend on credible empirical structure.
My work starts with large, messy, or fragmented data, moves through data engineering and dataset construction, and culminates in papers, structured evidence, or decision-ready outputs.
Across projects, I prioritise measurement, scalability, and empirical designs that make complex systems interpretable.
- Healthcare Productivity, Access, and Inequality
- Labour Markets, AI Diffusion, and Technology Adoption
- Bayesian Forecasting and Model Uncertainty
- Privacy, Disclosure, and Data Infrastructure
- NLP, Entity Resolution, and Document-Level Extraction
Raw / Unstructured Data
β
Data Collection & Scraping
β
Cleaning & Normalisation
β
Dataset Construction
β
Modelling & Analysis
β
Papers β’ Insights β’ Decision Outputs
| Area | Focus |
|---|---|
Current role |
Predoctoral Researcher in Economics, Ellison Institute of Technology Oxford, Limited |
What I build |
End-to-end data pipelines, analytical datasets, and paper-linked empirical outputs |
Methods |
Panel data, fixed effects, Bayesian state-space models, forecasting, NLP, entity resolution |
Tools |
Python, R, SQL, Spark, Stata, Git, LaTeX |
Scale |
100k+ documents β’ 300M+ records |
π Independent Research
Public archive of research papers with linked codebases.
π€ AI Occupation Adoption Gap Analysis
Empirical analysis of the gap between theoretical AI exposure and observed workplace adoption.
π Bayesian Unemployment Forecasting Analysis
State-space modelling of unemployment with a focus on structural trends and predictive uncertainty.
πΈοΈ Darkweb Text Analysis Pipeline
Large-scale text processing system for forum and marketplace data.
π§© SDK Release Notes Pipeline
A workflow for collecting, parsing, and normalising SDK release notes from GitHub and vendor-hosted changelogs.
π‘οΈ CVE Vulnerability Exposure Pipeline
Integrated dataset combining NVD, CVE, EPSS, and KEV for vulnerability risk analysis.
- The Capability-Use Gap in Occupational AI Diffusion
- Latent Trend Discipline and Forecast Uncertainty in U.S. Unemployment
Research / Predoc
β Independent Research
β AI Occupation Adoption Gap Analysis
β Bayesian Unemployment Forecasting Analysis
Data Science / Analytics
β CVE Vulnerability Exposure Pipeline
β Finance Job Postings Pipeline
β Privacy Disclosures Pipeline
Research Engineering / NLP
β Darkweb Text Analysis Pipeline
β SDK Release Notes Pipeline
β Analyst Report Entity Resolution
Many projects rely on licensed or restricted datasets. Where raw data cannot be shared, I prioritise:
- Transparent pipeline structure
- Clear schema and transformation logic
- Reproducible workflows with sample or public inputs
The goal is to make methods and reasoning inspectable, even when data is not.