Research
My work sits at the intersection of legal informatics, natural language processing, and applied machine learning. I study whether individual judges exhibit stable, learnable decision-making patterns that diverge measurably from aggregate court behaviour. This research programme has two pillars: developing reliable methods for extracting structured data from unstructured judicial text, and training specialist predictive models that capture judge-specific decisional profiles.
My most recent study, The Judge Variable (2025), uses 18,937 living arrangements rulings from 10,306 child custody cases in French appellate courts. Specialist models trained on individual judges' past rulings achieve F1 scores as high as 92.85%, compared to 82.63% for a generalist model trained on 20 to 100 times more data. In-domain and cross-domain validity tests provide empirical support for the legal realist hypothesis that judicial identity plays a measurable role in case outcomes.
Legal NLP Judicial Prediction Legal Realism LLM Data Extraction Prompt Engineering Feature Engineering Open Data (Judilibre) French Case Law
Publications
Tools & Data
LILA: Litigation Data Extraction with LLMs
Datasets and code for the prompt engineering study on zero-shot legal data extraction from French court decisions.
Papers with Code & Data
Replication packages for published studies will be made available here as they are prepared for open access.
Contact
guillaume.zambrano [at] unimes.fr
arXiv · HAL · ResearchGate · dblp · Academia.edu · Portail Universitaire du Droit