Forecasting the Supreme Court: A Comparative Analysis of Machine Learning Algorithms on Petitioner vs. Appellee Outcomes

dc.contributorDr. George Dimitoglou
dc.contributorDr. Jill Tysse
dc.contributorDr. Jiang Li
dc.contributor.advisorDr. George Dimitoglou
dc.contributor.authorBenjamin Chase Davids
dc.contributor.departmentHood College Department of Computer Science and Information Technology
dc.contributor.programHood College Departmental Honors
dc.date.accessioned2024-04-25T17:57:00Z
dc.date.available2024-04-25T17:57:00Z
dc.date.issued2024-04-25
dc.descriptionThis research seeks to address the question of whether algorithms can accurately predict petitioner-appellee outcomes of U.S. Supreme Court cases. We compare 4 machine learning algorithms, and we do a SHAP feature analysis on the most accurate algorithm, LightGBM.
dc.description.abstractSince its inception, Supreme Court decisions have impacted American laws and life. The ability to predict the high court’s decisions, known as quantitative legal prediction, would be of interest to those in the legal profession and the general public. While much research has been conducted on quantitative legal prediction, for various foreign high courts, the few experiments that have specifically addressed United States Supreme Court cases are now outdated, have been prone to overfitting, or were based on limited datasets. Our work and experimentation attempt to predict case outcomes while addressing the shortcomings of past research. In this work, we deployed several machine learning algorithms to predict whether the petitioner or appellee will win a Supreme Court case and compared the algorithms based on their prediction accuracy. Finally, we embarked on identifying which case features have the greatest predictive impact on the winner of a case. Using four machine learning algorithms (Random Forest, XGBoost, LightGBM, and Multilayer Perceptron) we trained, evaluated, and tested the predictive accuracy on the Washington University School of Law dataset of over 8,000 Supreme Court cases that were litigated between 1946 to 2016. Success was measured via a model’s accuracy, AUROC, and the associated weighted F1 score. Three of the four algorithms achieved accuracy, AUROC, and weighted F1 score in the mid-0.70s with LightGBM being the most accurate. The three case features that most influence LightGBM’s performance are the reason the Supreme Court granted a petition for certiorari, the category of the appellee, and the category of the petitioner. High performing algorithms and models such as the ones we have deployed could provide some predictive insight to individuals, lawyers, and policymakers that may be affected by Supreme Court decisions. Future research directions may include training the algorithms using semantically meaningful textual data or additional case variables.
dc.format.extent63 pages
dc.genreDepartmental Honors Thesis
dc.genreTischer Departmental Honors Paper
dc.genreDepartmental Honors Research Paper
dc.genreHood College Departmental Honors Paper
dc.identifierdoi:10.13016/m2hz1t-2jhv
dc.identifier.urihttp://hdl.handle.net/11603/33254
dc.language.isoen_US
dc.rightsThis thesis is the intellectual property of Benjamin Chase Davids and has been submitted to the Department of Computer Science and Information Technology and the Honors Department at Hood College in partial fulfillment of the requirements for the computer science and departmental honors program. This thesis is available for unrestricted access and dissemination under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike (CC BY-NC-SA) 4.0 International License (https://creativecommons.org/). You are free to share and adapt the material, but you must give appropriate credit, not use it for commercial purposes, and share any derivative works under the same license.
dc.rightsAttribution-NonCommercial-ShareAlike 3.0 United Statesen
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/us/
dc.subjectmachine learning
dc.subjectprediction
dc.subjectSupreme Court
dc.subjectpetitioner
dc.subjectappellee
dc.titleForecasting the Supreme Court: A Comparative Analysis of Machine Learning Algorithms on Petitioner vs. Appellee Outcomes
dc.typeText

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Forecasting the Supreme Court.pdf
Size:
1.05 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.65 KB
Format:
Item-specific license agreed upon to submission
Description: