This repo contains the code to download judgments from ecourts website. Data for bulk download is available freely from AWS. It contains judgments from 1950 to Present, along with raw metadata (in json format) and structured metadata(parquet format). Judgments are available in both English and regional Indian languages in zip format for easier download.
- The data is licensed under Creative Commons Attribution 4.0 (CC-BY-4.0), which means you are free to use, share, and adapt the data as long as you provide appropriate attribution.
- AWS sponsors the storage and data transfer costs of the data.
- Join discord server if you want to collaborate on this repo or have any questions.
- Be responsible, considerate, and think about the maintainers of the ecourts website. Avoid scraping with high concurrency.
- Data downloaded using previous version of this repo from an API is availavble in Kaggle. Code for that is available in the branch old
- From 1950 to Present
- ~35K judgments in english, some of which have regional language versions.
- ~52.24GB of data (see dataset_sizes.csv for detailed breakdown)
- Any metadata about judgment like Disposal nature, Decision date etc have also been part of the dataset.
s3://indian-supreme-court-judgments/
├── data/
│ └── zip/
│ └── year=YYYY/
│ ├── english.zip
│ ├── english.index.json
│ ├── regional.zip
│ └── regional.index.json
└── metadata/
├── zip/
│ └── year=YYYY/
│ ├── metadata.zip
│ └── metadata.index.json
└── parquet/
└── year=YYYY/
└── metadata.parquet
Where YYYY represents the year (1950-2025).
Each year has following data:
- English judgments (english.zip)
- Regional language judgments (regional.zip)
- Metadata (metadata.zip and metadata.parquet)
- index.json files that contain info about the files in the zip files
Columns/fields in the metadata.parquet are
- title
- petitioner
- respondent
- description
- judge
- author_judge
- citation
- case_id
- cnr
- decision_date
- disposal_nature
- court
- available_languages
- raw_html
- path
- nc_display
- scraped_at
- year
- Example command to list all available years:
aws s3 ls s3://indian-supreme-court-judgments/data/zip --no-sign-request - Example command to download English judgments for 2023:
aws s3 cp s3://indian-supreme-court-judgments/data/zip/year=2023/english.zip . --no-sign-request - Example command to view metadata index for 2023:
aws s3 cp s3://indian-supreme-court-judgments/data/zip/year=2023/metadata.index.json . --no-sign-request - Since the S3 bucket is public, files can also be downloaded using links like
https://indian-supreme-court-judgments.s3.amazonaws.com/data/zip/year=2023/english.zip
See the AWS tutorials for more detailed examples of:
- Downloading and extracting judgment data
- Querying metadata using AWS Athena here
- install uv
- install dependencies:
uv sync source .venv/bin/activatepython3 download.py- VS Code extensions:
Python,Pylance,ruff