PC Part Dataset
A dataset of PC parts scraped from PCPartPicker.
Part count: 60,506
Last updated: March 27, 2024
Download
The parts are available in JSON, JSON Lines, and CSV format. You can find them in the ./data
directory.
JSON Property Descriptions
Check out API.md for JSON property descriptions of each product category.
Contents
Running the Scraper
Warning
Make sure to turn on your VPN before running this scraper!
- Run
npm install
- Run
npm run start
- Wait about 5-10 minutes for the scraper to finish doing its thing.
- Everything will be emitted to a directory named
data-staging
.
If you don't want to scrape every part, you can specify individual parts by passing them as arguments. You can find the list of part type IDs in API.md.
Example usage:
# Only get CPUs and video cards
npm run start -- cpu video-card
package.json
Scripts
start
=> Starts the scraper.
dev
=> Starts the scraper without typechecking.
count
=> Runs counter.ts
; gets the number of parts from ./data-staging/json
or argv[1].
output
=> Runs output.ts
; outputs JSONL and CSV formats to ./data-staging
or argv[1].
zip
=> Zips the JSON, JSONL, and CSV folders in ./data
.
License
MIT