May 30, 2026
- Migrated from curl_cffi to wreq.
- Upgraded the collections aggregation strategy to have concurrency at the collections level, resulting in a tremendously faster collections aggregation strategy for stores with more than 25k products.
- Interactive menu-based text-user-interface (TUI) with live on-screen scraping progress.
- Very fast scraping (~ up to 3,000 products/sec).
- Bypasses Cloudflare's anti-bot protections.
- Handles timeouts via auto-retries and exponential back-off.
- Bypasses /products.json endpoint blocks by auto-detecting a store's myshopify.com domain.
- Produces ready-to-import CSVs (with proper column and row-formatting) to allow the user to immediately use the CSVs in Shopify.
- Respects the 15-MB-size and 50,000-row Shopify limits per CSV. For large catalogs, it auto-splits the data into multiple CSVs.
For any Shopify store, the scraper produces a JSON Lines (.jsonl) file that contains the entire product catalog and one or more CSV files with proper formatting for immediate Shopify product import.
For stores with product catalogs of more than 25,000 products, the scraper falls back to the collections aggregation strategy, which makes it slower (mitigated significantly in the May 30, 2026 update).
Make sure you navigate to the project folder, then write the commands below.
MacOS/Linux
python3 -m venv venvWindows
python -m venv venvMacOS/Linux
source venv/bin/activateWindows CMD
.\venv\Scripts\activateWindows Bash
source venv/Scripts/activateMacOS/Linux
pip3 install -r requirements.txtWindows
pip install -r requirements.txtMacOS/Linux
python3 main.pyWindows
python main.py- Press '1' in the main menu screen and press ENTER.
- Type your target store URL (e.g. https://www.gymshark.com/) and press ENTER.
- Type your output CSV name and press ENTER.
- Wait until scraping is complete.
- Enjoy your CSVs.




