Show HN: Sanjaya – Academic paper discovery and extraction (OpenAlex/Scrapy)
Category: infrastructure
Tags: academic-research, data-extraction, web-scraping, react, fastapi
Score: 4.5/10 (Innovation: 4, Technical: 5, Documentation: 4, Utility: 5)
Sanjaya is a decoupled web application for academic paper discovery and extraction, integrating the OpenAlex API with Scrapy and Playwright for scraping journals in English and Mandarin. It provides structured export to CSV, JSON, and ZIP, making it a practical tool for researchers collecting academic datasets. While functional, it lacks novel technical contributions and has limited documentation beyond basic setup.
Target audience: data engineers, researchers
Repository: https://sanjaya-six.vercel.app/ · TypeScript · MIT · 1 stars
View on Hacker News