Show HN: Turn documents into lip-synced video readers

Category: library

Tags: document-to-video, lip-sync, tts, text-highlighting, python

Score: 7.0/10 (Innovation: 7, Technical: 7, Documentation: 8, Utility: 6)

Screencastgen converts PDF, EPUB, and plain-text documents into narrated video readers with synchronized text highlighting and lip-sync, using pluggable TTS and alignment providers such as Qwen, WhisperX, and LatentSync. It combines document processing, narration, and AI-driven video generation in a single pipeline, and it includes a full-stack web application for accessibility. The project is technically complex and well documented, but adoption is currently limited and its utility outside niche content-creation use cases is moderate.
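As a rough illustration of what "pluggable TTS and alignment providers" could look like, here is a minimal Python sketch. It is not Screencastgen's actual API: every name in it (`TTSProvider`, `AlignmentProvider`, `FixedRateTTS`, `UniformAligner`, `narrate`, `active_word`) is hypothetical, and the two stand-in providers fake the work a real TTS engine or forced aligner would do.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Protocol


@dataclass
class WordCue:
    """One highlighted word and the time window in which it is spoken."""
    word: str
    start: float  # seconds
    end: float    # seconds


class TTSProvider(Protocol):
    """Hypothetical interface: any object with this method can act as a TTS backend."""
    def synthesize(self, text: str) -> float:
        """Return the duration, in seconds, of the narration for `text`."""
        ...


class AlignmentProvider(Protocol):
    """Hypothetical interface: maps narrated text to per-word timings."""
    def align(self, text: str, duration: float) -> list[WordCue]:
        ...


class FixedRateTTS:
    """Stand-in TTS that assumes every word takes the same time to speak."""
    def __init__(self, seconds_per_word: float = 0.5) -> None:
        self.seconds_per_word = seconds_per_word

    def synthesize(self, text: str) -> float:
        return len(text.split()) * self.seconds_per_word


class UniformAligner:
    """Stand-in aligner that spreads words evenly across the audio duration."""
    def align(self, text: str, duration: float) -> list[WordCue]:
        words = text.split()
        if not words:
            return []
        step = duration / len(words)
        return [WordCue(w, i * step, (i + 1) * step) for i, w in enumerate(words)]


def narrate(text: str, tts: TTSProvider, aligner: AlignmentProvider) -> list[WordCue]:
    """Run the two pipeline stages: synthesize narration, then align words to it."""
    duration = tts.synthesize(text)
    return aligner.align(text, duration)


def active_word(cues: list[WordCue], t: float) -> str | None:
    """Return the word to highlight at playback time `t`, or None if no word is active."""
    for cue in cues:
        if cue.start <= t < cue.end:
            return cue.word
    return None


cues = narrate("turn documents into video", FixedRateTTS(0.5), UniformAligner())
```

Because `narrate` depends only on the two protocols, swapping in a different backend would mean writing another class with the same method, without touching the pipeline code.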

Target audience: backend devs, devops, content creators

Repository: https://shashekhar.github.io/screencastgen/demo-reader/ · Python · Apache-2.0
