mirror of
https://github.com/jlengrand/tldw.git
synced 2026-03-10 08:51:17 +00:00
aa9a2ff806ebcdc765a361b689f05aecafd25e4d
Too Long, Didnt Watch
YouTube contains an incredible amount of knowledge, much of which is locked inside multi-hour videos. Let's extract and summarize with AI!
-
diarize.py- download, transrcibe and diarize audio- yt-dlp - download audio tracks of youtube videos
- ffmpeg - decompress audio
- faster_whisper - speech to text
- pyannote - diarization
-
chunker.py- break text into parts and prepare each part for LLM summarization -
roller-*.py- rolling summarization- can-ai-code - interview executors to run LLM inference
-
compare.py- prepare LLM outputs for webapp -
compare-app.py- summary viewer webapp
This project is under active development and is not ready for production use.
Description
Languages
Python
99.5%
Dockerfile
0.5%