Mirror of https://github.com/jlengrand/tldw.git
TL/DW: Too Long, Didn't Watch
YouTube contains an incredible amount of knowledge, much of which is locked inside multi-hour videos. Let's extract and summarize with AI!
Pieces
- `diarize.py` - download, transcribe and diarize audio
  - First uses yt-dlp to download audio (optionally video) from the supplied URL
  - Next, uses ffmpeg to convert the resulting `.m4a` file to `.wav`
  - Then uses faster_whisper to transcribe the `.wav` file to `.txt`
  - After that, uses pyannote to perform diarization
  - Finally, sends the resulting txt to an LLM endpoint of your choice for summarization of the text
  - Goal is to support OpenAI/Claude/Cohere/Groq/local OpenAI endpoints (oobabooga/llama.cpp/exllama2), so you can either do a batch query to one endpoint or feed them in one at a time. Your choice.
- `chunker.py` - break text into parts and prepare each part for LLM summarization
- `roller-*.py` - rolling summarization
- `can-ai-code` - interview executors to run LLM inference
- `compare.py` - prepare LLM outputs for the webapp
- `compare-app.py` - summary viewer webapp
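The splitting that `chunker.py` performs can be sketched roughly as below. This is a minimal illustration, not the actual implementation: the function name, chunk size, and overlap value are illustrative assumptions, and the real script also prepares each part for the LLM prompt.

```python
def chunk_text(text: str, max_chars: int = 4000, overlap: int = 200) -> list[str]:
    """Split a transcript into overlapping chunks that fit an LLM context window."""
    chunks = []
    start = 0
    while start < len(text):
        end = min(start + max_chars, len(text))
        chunks.append(text[start:end])
        if end == len(text):
            break
        # Step back by `overlap` so a sentence cut at a chunk boundary
        # appears in both neighboring chunks.
        start = end - overlap
    return chunks
```

Each chunk can then be summarized independently, and the per-chunk summaries fed back in for a final pass (the rolling summarization that `roller-*.py` refers to).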
Setup
- Linux
  - Download the necessary packages (Python3, ffmpeg [`sudo apt install ffmpeg` / `dnf install ffmpeg`], ?)
  - Create a virtual env: `python -m venv ./`
  - Launch/activate your virtual env: `. ./bin/activate`
  - See Linux && Windows below
- Windows
  - Download the necessary packages (Python3, ffmpeg, ?)
  - Create a virtual env: `python -m venv .\`
  - Launch/activate your virtual env: `.\Scripts\activate.ps1`
  - See Linux && Windows below
- Linux && Windows
  - `pip install -r requirements.txt` - may take a bit of time...
  - Run `python ./diarize.py <video_url>`
    - The video URL does not have to be a YouTube URL; it can be any site that yt-dlp supports.
  - You'll then be asked whether you'd like to run the transcription on GPU (1) or CPU (2).
  - Next, the video will be downloaded to the local directory by yt-dlp.
  - Then the video will be transcribed by faster_whisper. (You can follow this in the console output.)
    - The resulting transcription is stored as both a JSON file with timestamps and a txt file without timestamps.
  - Finally, you can have the transcription summarized by feeding it into an LLM of your choice.
    - For running it locally, here are the commands to do so: FIXME
    - For feeding the transcriptions to the API of your choice, simply use the corresponding script for your API provider. FIXME: add scripts for OpenAI API (generic) and others
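Because the local backends listed above (oobabooga, llama.cpp) expose OpenAI-compatible endpoints, the request body for the summarization step has the same shape whether you point it at OpenAI or at a local server. A hedged sketch of building that payload follows; the function name, model name, prompt wording, and temperature are illustrative assumptions, not what `diarize.py` actually sends.

```python
import json

def build_summary_request(transcript: str, model: str = "gpt-3.5-turbo") -> str:
    """Build a chat-completions JSON body for an OpenAI-compatible endpoint.

    The same payload shape works for OpenAI itself and for local servers
    (oobabooga / llama.cpp) that serve /v1/chat/completions.
    """
    payload = {
        "model": model,  # local servers typically ignore or remap this
        "messages": [
            {"role": "system",
             "content": "Summarize the following video transcript concisely."},
            {"role": "user", "content": transcript},
        ],
        "temperature": 0.3,  # keep summaries stable rather than creative
    }
    return json.dumps(payload)
```

The body would then be POSTed with any HTTP client to the endpoint's `/v1/chat/completions` path with an `Authorization: Bearer <key>` header; local endpoints usually accept any key.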
Credits