Commit Graph

13267 Commits (c289c867414f11a2ff1e53887e484f607bb42844)

Author SHA1 Message Date
Chocobozzz c289c86741
Add vad_filter to ctranslate transcriber
Helps us to correctly detect the language if there is no voice in the
first 30 seconds

Also helps to lower hallucinations
2024-07-03 15:23:26 +02:00
Chocobozzz fb5236f2af
Bumped to version v6.2.0-rc.1 2024-07-03 09:31:32 +02:00
Chocobozzz f5104fb234
Don't transcribe/encode videos with invalid state 2024-07-03 09:09:20 +02:00
Chocobozzz 2d26eff129
Don't create pendingTranscription twice 2024-07-03 08:52:05 +02:00
Chocobozzz 91302ea12a
Fix CI stats task 2024-07-02 14:06:52 +02:00
Chocobozzz 0870a3fe51
Fix check logs params tests 2024-07-02 13:16:59 +02:00
Chocobozzz 532d8d4a39
Update translations 2024-07-02 11:42:21 +02:00
Jeff Huang 81455aa2f1
Translated using Weblate (Chinese (Traditional))
Currently translated at 100.0% (2420 of 2420 strings)

Translation: PeerTube/angular
Translate-URL: https://weblate.framasoft.org/projects/peertube/angular/zh_Hant/
2024-07-02 11:29:57 +02:00
josé m 19ad60aa5e
Translated using Weblate (Galician)
Currently translated at 100.0% (2420 of 2420 strings)

Translation: PeerTube/angular
Translate-URL: https://weblate.framasoft.org/projects/peertube/angular/gl/
2024-07-02 11:29:57 +02:00
Danail Emandiev 3627d15cea
Translated using Weblate (Bulgarian)
Currently translated at 72.9% (186 of 255 strings)

Translation: PeerTube/server
Translate-URL: https://weblate.framasoft.org/projects/peertube/server/bg/
2024-07-02 11:29:57 +02:00
Danail Emandiev de18b731b4
Translated using Weblate (Bulgarian)
Currently translated at 100.0% (127 of 127 strings)

Translation: PeerTube/player
Translate-URL: https://weblate.framasoft.org/projects/peertube/player/bg/
2024-07-02 11:29:57 +02:00
Danail Emandiev 3269e0b138
Translated using Weblate (Bulgarian)
Currently translated at 16.4% (215 of 1305 strings)

Translation: PeerTube/angular
Translate-URL: https://weblate.framasoft.org/projects/peertube/angular/bg/
2024-07-02 11:29:57 +02:00
Chocobozzz 044bd2300d
Prepare changelog for next release 2024-07-02 11:29:42 +02:00
Chocobozzz 5ab681802b
Add ability to force transcription 2024-07-02 11:22:51 +02:00
Chocobozzz ad9eb48bab
Display appropriate message on bad req 2024-07-02 11:02:56 +02:00
Chocobozzz bd87b4271b
Ensure package dependencies are built 2024-07-02 10:46:29 +02:00
Chocobozzz f1aae9163f
Fix hls seeking 2024-07-02 10:45:42 +02:00
Chocobozzz f81bc3d3dc
Relax client log rate limiting 2024-07-02 09:43:07 +02:00
Chocobozzz 01b3396c80
Better view handling with looped videos 2024-07-02 09:41:33 +02:00
Chocobozzz 9ee467b9cd
Better whisper logging 2024-07-02 09:01:54 +02:00
Chocobozzz ebcf3329f5
More robust transcription error job handler 2024-07-01 14:38:19 +02:00
Chocobozzz 85678854c2
Add transcription doc for remote runners 2024-07-01 14:32:53 +02:00
Chocobozzz a7fc7c97a6
Update peertube runner version
Includes transcription
2024-07-01 11:30:13 +02:00
Chocobozzz b45ed3c05a
Do not run transcription/transcoding on lives 2024-07-01 11:15:09 +02:00
Chocobozzz 45d22afca6
Add manual run to docker ci build 2024-07-01 10:53:19 +02:00
Chocobozzz dcfd83350a
Faster CI docker build 2024-07-01 10:50:29 +02:00
Chocobozzz 3b5b050a98
Include transcription package in builds 2024-07-01 10:12:48 +02:00
Chocobozzz 6b2f2417d5
Trigger gitpod rebuild
To update nodejs version
2024-07-01 08:48:59 +02:00
Chocobozzz b570d40beb
Prune storage can also prune remote object storage 2024-06-28 15:36:11 +02:00
Chocobozzz 8faed816ab
Fix lint 2024-06-28 14:43:03 +02:00
Chocobozzz 3a71086e35
Correctly handle transcription conflicts 2024-06-28 14:42:19 +02:00
kontrollanten 3c9d1ff5e3 client: extend logging
To make it easier to debug issues.
2024-06-28 09:21:45 +02:00
Grant 4bc29f6d3f Add Express req, res to filter:api.search.videos.local.list.params 2024-06-28 09:20:39 +02:00
Chocobozzz 0b30e58f6f
Remove verbose option from transcription
Can be specified on-demand using NODE_DEBUG=execa env variable
2024-06-28 08:51:35 +02:00
Chocobozzz b66963fe6f
Runner can choose job type 2024-06-28 08:44:59 +02:00
Chocobozzz fd4831e502
Metadata to know if the caption is auto generated 2024-06-28 08:44:59 +02:00
Chocobozzz 1bfb791e05
Integrate transcription in PeerTube 2024-06-28 08:44:58 +02:00
lutangar ef14cf4a5c
feat(transcription): groundwork
chore: fiddling around some more

chore: add ctranslate2 and timestamped

chore: add performance markers

chore: refactor test

chore: change workflow name

chore: ensure Python3

chore(duration): convert to chai/mocha syntax

chore(transcription): add individual tests for other transcribers

chore(transcription): implement formats test of all implementations

Also compare result of other implementation to the reference implementation

chore(transcription): add more test cases with other languages, model sizes and a local model

chore(test): wip ctranslate 2 adapat

chore(transcription): wip transcript file and benchmark

chore(test): clean a bit

chore(test): clean a bit

chore(test): refactor timestamped spec

chore(test): update workflow

chore(test): fix glob expansion with sh

chore(test): extract some hw info

chore(test): fix async tests

chore(benchmark): add model info

feat(transcription): allow use of a local model in timestamped-whisper

feat(transcription): extract run and profiling info in own value object

feat(transcription): extract run concept in own class and run more benchmarks

chore(transcription): simplify run object (only a uuid is now needed) and add more benchmark scenarios

docs(transcription): creates own package readme

docs(transcription): add local model usage

docs(transcription): update README

fix(transcription): use fr video for better comparison

chore(transcription): make openai comparison pass

docs(timestamped): clea

chore(transcription): change transcribers transcribe method signature

Introduce whisper builtin model.

fix(transcription): activate language detection

Forbid transcript creation without a language.
Add `languageDetection` flag to an engine and some assertions.

Fix an issue in `whisper-ctranslate2` :
https://github.com/Softcatala/whisper-ctranslate2/pull/93

chore(transcription): use PeerTube time helpers instead of custom ones

Update existing time function to output an integer number of seconds and add a ms human-readable time formatter with hints of tests.

chore(transcription): use PeerTube UUID helpers

chore(transcription): enable CER evaluation

Thanks to this recent fix in Jiwer <3
https://github.com/jitsi/jiwer/issues/873

chore(jiwer): creates JiWer package

I'm not very happy with the TranscriptFileEvaluator constructor... suggestions ?

chore(JiWer): add usage in README

docs(jiwer): update JiWer readme

chore(transcription): use FunMOOC video in fixtures

chore(transcription): add proper english video fixture

chore(transcription): use os tmp directory where relevant

chore(transcription): fix jiwer cli test reference.txt

chore(transcription): move benchmark out of tests

chore(transcription): remove transcription workflow

docs(transcription): add benchmark info

fix(transcription): use ms precision in other transcribers

chore(transcription): simplify most of the tests

chore(transcription): remove slashes when building path with join

chore(transcription): make fromPath method async

chore(transcription): assert path to model is a directory for CTranslate2 transcriber

chore(transcription): ctranslate2 assertion

chore(transcription): ctranslate2 assertion

chore(transcription): add preinstall script for Python dependencies

chore(transcription): add download and unzip utils functions

chore(transcription): add download and unzip utils functions

chore(transcription): download & unzip models fixtures

chore(transcription): zip

chore(transcription): raise download file test timeout

chore(transcription): simplify download file test

chore(transcription): add transcriptions test to CI

chore(transcription): raise test preconditions timeout

chore(transcription): run preinstall scripts before running ci

chore(transcription): create dedicated tmp folder for transcriber tests

chore(transcription): raise timeout some more

chore(transcription): raise timeout some more

chore(transcription): raise timeout some more

chore(transcription): raise timeout some more

chore(transcription): raise timeout some more

chore(transcription): raise timeout some more

chore(transcription): raise timeout some more

chore(transcription): raise timeout some more

chore(transcription): use short video for local model test

chore(transcription): raise timeout some more

chore(transcription): raise timeout some more

chore(transcription): raise timeout some more

chore(transcription): setup verbosity based on NODE_ENV value
2024-06-28 08:43:40 +02:00
Chocobozzz b10482e0e0
Fix lint 2024-06-27 14:34:18 +02:00
Chocobozzz ef0a6b267b
Add server restart test 2024-06-27 11:56:48 +02:00
Chocobozzz 564089d3e6
Fix E2E tests 2024-06-27 11:48:03 +02:00
Chocobozzz 43e186ef44
Add views tag to middlewares too 2024-06-27 10:51:36 +02:00
Chocobozzz 2728810f60
Remove bundlewatch
CI fails: our project generates too many chunks, unfortunately
2024-06-26 14:20:45 +02:00
Chocobozzz 9b2a054e54
Fix lint 2024-06-26 14:20:06 +02:00
Chocobozzz 9772280e99
Upgrade to angular 18 & vite 2024-06-26 08:33:54 +02:00
Chocobozzz ec33467261
Update client dependencies 2024-06-24 10:01:38 +02:00
Chocobozzz bc8c853a53
Fix lint 2024-06-24 09:48:57 +02:00
Chocobozzz c49b67b16f
Update translations 2024-06-24 09:40:42 +02:00
chocobozzz af67a6e13a
Translated using Weblate (Ukrainian)
Currently translated at 89.7% (2169 of 2418 strings)

Translation: PeerTube/angular
Translate-URL: https://weblate.framasoft.org/projects/peertube/angular/uk/
2024-06-24 09:34:13 +02:00
Besnik Bleta 8c93ecfa09
Translated using Weblate (Albanian)
Currently translated at 99.3% (144 of 145 strings)

Translation: PeerTube/player
Translate-URL: https://weblate.framasoft.org/projects/peertube/player/sq/
2024-06-24 09:20:41 +02:00