#ffmpeg-devel on 2025-10-04 — irc logs at libera.catirclogs.org

2025-08-22 10:25 michaelni changed the topic of #ffmpeg-devel to: Welcome to the FFmpeg development channel | Questions about using FFmpeg or developing with libav* libs should be asked in #ffmpeg | This channel is publicly logged | FFmpeg 8.0 has been released! | Please read ffmpeg.org/developer.html#Code-of-conduct

00:04 justache is now known as KennyLog-in

00:07 DauntlessOne4985 has joined #ffmpeg-devel

00:16 <Guest40> Well, since asking here saves me a little effort...

00:16 <Guest40> I'm modifying avfilter/asrc_flite to add support for generating speech using user-provided .flitevox files, as well as using user-provided pronunciations from text files.

00:16 <Guest40> I want to use the existing voice_entry struct to store cached loaded voices in a static data structure.

00:16 <Guest40> I tried to use an av_tree for storing these entries, but I've concluded that there's no correct way to use it in a way that meets my needs.

00:16 <Guest40> Namely, to retrieve a cached voice, I need to search for the voice_entry whose source file path matches the one passed as an option to the filter,

00:16 <Guest40> and for removing a cached voice that is no longer in use, I need to search for a voice_entry whose voice is equal to flite->voice.

00:16 <Guest40> As a result, I've decided to use a dynamic array of pointers with corresponding count and capacity variables.

00:16 <Guest40> Is there an appropriate pre-existing libavutil data structure I could use, and if not, would it be appropriate to add my implementation to libavutil as part of its own patch?

00:23 <mkver> Guest40: av_dynarray2_add

00:23 <mkver> But how would one register user-provided files at all?

00:24 welder has quit [Quit: WeeChat 4.6.3]

00:24 <Guest40> flite/flite.h exposes a function `cst_voice *flite_voice_load(char *)`, which I call using an argument passed as a filter option.

00:25 <mkver> And how does flite know of the user-provided files?

00:25 <mkver> And given that it is handled by flite, it is not properly reference-counted, is it?

00:26 <Guest40> As part of unregistering the loaded voice in the created voice entry's unregister_fn, I call delete_voice.

00:26 <Guest40> I've actually tested all of this and have gotten it to work with no apparent bugs.

00:27 <mkver> My flite.h doesn't have a delete_voice.

00:27 <mkver> Apparently it's in cst_voice.h

00:28 <Guest40> yeah, I was gonna say that it was included in a different header that flite.h itself includes.

00:33 <Guest40> With respect to using the av_dynarray_add family of functions, I didn't think to use them because I only saw them used in dense arrays. In my use case I'm nulling out the array indices of unregistered voice entries, while my insertion code puts the to-be-inserted item in the first null index, only expanding the array if it's full. But I suppose I can use one of those functions specifically in the branch of insertion code that gets run if the array needs expanding.
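
The insertion scheme Guest40 describes (reuse the first NULL slot, grow only when the array is full) could look roughly like the sketch below. This is not the code from the patch: `SlotArray`, `slot_array_insert` and `slot_array_remove` are hypothetical names, and plain `realloc` stands in for whichever libavutil helper would be used in the growth branch.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical sparse pointer array: removed entries are set to NULL
 * and their slots are reused before the array is grown. */
typedef struct SlotArray {
    void  **items;
    size_t  capacity;
} SlotArray;

/* Stores item in the first free slot; returns its index, or -1 if
 * growing the array failed. */
static int slot_array_insert(SlotArray *a, void *item)
{
    size_t i, new_cap;
    void **tmp;

    assert(item); /* NULL marks an empty slot, so it cannot be stored */

    for (i = 0; i < a->capacity; i++) {
        if (!a->items[i]) {
            a->items[i] = item;
            return (int)i;
        }
    }

    /* No free slot: double the capacity (starting at 4). */
    new_cap = a->capacity ? a->capacity * 2 : 4;
    tmp = realloc(a->items, new_cap * sizeof(*tmp));
    if (!tmp)
        return -1;
    for (i = a->capacity; i < new_cap; i++)
        tmp[i] = NULL;

    i = a->capacity;      /* first slot of the newly added region */
    tmp[i] = item;
    a->items    = tmp;
    a->capacity = new_cap;
    return (int)i;
}

/* Clears the slot holding item, if any; the array never shrinks. */
static void slot_array_remove(SlotArray *a, const void *item)
{
    for (size_t i = 0; i < a->capacity; i++) {
        if (a->items[i] == item) {
            a->items[i] = NULL;
            return;
        }
    }
}
```

Both lookups mentioned earlier (by source path and by voice pointer) would be linear scans over the same array, which is probably acceptable for a handful of cached voices.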

00:33 <mkver> Could we actually use delete_voice() instead of calling the per-voice unregister_* function?

00:34 <Guest40> I'll have to take a look at that in comparison to the unregister_cmu_us_XXX functions.

00:42 <Guest40> unregister_cmu_us_XXX just calls delete_voice and nulls out the global pointer for the specified voice. That second part is essential because otherwise, calling register_cmu_us_XXX again after that would return a pointer to already-freed memory.

00:42 <Guest40> So we can't just use delete_voice as the unregister_fn for the built-in voices.
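
The point about the global pointer can be illustrated with a small stand-alone model. Nothing here is flite code: `Voice`, `builtin_voice`, `register_builtin` and `unregister_builtin` are hypothetical stand-ins for `cst_voice`, the per-voice global, and the `register_cmu_us_XXX` / `unregister_cmu_us_XXX` pair as Guest40 describes them.

```c
#include <assert.h>
#include <stdlib.h>

typedef struct Voice { int dummy; } Voice;  /* stand-in for cst_voice */

/* Stand-in for the per-voice global pointer kept by a built-in voice. */
static Voice *builtin_voice;

/* Returns the cached voice if one exists, otherwise creates it. */
static Voice *register_builtin(void)
{
    if (builtin_voice)
        return builtin_voice;
    builtin_voice = calloc(1, sizeof(*builtin_voice));
    return builtin_voice;
}

/* Frees the voice AND resets the global. Freeing alone would leave
 * builtin_voice dangling, so the next register_builtin() call would
 * hand out a pointer to already-freed memory. */
static void unregister_builtin(Voice *v)
{
    assert(v == builtin_voice);
    free(v);
    builtin_voice = NULL;
}
```

In this model, a bare `free(v)` in place of `unregister_builtin(v)` is the analogue of using `delete_voice` directly as the `unregister_fn` for a built-in voice.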

00:45 <mkver> Is there a way to register one of the built-in voices like the user provided ones?

00:51 <Guest40> doesn't look like there is.

00:52 <mkver> Damn. This would solve the refcounting issue.

00:53 kasper93 has quit [Remote host closed the connection]

00:53 kasper93 has joined #ffmpeg-devel

01:34 <mkver> Guest40: How expensive is initializing a voice actually?

01:38 System_Error has quit [Remote host closed the connection]

01:39 minimal has quit [Quit: Leaving]

01:45 System_Error has joined #ffmpeg-devel

01:59 kode54 has quit [Quit: WeeChat 4.7.1]

02:00 kode54 has joined #ffmpeg-devel

02:03 <Guest40> mkver: Can't seem to get perf on the distro I'm currently using.

02:03 <mkver> And how much mem does it take (approximately)?

02:26 realies24 has joined #ffmpeg-devel

02:27 realies2 has quit [Read error: Connection reset by peer]

02:27 realies24 is now known as realies2

02:44 <Guest40> I'm really not used to using profiling tools like this.

02:44 <Guest40> Could I be pointed in the direction of what tools I should use?

02:48 <mkver> valgrind: the default tool gives the number of allocations of your program as well as the combined size of them; and massif (https://valgrind.org/docs/manual/ms-manual.html) can do even more than that.

02:50 <Guest40> And if you also want to know the execution time of initializing a built-in voice versus one loaded from a file?

02:52 <mkver> I am not really interested in that.

02:53 <mkver> The reason I am asking for this info is because I want to know whether sharing the voices provided by files is worth it at all.

02:53 <fjlogger> [FFmpeg/FFmpeg] New comment on pull request #20399 configure: provide vulkan cflags (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20399#issuecomment-11192) by L⁠ynne

02:53 <Guest40> Ah, okay. I'll give you the memory costs. I'll be comparing a builtin voice and the same voice loaded from a flitevox file.

02:54 <mkver> There is also a second reason: Who says that the same filename always refers to the same file? tmpfiles can change.

02:56 <mkver> Actually, the comparison should be "voice loaded from flitevox file" versus "not loading a voice at all". Because this will show the cost of loading a voice.

03:09 <Guest40> A clustergen voice (one of the more complex types of supported flitevox voice files) takes about 15MB.

03:11 <Guest40> mkver: Is 15MB a big enough memory cost to justify caching voices?

03:12 <Guest40> s/15/2

03:12 <Guest40> ugh, not used to IRC

03:12 <mkver> 2MB does not sound like too much.

03:13 <Guest40> Actually, it does look like 15 MB. Just thought I was misreading the units, but I wasn't.

03:14 <mkver> 15MB is indeed a bit much; but remember the "tmpfiles can change"? Who says that they don't?

03:15 <Guest40> You have a good point.

03:15 <Guest40> Just in case a tmpfile changes in the time between it being loaded for one filter, and a second filter requesting to load it BEFORE the first filter has finished using the voice it loaded from that file.

03:16 Kimapr has quit [Ping timeout: 264 seconds]

03:17 <Guest40> There's a tradeoff between memory usage and the risks of what feels like an edge case to me at first glance.

03:18 Kimapr has joined #ffmpeg-devel

03:20 <Guest40> Would it be appropriate to have the voice cache getter use the mtime of the file as part of its check to determine if we're loading the same file?

03:23 <Guest40> If not, would the hash of the file be usable in a similar way?

03:28 <mkver> I am undecided on all of this; it should best be discussed in the PR.
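
One way the mtime idea could be sketched, assuming a POSIX system: record a few `stat()` fields when a voice is cached and compare them on the next lookup. `CacheKey` and both functions are hypothetical, and comparing device, inode and size in addition to mtime is an extra assumption that was not proposed in the discussion.

```c
#include <assert.h>
#include <sys/stat.h>
#include <sys/types.h>
#include <time.h>

/* Hypothetical identity of a file at the moment it was loaded. */
typedef struct CacheKey {
    dev_t  dev;    /* device containing the file */
    ino_t  ino;    /* inode number */
    off_t  size;   /* size in bytes */
    time_t mtime;  /* last modification time, in seconds */
} CacheKey;

/* Fills key from the file's current metadata; returns 0 on success,
 * -1 if the file cannot be stat()ed. */
static int cache_key_from_path(const char *path, CacheKey *key)
{
    struct stat st;

    assert(path && key);
    if (stat(path, &st) != 0)
        return -1;
    key->dev   = st.st_dev;
    key->ino   = st.st_ino;
    key->size  = st.st_size;
    key->mtime = st.st_mtime;
    return 0;
}

/* A cached voice is reused only if every recorded field still matches. */
static int cache_key_equal(const CacheKey *a, const CacheKey *b)
{
    return a->dev == b->dev && a->ino == b->ino &&
           a->size == b->size && a->mtime == b->mtime;
}
```

With one-second mtime resolution, a same-size rewrite within the same second would go unnoticed, so this narrows the window rather than closing it. A content hash would avoid that, at the cost of reading the whole file on every lookup.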

03:32 Martchus has joined #ffmpeg-devel

03:34 Martchus_ has quit [Ping timeout: 245 seconds]

04:16 Kimapr has quit [Ping timeout: 248 seconds]

04:20 jamrial has quit []

04:26 <fjlogger> [FFmpeg/FFmpeg:master] 1 new commit (https://code.ffmpeg.org/FFmpeg/FFmpeg/compare/ab7d1c64c9aa9186acb1d988d020e59f2d3defce...15a9c8dea32048d9746329430313f5700ba74ff3) pushed by F⁠Fmpeg

04:26 <fjlogger> [FFmpeg/FFmpeg] Pull request #20633 merged: avcodec/liblc3enc: Avoid allocating buffer to send a zero frame (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20633) by m⁠kver

04:48 Kimapr has joined #ffmpeg-devel

05:12 _whitelogger has joined #ffmpeg-devel

05:13 BradleyS has quit [Read error: Connection reset by peer]

05:13 BradleyS has joined #ffmpeg-devel

05:28 _whitelogger has joined #ffmpeg-devel

06:50 Kimapr has joined #ffmpeg-devel

06:54 <fjlogger> [FFmpeg/FFmpeg] Pull request #20643 opened: avutil/attributes: don't force format checking to __gnu_printf__ on mingw build (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20643) by k⁠asper93

06:58 <fjlogger> [FFmpeg/FFmpeg] Pull request #20644 opened: avfilter_flite_voicefile_support (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20644) by Y⁠0SH1M4S73R

07:13 Guest40 has quit [Quit: Client closed]

07:20 Kimapr has quit [Remote host closed the connection]

07:21 Kimapr has joined #ffmpeg-devel

08:03 mateo` has quit [Ping timeout: 256 seconds]

08:48 rvalue has quit [Read error: Connection reset by peer]

08:50 rvalue has joined #ffmpeg-devel

10:07 <fjlogger> [FFmpeg/FFmpeg] New comment on pull request #20642 Allow the user to limit metadata length and bext coding history (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20642#issuecomment-11212) by k⁠odawah

10:10 Kimapr_ has joined #ffmpeg-devel

10:10 Kimapr has quit [Remote host closed the connection]

10:35 Kimapr_ has quit [Ping timeout: 240 seconds]

11:09 bsFFFFFF has joined #ffmpeg-devel

11:25 <fjlogger> [FFmpeg/FFmpeg] New comment on pull request #20642 Allow the user to limit metadata length and bext coding history (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20642#issuecomment-11213) by m⁠ichaelni

11:35 minimal has joined #ffmpeg-devel

11:44 acryo has quit [Read error: Connection reset by peer]

11:49 zsoltiv has quit [Ping timeout: 248 seconds]

11:50 zsoltiv__ has quit [Ping timeout: 244 seconds]

11:54 zsoltiv_ has joined #ffmpeg-devel

11:55 zsoltiv has joined #ffmpeg-devel

12:03 acryo has joined #ffmpeg-devel

12:13 mkver has quit [Ping timeout: 248 seconds]

12:19 mkver has joined #ffmpeg-devel

12:49 <fjlogger> [FFmpeg/FFmpeg] New comment on pull request #20642 Allow the user to limit metadata length and bext coding history (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20642#issuecomment-11214) by k⁠odawah

13:16 jamrial has joined #ffmpeg-devel

13:20 Kimapr_ has joined #ffmpeg-devel

14:06 <fjlogger> [FFmpeg/FFmpeg] New comment on pull request #20615 libavfilter: cuda and alpha mode (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20615#issuecomment-11217) by q⁠uink

14:06 <fjlogger> [FFmpeg/FFmpeg] Pull request #20586 closed: avformat/movenc_ttml: fix memleaks (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20586) by q⁠uink

14:08 <fjlogger> [FFmpeg/FFmpeg] New comment on pull request #20615 libavfilter: cuda and alpha mode (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20615#issuecomment-11220) by B⁠tbN

14:50 mohit has joined #ffmpeg-devel

14:50 mohit has quit [Client Quit]

14:51 mateo` has joined #ffmpeg-devel

15:47 <fjlogger> [FFmpeg/FFmpeg] Pull request #20645 opened: h264qpel (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20645) by m⁠kver

15:49 ___nick___ has joined #ffmpeg-devel

15:52 BradleyS has quit [Quit: quit]

15:54 BradleyS has joined #ffmpeg-devel

15:56 tufei has quit [Remote host closed the connection]

16:02 elvis_a_presley has quit [Quit: smoke-bomb ; grapple-hook]

16:02 elvis_a_presley has joined #ffmpeg-devel

16:06 <fjlogger> [FFmpeg/FFmpeg:master] 2 new commits (https://code.ffmpeg.org/FFmpeg/FFmpeg/compare/8fad52bd57d5bcedce8dc4ae3166c1a50f895690...e05f8acabff468c1382277c1f31fa8e9d90c3202) pushed by F⁠Fmpeg

16:06 <fjlogger> [FFmpeg/FFmpeg] Pull request #20634 merged: vf_blend (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20634) by m⁠kver

16:17 ___nick___ has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

16:17 ___nick___ has joined #ffmpeg-devel

16:23 welder has joined #ffmpeg-devel

16:25 mkver has quit [Ping timeout: 256 seconds]

16:44 tufei has joined #ffmpeg-devel

17:00 tufei_ has joined #ffmpeg-devel

17:01 tufei has quit [Ping timeout: 272 seconds]

17:17 ___nick___ has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

17:18 DauntlessOne4985 has quit [Ping timeout: 248 seconds]

17:19 ___nick___ has joined #ffmpeg-devel

17:21 ___nick___ has quit [Client Quit]

17:23 <fjlogger> [FFmpeg/FFmpeg] Pull request #20646 opened: tests/checkasm: add a test for dcadsp (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20646) by j⁠amrial

17:23 ___nick___ has joined #ffmpeg-devel

17:27 <fjlogger> [FFmpeg/FFmpeg] New comment on pull request #20646 tests/checkasm: add a test for dcadsp (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20646#issuecomment-11237) by C⁠ourmisch

17:37 <kasper93> wbs: I've added you as reviewer on #20643 if you don't mind.

17:50 microlappy has joined #ffmpeg-devel

17:55 microlappy has quit [Remote host closed the connection]

17:56 microlappy has joined #ffmpeg-devel

17:56 microlappy has quit [Remote host closed the connection]

17:57 microlappy has joined #ffmpeg-devel

18:02 microlappy has quit [Remote host closed the connection]

18:05 <fjlogger> [FFmpeg/FFmpeg] New comment on pull request #20642 Allow the user to limit metadata length and bext coding history (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20642#issuecomment-11240) by m⁠ichaelni

18:11 <fjlogger> [FFmpeg/FFmpeg] New comment on pull request #20642 Allow the user to limit metadata length and bext coding history (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20642#issuecomment-11241) by k⁠odawah

18:27 arch1t3cht1 is now known as arch1t3cht

18:29 <fjlogger> [FFmpeg/FFmpeg] New comment on pull request #20646 tests/checkasm: add a test for dcadsp (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20646#issuecomment-11243) by j⁠amrial

18:33 <fjlogger> [FFmpeg/FFmpeg] New comment on pull request #20643 avutil/attributes: don't force format checking to __gnu_printf__ on mingw build (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20643#issuecomment-11244) by r⁠amiro

18:38 <fjlogger> [FFmpeg/FFmpeg] New comment on pull request #20643 avutil/attributes: don't force format checking to __gnu_printf__ on mingw build (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20643#issuecomment-11247) by k⁠asper93

18:39 Everything has joined #ffmpeg-devel

18:51 <fjlogger> [FFmpeg/FFmpeg] New comment on pull request #20642 Allow the user to limit metadata length and bext coding history (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20642#issuecomment-11250) by m⁠ichaelni

19:37 MisterMinister has joined #ffmpeg-devel

19:52 Kei_N_ has joined #ffmpeg-devel

19:53 Kei_N has quit [Read error: Connection reset by peer]

20:04 ___nick___ has quit [Ping timeout: 264 seconds]

20:08 iive has joined #ffmpeg-devel

20:14 <fjlogger> [FFmpeg/FFmpeg] New comment on pull request #20643 avutil/attributes: don't force format checking to __gnu_printf__ on mingw build (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20643#issuecomment-11252) by J⁠amaika1

20:31 <fjlogger> [FFmpeg/FFmpeg] New comment on pull request #20643 avutil/attributes: don't force format checking to __gnu_printf__ on mingw build (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20643#issuecomment-11254) by m⁠storsjo

20:44 MisterMinister has quit [Ping timeout: 240 seconds]

20:45 MisterMinister has joined #ffmpeg-devel

20:49 microlappy has joined #ffmpeg-devel

20:52 microlappy has quit [Remote host closed the connection]

20:53 microlappy has joined #ffmpeg-devel

20:53 Everything has quit [Quit: leaving]

20:53 microlappy has quit [Remote host closed the connection]

21:02 <fjlogger> [FFmpeg/FFmpeg] New comment on pull request #20643 avutil/attributes: don't force format checking to __gnu_printf__ on mingw build (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20643#issuecomment-11256) by k⁠asper93

21:08 <fjlogger> [FFmpeg/FFmpeg] New comment on pull request #20643 avutil/attributes: don't force format checking to __gnu_printf__ on mingw build (https://code.ffmpeg.org/FFmpeg/FFmpeg/pulls/20643#issuecomment-11258) by k⁠asper93

21:24 bsFFFFFF has quit [Quit: bsFFFFFF]

22:03 Kimapr_ has quit [Remote host closed the connection]

22:03 Kimapr_ has joined #ffmpeg-devel

23:06 iive has quit [Ping timeout: 240 seconds]