michaelni changed the topic of #ffmpeg-devel to: Welcome to the FFmpeg development channel | Questions about using FFmpeg or developing with libav* libs should be asked in #ffmpeg | This channel is publicly logged | FFmpeg 7.1.1 has been released! | Please read ffmpeg.org/developer.html#Code-of-conduct
halloy5771 has quit [Read error: Connection reset by peer]
<cone-867>
ffmpeg Emma Worley master:854b8690a628: Add myself to MAINTAINERS for dxv/dxvenc
<fflogger>
[editedticket] CosmicSkye: Ticket #9996 ([ffmpeg] Write joc_complexity_index to dec3 (EAC3SpecificBox), Windows and Android need it to play atmos) updated https://trac.ffmpeg.org/ticket/9996#comment:14
<Lynne>
michaelni: err, you've added a person with 0 commits to maintainers?
<Lynne>
oh, nevermind
mkver has quit [Ping timeout: 248 seconds]
usagi_mimi has quit [Quit: WeeChat 4.6.3]
jamrial has quit []
Martchus has joined #ffmpeg-devel
usagi_mimi has joined #ffmpeg-devel
Martchus_ has quit [Ping timeout: 252 seconds]
System_Error has quit [Ping timeout: 264 seconds]
<cone-867>
ffmpeg Emma Worley master:6fdb54ddee69: lavc/hashtable: create generic robin hood hash table
<cone-867>
ffmpeg Emma Worley master:2de0d095b84f: lavc/dxvenc: migrate DXT1 encoder to lavc hashtable
<cone-867>
ffmpeg Emma Worley master:d4556c98f02e: lavc/dxvenc: improve compatibility with Resolume products
System_Error has joined #ffmpeg-devel
halloy5771 has joined #ffmpeg-devel
halloy5771 has quit [Quit: halloy5771]
System_Error has quit [Remote host closed the connection]
<ramiro>
haasn: on swscale6_clean, by just setting "sws[1]->flags = mode.flags | SWS_UNSTABLE;", I get a failure in "./libswscale/tests/swscale -unscaled 1 -src yuv444p -dst argb": SSIM {Y=0.925251 U=0.975193 V=0.962438 A=1.000000}, loss 0.0660362 is WORSE by 0.0660214, expected loss 1.48416e-05
<ramiro>
also with -cpuflags 0. the error oscillates between SSIM {Y=0.788072 U=0.857437 V=0.786510 A=1.000000} and SSIM {Y=0.660499 U=0.932519 V=0.900159 A=1.000000} with the same command line on different runs
<ramiro>
valgrind doesn't complain. this is... odd. I haven't investigated further
<ramiro>
for neon I have no issues with cross-lane shuffles. I have a shuffle mask of up to 128 bytes, and the num_groups calculation is a little bit different
<haasn>
ramiro: can't you just always set size = 128 and then only use the subset that you care about? (i.e. the largest gcd of *read_bytes and *write_bytes that is a multiple of the lane size)
<haasn>
not a huge fan of leaking the internal representation from inside ff_sws_solve_shuffle to the caller, but we can find a different solution
<haasn>
e.g. we could have a minimum_size and a maximum_size
<haasn>
or a lane_size and vector_size
<haasn>
where lane_size = vector_size if cross-lane shuffles are not supported
<haasn>
(or lane_size and max_lanes)
<fflogger>
[newticket] Wallboy: Ticket #11621 ([ffmpeg] Add datetime/time prefix support for FFREPORT as well) created https://trac.ffmpeg.org/ticket/11621
<thardin>
haasn: I told the local spotify guy about your scaling work. he sounded mighty impressed
<ramiro>
haasn: setting size=128 and recalculating block_size/read_bytes/write_bytes after the call to ff_sws_solve_shuffle() works. it's not pretty, but it works. thanks for the suggestion!
<haasn>
thardin: I don't suppose they would be interested in funding it?
<haasn>
ramiro: I was thinking that we should be able to output a separate or_mask to handle clearing to nonzero values
<haasn>
or does NEON have a magic value that clears to 0xff?
<ramiro>
haasn: no, I use an or mask. outputting a separate or_mask that supports all consts is a superior solution.
<haasn>
okay, I will implement that (in a bit)
<haasn>
useful for x86 as well
<haasn>
I am a bit wary about growing the shuffle solver too much, since the idea was to _avoid_ having so many bespoke fast paths, in favor of having a fast general solution
<haasn>
but we just can't beat existing asm without _some_ level of fast shuffle
<haasn>
the alternative idea I had floating in my mind was to have a dedicated SWS_OP_SHUFFLE and optimize packed reads etc down to shuffle instructions
<ramiro>
btw I think there was no non-0xff const in the conversions that used the shuffle solver. I think I ended up getting rid of that TODO from my commit
<haasn>
(this could be a generalization of byte swapping)
<ramiro>
hadn't you already tried that and decided to do it otherwise? it would require some generalized packed/planar byte representation or something
<haasn>
I don't remember exactly why I abandoned that idea
<ramiro>
thardin: also if they're interested in funding neon optimizations also let me know :P
<ramiro>
haasn: time to dig irclogs :)
<haasn>
I was thinking gray -> vuyx could need a non-1/0 clear but gray is always full range
<haasn>
anyways, I guess it's low importance
<haasn>
but I think given that the implementation is basically the same anyways, I might as well write the code to support all clear values
<haasn>
some food for thought is that if we end up splitting components into separate chains like I proposed several times, we may end up with some chains that actually have no SWS_OP_READ
<haasn>
consisting of just SWS_OP_CLEAR and SWS_OP_WRITE
<haasn>
you could revive your memset fast path for those
mkver has joined #ffmpeg-devel
jamrial has joined #ffmpeg-devel
<thardin>
haasn: good question
<thardin>
I think they have enough compute on the backend. biggest problem seems to be parallelizing decode
<ramiro>
haasn: yes, I kind of already do that with asmjit. if it's clear+(optional bswap)+write in a planar format, the clear is done only once in setup
<fflogger>
[editedticket] francoisk: Ticket #11620 ([avutil] av_malloc_array() and av_realloc_array(): nmemb and size arguments transposed) updated https://trac.ffmpeg.org/ticket/11620#comment:2
<ramiro>
haasn: another thing, one benefit of the memops neon backend I had written is that it aligns the writes. I haven't checked the impact vs just having a tight loop and unaligned writes, but if this is how memset/memcpy are implemented in libc, I guess it must be worth it.
<haasn>
ramiro: what if you define av_memset16 and av_memset32 in libavutil?
<ramiro>
haasn: that would be cleaner and make more sense
<ramiro>
I had also added a left shift operator, but that's too specific. it might be better to just drop it and have a normal loop in the asmjit backend.
<ramiro>
oh, and bswap16/32. that one might be useful as well.
<haasn>
we could allow the packed shuffle solver to handle those
<haasn>
I guess there's no real reason it currently forbids single plane outputs
<ramiro>
haasn: I tried accepting either input or output as planar, but I quickly gave up. I just didn't try hard enough. but yes, that would help. especially with independent planes
<haasn>
that + plane splitting is my preferred solution here
<haasn>
rather than trying to modify the solver to support planar
<ramiro>
haasn: that helped with gray/yuvj444p/ya8/ya16be -> gray16[bl]e
<ramiro>
I'm looking forward to seeing plane splitting :)
<ramiro>
I think I'll finally be able to have all asmjit conversions being faster than legacy swscale
<ramiro>
currently there are only a dozen or two that are slower, down to 0.5 or 0.7 iirc.
Traneptora has quit [Quit: Quit]
minimal has joined #ffmpeg-devel
<haasn>
I kinda wanted to tackle scaling before plane splitting :p
<haasn>
but I may procrastinate from that just a bit longer...
<haasn>
Lynne: have you ever tried reducing the number of queues you allocate per VkDevice?
<Lynne>
no, not really, do you think this would help with OOMs?
<haasn>
nvidia in particular seems to take significantly longer to create devices the more queues you request, and I strongly doubt there is any practical performance benefit to allocating more than, say, 2 graphics queues
<fflogger>
[newticket] lelegard: Ticket #11622 ([undetermined] Low bitrate data PID in MPEGTS disrupts live output rate) created https://trac.ffmpeg.org/ticket/11622
<haasn>
at 16 graphics queues startup overhead is around 500 ms (!)
<Lynne>
wow
<Lynne>
that's pretty bad, yeah
<haasn>
there also seems to be a static limit of 64 queues per.. process, I think
<haasn>
actually I use only a single graphics queue as I never found any benefit to multiple
<haasn>
(in libplacebo)
<haasn>
if I try to create more than 7 VkDevices the 8th fails with VK_ERROR_INITIALIZATION_FAILED
<Lynne>
there's a performance increase from using multiple video queues, beyond what just using multiple submissions gets you
<haasn>
with 1 queue per device I can create up to 63
<Lynne>
I don't mind sending a patch to allocate 1 graphics queue unless it's the only queue
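For reference, the per-family queue count is fixed at device creation through `VkDeviceQueueCreateInfo`; a minimal fragment requesting a single graphics queue, assuming `phys_dev` and `graphics_family_index` were obtained earlier via `vkEnumeratePhysicalDevices` / `vkGetPhysicalDeviceQueueFamilyProperties` (sketch only, not the actual hwcontext_vulkan code):

```c
#include <vulkan/vulkan.h>

/* Fragment: create a VkDevice with exactly one graphics queue
 * instead of requesting the family's full queue count. */
VkDevice create_single_queue_device(VkPhysicalDevice phys_dev,
                                    uint32_t graphics_family_index)
{
    const float priority = 1.0f;
    VkDeviceQueueCreateInfo queue_info = {
        .sType            = VK_STRUCTURE_TYPE_DEVICE_QUEUE_CREATE_INFO,
        .queueFamilyIndex = graphics_family_index,
        .queueCount       = 1, /* one queue, not the family maximum */
        .pQueuePriorities = &priority,
    };
    VkDeviceCreateInfo dev_info = {
        .sType                = VK_STRUCTURE_TYPE_DEVICE_CREATE_INFO,
        .queueCreateInfoCount = 1,
        .pQueueCreateInfos    = &queue_info,
    };
    VkDevice dev = VK_NULL_HANDLE;
    if (vkCreateDevice(phys_dev, &dev_info, NULL, &dev) != VK_SUCCESS)
        return VK_NULL_HANDLE;
    return dev;
}
```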
englishm has quit [Ping timeout: 268 seconds]
DauntlessOne496 has joined #ffmpeg-devel
DauntlessOne49 has quit [Ping timeout: 272 seconds]
DauntlessOne496 is now known as DauntlessOne49
englishm has joined #ffmpeg-devel
kylophone has quit [Ping timeout: 268 seconds]
Son_Goku has quit [Ping timeout: 268 seconds]
mindfreeze has quit [Ping timeout: 252 seconds]
termos__ has quit [Ping timeout: 252 seconds]
kylophone has joined #ffmpeg-devel
zulleyy3 has quit [Ping timeout: 252 seconds]
<haasn>
Lynne: https://github.com/haasn/vulkan_limits I don't suppose you mind seeing if it affects your machine as well, and is not something weird about my environment / docker container setup?
<fflogger>
[editedticket] Balling: Ticket #9996 ([ffmpeg] Write joc_complexity_index to dec3 (EAC3SpecificBox), Windows and Android need it to play atmos) updated https://trac.ffmpeg.org/ticket/9996#comment:15
<fflogger>
[editedticket] Noki0100: Ticket #11618 ([ffmpeg] hwupload filter fails with "Cannot allocate memory" for VA-API on AMD RX 7900 XT (Navi 31) preventing H.264/HEVC hardware encoding initialization.) updated https://trac.ffmpeg.org/ticket/11618#comment:4
<fflogger>
[editedticket] Noki0100: Ticket #11618 ([ffmpeg] hwupload filter fails with "Cannot allocate memory" for VA-API on AMD RX 7900 XT (Navi 31) preventing H.264/HEVC hardware encoding initialization.) updated https://trac.ffmpeg.org/ticket/11618#comment:5
linkmauve has left #ffmpeg-devel [Error from remote client]
<fflogger>
[newticket] giuseppeM99: Ticket #11623 ([ffplay] FFplay crashes when seeking in .ogg file with images) created https://trac.ffmpeg.org/ticket/11623