#ffmpeg-devel on 2025-05-20 — irc logs at libera.catirclogs.org

2025-03-03 01:04 michaelni changed the topic of #ffmpeg-devel to: Welcome to the FFmpeg development channel | Questions about using FFmpeg or developing with libav* libs should be asked in #ffmpeg | This channel is publicly logged | FFmpeg 7.1.1 has been released! | Please read ffmpeg.org/developer.html#Code-of-conduct

01:08 <fflogger> [editedticket] socram: Ticket #11493 ([ffmpeg] ffmpeg causes mkv corruption when remuxing existing MKV with WEBVTT (.vtt) subtitles that have empty cue blocks) updated https://trac.ffmpeg.org/ticket/11493#comment:2

01:22 _whitelogger has joined #ffmpeg-devel

02:21 _whitelogger has joined #ffmpeg-devel

02:30 minimal has quit [Quit: Leaving]

02:46 mkver has joined #ffmpeg-devel

02:56 mkver has quit [Ping timeout: 265 seconds]

03:14 prathamd_ has quit [Ping timeout: 276 seconds]

03:27 AtleoS has quit [Quit: AtleoS]

03:28 jamrial has quit []

03:57 _whitelogger has joined #ffmpeg-devel

04:01 Martchus_ has joined #ffmpeg-devel

04:02 Martchus has quit [Ping timeout: 260 seconds]

04:50 bwu25 has joined #ffmpeg-devel

04:51 bwu25 has quit [Client Quit]

04:57 bwu25 has joined #ffmpeg-devel

05:00 System_Error has quit [Remote host closed the connection]

05:07 System_Error has joined #ffmpeg-devel

05:14 Anthony_ZO has joined #ffmpeg-devel

05:33 bwu25 has quit [Quit: bwu25]

07:06 compn has quit [Read error: Connection reset by peer]

07:07 compn has joined #ffmpeg-devel

07:08 TheVibeCoder has joined #ffmpeg-devel

07:10 <TheVibeCoder> FFmpeg is full of bugs, switch to Rust projects immediately, yesterday was super time for switch, today is also good time.

07:29 zulleyy3 has quit [Ping timeout: 244 seconds]

07:31 zulleyy3 has joined #ffmpeg-devel

07:33 ngaullier has joined #ffmpeg-devel

07:34 ngaullier has quit [Remote host closed the connection]

07:37 ngaullier has joined #ffmpeg-devel

07:52 IndecisiveTurtle has quit [Ping timeout: 248 seconds]

07:55 DVedaa has quit [Read error: Connection reset by peer]

07:55 DVedaa has joined #ffmpeg-devel

08:02 HarshK23 has quit [Quit: Connection closed for inactivity]

08:10 System_Error has quit [Remote host closed the connection]

08:23 <fflogger> [newticket] TheTroll: Ticket #11597 ([avfilter] FFmpeg output is stuck if using split with fps filter) created https://trac.ffmpeg.org/ticket/11597

08:30 <fflogger> [editedticket] SubJunk: Ticket #11594 ([avcodec] Hardcoding subtitles broken with libx265 since 20241205) updated https://trac.ffmpeg.org/ticket/11594#comment:9

08:33 Guest92 has joined #ffmpeg-devel

08:33 System_Error has joined #ffmpeg-devel

08:35 prathamd_ has joined #ffmpeg-devel

09:14 prathamd_ has quit [Ping timeout: 260 seconds]

09:21 rvalue has quit [Read error: Connection reset by peer]

09:21 rvalue has joined #ffmpeg-devel

09:25 Guest92 has quit [Quit: Client closed]

09:41 TheVibeCoder has quit [Quit: Client closed]

09:47 TheVibeCoder has joined #ffmpeg-devel

10:03 TheVibeCoder has quit [Quit: Client closed]

10:12 prathamd_ has joined #ffmpeg-devel

10:12 prathamd_ has quit [Client Quit]

10:51 <haasn> ramiro: do you have a branch with your swscale fixes (from ML) that, presumably, fix all of the legacy swscale conversion bugs in tests/libswscale?

10:54 cone-374 has joined #ffmpeg-devel

10:54 <cone-374> ffmpeg Lynne master:ebbc7ff65067: ffv1enc_vulkan: merge all encoder variants into one file

10:54 <cone-374> ffmpeg Lynne master:a4078abd739a: vulkan/ffv1: synchronize get_pred implementations between encoder and decoder

10:54 <cone-374> ffmpeg Lynne master:7c0a8c07ce75: ffv1enc_vulkan: unify EC code between setup and encode

10:54 <cone-374> ffmpeg Lynne master:69f83bafd1fa: ffv1enc_vulkan: get rid of temporary data for the setup shader

10:54 <cone-374> ffmpeg Lynne master:52595025c5aa: ffv1enc_vulkan: minor EC optimizations

10:54 <cone-374> ffmpeg Lynne master:bd41838b6066: ffv1enc_vulkan: switch to 2-line cache, unify prediction code

10:54 <cone-374> ffmpeg Lynne master:8a2d9216275d: ffv1_common: minor RGB optimization

10:54 <cone-374> ffmpeg Lynne master:f69db914cece: ffv1enc_vulkan: use ff_get_encode_buffer

10:54 <cone-374> ffmpeg Lynne master:a24ea37228f4: vulkan_ffv1: fix PCM + cached symbol reader

10:54 <cone-374> ffmpeg Lynne master:0156680f094f: ffv1enc_vulkan: implement the cached EC writer from the decoder

10:54 <cone-374> ffmpeg Lynne master:7576410af776: ffv1enc_vulkan: implement RCT search for level >= 4

10:54 <cone-374> ffmpeg Lynne master:cb8f4b675d1e: vulkan/ffv1: unify encode and decode get/put primitives

10:54 <cone-374> ffmpeg Lynne master:7b45d9c5fdce: vulkan_ffv1: pipe through slice decoding status

10:54 <cone-374> ffmpeg Lynne master:435db9bb49e4: vulkan: enable VK_KHR_shader_subgroup_rotate

10:54 <cone-374> ffmpeg Lynne master:7c3c5c805270: hwcontext_vulkan: correct image transfer usage flags

10:54 <cone-374> ffmpeg Lynne master:eabb62813e74: hwcontext_vulkan: only try exporting DMABUF memory on !WIN32 and only for DMABUF tiling

11:05 realies9 has quit [Quit: ~]

11:05 realies9 has joined #ffmpeg-devel

11:27 <haasn> to those who were interested: nuscale C only vs legacy C only: Overall speedup=3.418x faster, min=0.162x max=103.685x

11:31 <ramiro> haasn: those fixes are on my swscale6-asmjit-neon branch (commit messages that start with [upstream])

11:31 <haasn> ah

11:33 <ramiro> my branches usually have commit messages that are not very descriptive. because obviously I will remember every single detail about them when I finally want to upstream them and it won't take me half an hour to write a proper commit message :P

11:35 Guest92 has joined #ffmpeg-devel

11:37 <ramiro> haasn: what are the worst converters in the new C code? if it's the lut-based yuv2rgb it should be fine to ignore the difference

11:38 Guest92 has quit [Client Quit]

11:39 <haasn> gray -> yuv seems like a usual offender

11:39 <haasn> I didn't save the whole output of the test run

11:39 <haasn> maybe we should modify swscale.c to start keeping track of outliers and print the worst ones at the end

11:40 <haasn> I think that already would have saved me time

11:40 <haasn> btw, I was trying to test with valgrind but for some reason even rgb24 -> bgr24 doesn't pass

11:40 <haasn> it also constantly complains about printf() depending on uninitialized values even if I explicitly try initializing everything

11:41 <haasn> something is definitely going on

11:43 <fflogger> [newticket] evgene_123: Ticket #11598 ([ffmpeg] `gdigrab` Screen Capture Causes Cursor Flickering and Focus Loss on Windows (Multi-Monitor)) created https://trac.ffmpeg.org/ticket/11598

11:44 <ramiro> I keep my results in log files for each run. I changed the formatting and alignment of the output so that I can easily see the backend that was used, and awk it into a csv. then I sort by speedup and have a closer look at the slowest ones
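
A minimal sketch of the log-sorting workflow ramiro describes above, which also covers haasn's idea of printing the worst outliers at the end of a run. The log-line format, the regular expression and the sample numbers are all made up for illustration; the real output of the swscale test tool may look different.

```python
import re

# Hypothetical log format, one line per conversion, e.g.:
#   "gray -> yuv444p speedup=0.500x"
LINE_RE = re.compile(
    r"^(?P<src>\S+)\s*->\s*(?P<dst>\S+)\s+speedup=(?P<speedup>[0-9.]+)x"
)


def parse_speedups(lines):
    """Return a list of (speedup, src, dst) tuples from matching log lines."""
    results = []
    for line in lines:
        m = LINE_RE.match(line.strip())
        if m:
            results.append((float(m["speedup"]), m["src"], m["dst"]))
    return results


def worst_cases(results, count=3):
    """Return the `count` slowest conversions, slowest first."""
    return sorted(results)[:count]


# Made-up sample data, not taken from any real test run.
sample_log = [
    "rgb24 -> bgr24 speedup=2.000x",
    "gray -> yuv444p speedup=0.500x",
    "yuva444p9le -> yuva444p10le speedup=0.900x",
    "some unrelated line",
]

for speedup, src, dst in worst_cases(parse_speedups(sample_log), count=2):
    print(f"{speedup:8.3f}x  {src} -> {dst}")
```

Sorting tuples puts the smallest speedup first, so the slowest conversions are the first ones listed.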

11:46 <ramiro> also I'd like to be able to run just the new code and get the timings for that. most of the time is spent running legacy code, which won't change between runs

11:46 <fflogger> [newticket] evgene_123: Ticket #11599 ([ffmpeg] `gdigrab` Screen Capture Causes Cursor Flickering and Focus Loss on Windows (Multi-Monitor)) created https://trac.ffmpeg.org/ticket/11599

11:49 <ramiro> haasn: I suspect gray -> yuv may benefit from working the planes independently (2 are just memsets)

11:49 <haasn> definitely

11:50 <haasn> somehow valgrind thinks all my image planes are uninitialized unless I compile with --disable-asm

11:50 <ramiro> about valgrind, yes, there are still a few issues. some memcpy where both src and dst point to the same place. I haven't investigated how it got there though.

11:50 <haasn> well, not just thinks, somehow they *are* uninitialized (SSIM goes to 0)

11:52 <ramiro> what I found the oddest about running under valgrind is that the tests fail. so there's something very very fishy about it.

11:53 <haasn> ramiro: solved the dither memleak, op_list_remove_at() didn't uninit the removed op

11:59 <haasn> with the C code only, valgrind passes

12:03 <haasn> you're right that it's only the shuffle solver causing problems though

12:05 rvalue- has joined #ffmpeg-devel

12:06 rvalue has quit [Ping timeout: 276 seconds]

12:08 <haasn> is it possible that valgrind doesn't like the fact that we read into the (uninitialized) image stride, perform a pshufb, and then write back into the stride?

12:08 <haasn> maybe it can't track that the stride part of the input doesn't bleed into the non-stride part

12:08 <haasn> (or maybe it _does_ bleed)

12:11 rvalue- is now known as rvalue

12:34 mkver has joined #ffmpeg-devel

12:37 jamrial has joined #ffmpeg-devel

12:43 minimal has joined #ffmpeg-devel

13:02 <haasn> ramiro: solved the packed shuffle issues

13:03 <haasn> turns out I had an imul between "dec yendd" and "jg .loop"

13:03 <haasn> which overwrote the flag

13:03 <haasn> no idea why only when valgrind was enabled

13:03 <haasn> maybe it changed some cpu settings

13:03 <ramiro> haasn: too aggressive manual instruction reordering?

13:04 <haasn> no, I think it was just a mistake :)

13:04 <haasn> normally I try to reorder things so that the inc/dec and the jump are adjacent

13:04 <haasn> because iirc the CPU can decode those as a single uop

13:11 <Gramner> indeed. usually referred to as "macro-op fusion". not just decoding either, a fused cmp+branch is also executed and retired as one op
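
A toy model of the bug haasn describes: an instruction between `dec` and `jg` that overwrites the flags the jump depends on. This is a sketch, not the actual swscale assembly. It assumes that `imul` leaves ZF and SF undefined and only defines OF (and CF), which is how the x86 manuals describe it to the best of this sketch's knowledge; the `undefined_as` parameter is a made-up knob standing in for whatever the hardware or an emulator happens to produce.

```python
def dec_flags(value):
    """Model `dec` on a 32-bit register: return (result, flags)."""
    result = (value - 1) & 0xFFFFFFFF
    flags = {
        "ZF": result == 0,
        "SF": bool(result & 0x80000000),
        "OF": value == 0x80000000,  # only INT_MIN - 1 overflows
    }
    return result, flags


def imul_flags(a, b, undefined_as):
    """Model `imul` of two small non-negative values.

    OF is defined by the result; ZF and SF are treated as undefined and
    simply take the value `undefined_as` in this model.
    """
    overflow = a * b > 0x7FFFFFFF
    return {"ZF": undefined_as, "SF": undefined_as, "OF": overflow}


def jg_taken(flags):
    """`jg` jumps when ZF == 0 and SF == OF."""
    return not flags["ZF"] and flags["SF"] == flags["OF"]


def run_loop(rows, clobber, undefined_as=False, limit=1000):
    """Count iterations of a `dec; [imul]; jg .loop` style loop."""
    iterations = 0
    counter = rows
    while iterations < limit:
        iterations += 1
        counter, flags = dec_flags(counter)
        if clobber:
            flags = imul_flags(3, 5, undefined_as)  # stray instruction
        if not jg_taken(flags):
            break
    return iterations


print(run_loop(4, clobber=False))
print(run_loop(4, clobber=True, undefined_as=False))
print(run_loop(4, clobber=True, undefined_as=True))
```

With the stray instruction in place, the loop count no longer depends on the counter at all, only on how the undefined flags happen to come out. That would be consistent with the code behaving differently natively and under valgrind, although the log does not confirm the actual cause.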

13:15 <haasn> valgrind passes now :)

13:16 <linkmauve> For debugging purposes, is it possible to make ffmpeg do absolutely everything on a single thread?

13:26 <cone-374> ffmpeg Zhao Zhili master:4f7bc62c6644: avformat/allformats: Move avisynth and dvdvideo under external libraries group

13:26 <cone-374> ffmpeg Zhao Zhili master:124977664101: Makefile: Remove postproc from ALLFFLIBS

13:26 <cone-374> ffmpeg Shiyou Yin master:f41403877921: configure: identify loong64 for loongarch

13:45 <fflogger> [editedticket] Gyan: Ticket #11597 ([avfilter] FFmpeg output is stuck if using split with fps filter) updated https://trac.ffmpeg.org/ticket/11597#comment:2

13:46 <fflogger> [editedticket] TheTroll: Ticket #11597 ([avfilter] FFmpeg output is stuck if using split with fps filter) updated https://trac.ffmpeg.org/ticket/11597#comment:3

14:19 HarshK23 has joined #ffmpeg-devel

14:25 <ramiro> linkmauve: I normally use "taskset -c 0" for that

14:26 <ramiro> (or -c 4 on my rk3588 to pick an out-of-order core)
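
A rough Python equivalent of the `taskset -c 0` approach, as a sketch only. It is Linux-only and relies on `os.sched_setaffinity`. Note that pinning does not make a program single-threaded; it only forces the threads onto one core. The helper name `pin_to_single_cpu` is made up for this example.

```python
import os


def pin_to_single_cpu(cpu=None):
    """Restrict the calling thread to one CPU and return the new affinity set.

    Linux-only. If `cpu` is None, the lowest CPU currently allowed is used.
    Child processes started afterwards inherit the restriction.
    """
    allowed = os.sched_getaffinity(0)
    if cpu is None:
        cpu = min(allowed)
    os.sched_setaffinity(0, {cpu})
    return os.sched_getaffinity(0)
```

Calling `pin_to_single_cpu()` before launching a child process (for example with `subprocess.run`) should give that child the same single-core affinity that `taskset -c` would.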

14:26 <fflogger> [newticket] rmotrescu: Ticket #11600 ([undetermined] Ffprobe/Ffmpeg with libfdk-aac doesn't recognize AAC-HE(v2) livestreams) created https://trac.ffmpeg.org/ticket/11600

14:28 <ramiro> Gramner: I believe this is also the case on aarch64 (macro-op fusion), but I'm not quite sure how to test this. it's too small to make a noticeable difference in my loops

14:29 <linkmauve> https://linkmauve.fr/files/Screenshot%20From%202025-05-20%2016-26-52.png

14:30 <linkmauve> My very first image, running ffmpeg with its Vulkan H.264 decoder (I get the same output with Gstreamer), using my Vulkan driver, which uses V4L2 stateless, to then dump to a file and exit(0). :)

14:50 <ramiro> haasn: why does yuva444p9le->yuva444p10le need dither but yuva444p9le->yuva444p12le doesn't?

14:51 <haasn> ramiro: 12 bits is considered above the visual threshold of perception

14:51 <haasn> see the logic in fmt_dither() in format.c

14:52 <thardin> do we also dither when going from say 1-bit to 8-bit?

14:52 <fflogger> [editedticket] Gyan: Ticket #11597 ([avfilter] FFmpeg output is stuck if using split with fps filter) updated https://trac.ffmpeg.org/ticket/11597#comment:4

14:53 <haasn> thardin: no, because it is lossless

14:54 <ramiro> haasn: interesting! thanks

14:55 <ramiro> haasn: in that same conversion (yuva444p9le->yuva444p10le), dithering affects all planes. but if dst is yuv444p10le instead (no alpha), no dithering is applied. should dithering be applied differently when there is alpha or not?

15:00 <haasn> no, strictly speaking it should not, but we don't currently have a way to signal dithering only for some planes

15:01 <haasn> it shouldn't change the result though since adding dithering to a lossless result won't change the output

15:01 <haasn> since the dither matrix is strictly less than 1

15:02 <haasn> so this is just an optimization and not a fundamental error, and one we will optimize for free once we split independent planes
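
A small worked check of haasn's argument that dithering a lossless conversion cannot change the output. This is a sketch under stated assumptions: the dither offsets below are invented and simply lie in [0, 1), the result is truncated after the offset is added, and the factor 1023/511 is an assumed full-range 9-bit to 10-bit scale used only as a non-integer counterexample.

```python
import math

# Assumed dither offsets, all strictly less than 1.
DITHER = [0.0, 0.25, 0.5, 0.75, 0.999]


def convert(sample, scale, dither):
    """Scale a sample, add a dither offset, and truncate to an integer."""
    return math.floor(sample * scale + dither)


def dither_changes_output(scale, max_in):
    """Return True if any dither offset changes any converted sample."""
    for sample in range(max_in + 1):
        reference = convert(sample, scale, 0.0)
        if any(convert(sample, scale, d) != reference for d in DITHER):
            return True
    return False


print(dither_changes_output(2, 511))           # integer scale: exact results
print(dither_changes_output(1023 / 511, 511))  # non-integer scale
```

When the scaled value is already an integer, adding an offset below 1 and truncating gives the same integer back, so the dither is wasted work but harmless. With a non-integer factor the offset can push a sample over to the next integer.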

15:02 <ramiro> haasn: got it. thanks

15:10 <ramiro> haasn: it's weird the way we treat alpha in x2bgr10le. it's only 2 bits, but we add 0.5 when dithering. I'd never heard of this pixel format, but I'm not quite sure those 2 bits should be treated as alpha.

15:11 <haasn> what destination format?

15:11 <ramiro> x2bgr10le is the destination (src gbrap14le in my specific case but any other with alpha could work)

15:12 <ramiro> it doesn't even have AV_PIX_FMT_FLAG_ALPHA in pixdesc.c

15:14 <haasn> well, internally we don't really care what we write to those two bits since we assume they may contain garbage

15:14 <jannau> x2bgr10le as in DRM_FORMAT_XBGR2101010 is used as 10 bit RGB format. the X vs. A signifies that the two bits are ignored

15:14 <haasn> it's true that our pipeline could do a better job at marking them as unused though

15:14 <haasn> maybe we should add something like an execution mask to all ops, to be able to e.g. only apply an op to some components

15:15 <haasn> and then we could mark those two bits as not part of the SWS_OP_PACK, so the rest of the pipeline can optimize away any ops affecting them

15:15 <ramiro> jannau: thanks!

15:17 <jannau> there is also DRM_FORMAT_ABGR2101010 but I doubt anyone uses alpha there for anything but cut-outs to show underlays

15:20 <haasn> 2 bits is plenty with good dithering :)

15:24 <fflogger> [editedticket] TheTroll: Ticket #11597 ([avfilter] FFmpeg output is stuck if using split with fps filter) updated https://trac.ffmpeg.org/ticket/11597#comment:5

15:30 <ramiro> haasn: I think I got to a point with asmjit (and memops) where the only conversions that are slower (less than 0.95) are the x2-related ones, and ones that will be simplified once we split independent planes. the slowdowns come from converting to float to multiply by a power of 2 (when we could have just shifted instead), and applying dithering that doesn't change the result.

15:37 <haasn> ramiro: which exact case are you hitting where we fail to optimize a pot2 mult into a shift?

15:40 <ramiro> haasn: yuva444p9le->yuva444p10le. the alpha plane has a non-power-of-2 multiplication, so all planes go through f32 and linear

15:40 <haasn> oh, still that one

15:40 <haasn> yes, right

15:41 <ramiro> if planes are treated independently, only alpha won't be a memop

15:41 <ramiro> well, I also added lshift to my memops, which makes it not really a memop.
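
A sketch of the point ramiro and haasn are discussing: if one plane needs a non-power-of-2 multiplication, a joint pipeline sends every plane through the float path, while per-plane handling lets the others use a shift. The scale factors are assumptions for illustration (2 for Y/U/V, 1023/511 for alpha), and `shift_for_scale` and `plan` are hypothetical helpers, not swscale functions.

```python
def shift_for_scale(scale):
    """Return n if scale == 2**n for a non-negative integer n, else None."""
    if scale < 1 or scale != int(scale):
        return None
    s = int(scale)
    if s & (s - 1):  # more than one bit set: not a power of two
        return None
    return s.bit_length() - 1


def plan(planes, independent):
    """Pick an operation per plane, jointly or independently."""
    shifts = {name: shift_for_scale(scale) for name, scale in planes.items()}
    if not independent and any(s is None for s in shifts.values()):
        # One awkward plane drags every plane through the float path.
        return {name: "f32 multiply" for name in planes}
    return {
        name: "f32 multiply" if s is None else f"lshift {s}"
        for name, s in shifts.items()
    }


# Assumed per-plane factors for yuva444p9le -> yuva444p10le.
PLANES = {"Y": 2, "U": 2, "V": 2, "A": 1023 / 511}

print(plan(PLANES, independent=False))
print(plan(PLANES, independent=True))
```

In the independent case only the alpha plane keeps the float multiply, which matches ramiro's remark that only alpha would not be a memop.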

15:52 Anthony_ZO has quit [Ping timeout: 260 seconds]

16:18 <fflogger> [editedticket] Jhon: Ticket #11298 ([undetermined] AMD VAAPI HEVC encoding a 1080p video results in a 1920x1088 one) updated https://trac.ffmpeg.org/ticket/11298#comment:6

16:23 mkver has quit [Ping timeout: 272 seconds]

16:26 cone-374 has quit [Quit: transmission timeout]

16:55 <tmatth> Lynne: getting another vulkan build failure https://paste.debian.net/1375723/

17:01 ngaullier has quit [Remote host closed the connection]

17:02 <tmatth> since 435db9bb49e48aae0ada537fe9ce9fe60d87a4f6

17:05 Traneptora has joined #ffmpeg-devel

17:06 mkver has joined #ffmpeg-devel

17:38 Traneptora_ has joined #ffmpeg-devel

17:38 Marth64[m] has joined #ffmpeg-devel

17:41 lexano_ has joined #ffmpeg-devel

17:41 Marth64 has quit [Ping timeout: 276 seconds]

17:42 Traneptora has quit [Read error: Connection reset by peer]

17:42 lexano has quit [Ping timeout: 260 seconds]

18:10 <jamrial> tmatth: does http://pastie.org/p/5sfcaadWmXbogi090CCn73/raw fix it?

18:11 <tmatth> jamrial: it did, thanks!

18:11 cone-164 has joined #ffmpeg-devel

18:11 <cone-164> ffmpeg Lynne master:842fa198e979: hwcontext_vulkan: fix build with old Vulkan header versions

18:15 <Lynne> VK_KHR_shader_subgroup_rotate should be supported within the version of the headers we support, so there shouldn't be a need for checks in this case

18:15 Flat has quit [Quit: Rip internet]

18:16 Flat has joined #ffmpeg-devel

18:25 <tmatth> Lynne: that also worked

18:25 <tmatth> thanks

18:33 rvalue- has joined #ffmpeg-devel

18:34 rvalue has quit [Ping timeout: 252 seconds]

18:40 rvalue- is now known as rvalue

19:23 frankplow has quit [Ping timeout: 260 seconds]

19:23 frankplow has joined #ffmpeg-devel

21:11 cone-164 has quit [Quit: transmission timeout]

21:32 sepro has joined #ffmpeg-devel

21:35 ^Neo has joined #ffmpeg-devel

21:35 ^Neo has quit [Changing host]

21:35 ^Neo has joined #ffmpeg-devel

21:36 <fflogger> [editedticket] Balling: Ticket #11514 ([avcodec] Cuvid decoder problem with deinterlacing) updated https://trac.ffmpeg.org/ticket/11514#comment:4

21:47 <fflogger> [editedticket] Balling: Ticket #11245 ([avformat] Slow HEIC decoding with "hevc_cuvid") updated https://trac.ffmpeg.org/ticket/11245#comment:12

21:49 <fflogger> [editedticket] Balling: Ticket #10663 ([avcodec] Vaapi hardware decoding 444 HEVC failure) updated https://trac.ffmpeg.org/ticket/10663#comment:4

21:54 <fflogger> [editedticket] Balling: Ticket #11298 ([undetermined] AMD VAAPI HEVC encoding a 1080p video results in a 1920x1088 one) updated https://trac.ffmpeg.org/ticket/11298#comment:7

22:12 <fflogger> [editedticket] oromit: Ticket #11514 ([avcodec] Cuvid decoder problem with deinterlacing) updated https://trac.ffmpeg.org/ticket/11514#comment:5

22:26 AtleoS has joined #ffmpeg-devel

22:28 <BtbN> Is it "legal" for a decoder to EAGAIN an EOF packet?

22:29 <mkver> Doesn't make any sense. What decoder?

22:29 <BtbN> Apparently not, causes an assertion in ffmpeg_dec.c

22:30 <BtbN> Why does it not make sense? the EOF packet causes the decoder to flush, and if the output buffer is all full, it can't flush

22:30 <BtbN> so what's it supposed to do, buffer the EOF request?

22:32 <mkver> I misread: I thought "decoder" meant one of the internal decoders (i.e. the return value of one of the internal decode callbacks), not the return code of avcodec_send_packet().

22:34 <mkver> Or are we talking about avcodec_receive_frame()? First you say "on packet", then you mention an assert in ffmpeg_dec.c depending upon the avcodec_receive_frame() return value.

22:34 <mkver> Is this a regression since b18aaf209f007e67ac4490ba5647ea139d1a6dcb? Can you open a proper ticket?

22:35 <BtbN> No idea if it's a regression. I'd like to make cuvid EAGAIN an EOF request, since its output buffer is full at that time. But can't, since it causes an assertion

22:42 <kepstin> the eof is just supposed to start the flush process, right? afterwards, the user expects to call receive_frame until they get all frames and the decoder returns eof.

22:42 <BtbN> Well, in the case of cuvid, passing on the EOF to the underlying decoder makes it instantly barf out a ton of frames

22:43 <BtbN> Somehow it outputs more frames at once than should be possible though, so I'm looking into wtf is going on

22:43 <BtbN> like, it's configured with 8 decode surfaces

22:43 <BtbN> but on EOF, it outputs 9 at once

22:43 <BtbN> that should not be possible

22:54 <kepstin> fwiw, all the example code for the ffmpeg decode apis appears to expect avcodec_send_packet(ctx, NULL) to always work unless there's an error that would require aborting the decode.

22:56 <kepstin> at least, they seem to rely on the case where if you call receive_frame() and get EAGAIN, then calling send_packet() immediately after must not return EAGAIN.

22:58 <BtbN> What's happening is that the max_display_delay is set to 4

22:58 <BtbN> so cuviddec.c makes sure to keep at a minimum 4 buffer-slots free, for those 4 frames to appear at any time

22:59 <BtbN> but on the flush packet, cuviddec dumps out a lot more than 4

23:00 Guest46 has joined #ffmpeg-devel

23:01 Guest46 has quit [Client Quit]

23:05 <kepstin> to the best of my understanding, it should be ok for send_packet with NULL to return EAGAIN only if it happens 1) after a previous call to send_packet with no intermediate receive_frame calls, or 2) after a receive_frame call which did not return EAGAIN.

23:05 <BtbN> hm, I guess on a flush, it just dumps out everything it got with no regard for its own buffer limits

23:05 <BtbN> so I will have to somehow withhold actually flushing the underlying decoder until the output queue is fully empty

23:07 <kepstin> yeah, i guess in that case you'd want flushing the ffmpeg "decoder" to just set a flag, and have logic so that in the following calls to receive_frame it'll flush the underlying decoder at an appropriate time.
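
A toy model of the deferred-flush idea kepstin outlines: the EOF request only sets a flag, and the underlying decoder is flushed from the receive side once the output queue is empty. This is a sketch, not the cuviddec code. `ToyBackend`, `DeferredFlushDecoder`, the string return codes and the queue capacity are all hypothetical, and the sketch assumes the backend holds back at most `delay` frames with `delay` no larger than the queue capacity.

```python
from collections import deque

EAGAIN = "EAGAIN"
EOF = "EOF"


class ToyBackend:
    """Stand-in for an underlying decoder that holds back `delay` frames."""

    def __init__(self, delay):
        self.delay = delay
        self.pending = deque()

    def decode(self, packet):
        self.pending.append(f"frame{packet}")
        out = []
        while len(self.pending) > self.delay:
            out.append(self.pending.popleft())
        return out

    def flush(self):
        out = list(self.pending)  # everything comes out at once
        self.pending.clear()
        return out


class DeferredFlushDecoder:
    def __init__(self, backend, capacity):
        self.backend = backend
        self.capacity = capacity
        self.queue = deque()
        self.eof_requested = False
        self.flushed = False
        self.max_queued = 0

    def _enqueue(self, frames):
        self.queue.extend(frames)
        self.max_queued = max(self.max_queued, len(self.queue))

    def send_packet(self, packet):
        if packet is None:  # EOF: only remember the request
            self.eof_requested = True
            return 0
        if len(self.queue) >= self.capacity:
            return EAGAIN
        self._enqueue(self.backend.decode(packet))
        return 0

    def receive_frame(self):
        if not self.queue and self.eof_requested and not self.flushed:
            self._enqueue(self.backend.flush())  # queue is empty: safe now
            self.flushed = True
        if self.queue:
            return self.queue.popleft()
        return EOF if self.flushed else EAGAIN


def decode_all(decoder, packets):
    """Drive the send/receive loop, then drain until EOF."""
    frames = []
    for packet in list(packets) + [None]:
        while decoder.send_packet(packet) == EAGAIN:
            frames.append(decoder.receive_frame())
    while (frame := decoder.receive_frame()) != EOF:
        frames.append(frame)
    return frames


decoder = DeferredFlushDecoder(ToyBackend(delay=2), capacity=3)
print(decode_all(decoder, range(6)), decoder.max_queued)
```

Because the EOF request never returns EAGAIN in this model, the caller-side loop can stay the same as in the usual send/receive pattern, and the burst of flushed frames only arrives when the queue has room for it.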

23:11 <BtbN> This gets fun when it dumps out more frames at once than it even HAS surfaces available

23:11 <BtbN> it just happily catches its own tail

23:18 <fflogger> [editedticket] Balling: Ticket #10668 ([avcodec] cuvid regression creates jerky output) updated https://trac.ffmpeg.org/ticket/10668#comment:9

23:20 <fflogger> [editedticket] Balling: Ticket #9285 ([avcodec] Excessive GPU memory usage with nvdec hwaccel) updated https://trac.ffmpeg.org/ticket/9285#comment:2

23:20 <fflogger> [editedticket] oromit: Ticket #10668 ([avcodec] cuvid regression creates jerky output) updated https://trac.ffmpeg.org/ticket/10668#comment:10

23:21 <fflogger> [editedticket] Balling: Ticket #10668 ([avcodec] cuvid regression creates jerky output) updated https://trac.ffmpeg.org/ticket/10668#comment:11

23:22 cone-429 has joined #ffmpeg-devel

23:22 <cone-429> ffmpeg Timo Rothenpieler master:431e2cae87b6: avcodec/cuviddec: print error when queueing frames fails

23:22 <cone-429> ffmpeg Timo Rothenpieler master:d5a9f7bdd4d9: avcodec/cuviddec: only flush cuvid when output queue is empty

23:23 <fflogger> [editedticket] oromit: Ticket #10668 ([avcodec] cuvid regression creates jerky output) updated https://trac.ffmpeg.org/ticket/10668#comment:12

23:28 <fflogger> [editedticket] Balling: Ticket #10668 ([avcodec] cuvid regression creates jerky output) updated https://trac.ffmpeg.org/ticket/10668#comment:13

23:37 <fflogger> [editedticket] oromit: Ticket #10668 ([avcodec] cuvid regression creates jerky output) updated https://trac.ffmpeg.org/ticket/10668#comment:14

23:47 <fflogger> [editedticket] oromit: Ticket #10668 ([avcodec] cuvid regression creates jerky output) updated https://trac.ffmpeg.org/ticket/10668#comment:15