#ffmpeg-devel on 2025-07-04 — irc logs at libera.catirclogs.org

2025-03-03 01:04 michaelni changed the topic of #ffmpeg-devel to: Welcome to the FFmpeg development channel | Questions about using FFmpeg or developing with libav* libs should be asked in #ffmpeg | This channel is publicly logged | FFmpeg 7.1.1 has been released! | Please read ffmpeg.org/developer.html#Code-of-conduct

00:16 iive has quit [Quit: They came for me...]

00:28 <steven-netint> hi everyone, does anyone know if HW accelerators like qsv/nvenc/amf are tested in FATE? and, are they tested on every patch, nightly, or only during releases?

00:33 <jamrial> no, they are not

00:33 <jamrial> fate tests don't cover hw or external modules

00:46 minimal has quit [Quit: Leaving]

00:50 <frankplow> averne: Is there some syntax element which can be used to indicate more than 1 coefficient is insignificant? HEVC and VVC have similar scan orders, and there it is because the TU is divided into CGs aka subblocks

00:52 <BtbN> steven-netint: how would a fate test for that even work? The output can chance on any driver update.

00:52 <BtbN> *change

00:52 <BtbN> you can test hwdecs though, but we don't do that either

00:54 <steven-netint> im curious how other HW accel vendors would ensure their code is not broken after a ffmpeg patch in framework, etc.

00:54 <frankplow> averne: I don't know if I would say it's a hybrid of Morton and zig-zag scans, for 2x2 scans (intra-subblock and inter-subblock scans in the top-left 4x4 region) the zig-zag scan and Morton scan are equivalent so you can view the whole thing as a two-level hierarchy of zig-zag scans of different sizes

00:54 <BtbN> not sure what you mean

00:54 <BtbN> FFmpeg has little to no influence on the output of the hwencs

00:55 <BtbN> it can set parameters, sure. But what's done with those is out of its control

00:55 <steven-netint> i was thinking I could probably setup a nightly regression pulling from FFmpeg-master, compile with the hw accel, and run internal tests against it. If test failure is found, check-in a patch to fix

00:56 <BtbN> And how do you determine if the breakage is in FFmpeg or the driver?

00:56 <BtbN> There is very little FFmpeg can or will do to break the hwenc wrappers

00:57 <steven-netint> look through FFmpeg-master commit history, analyze code change which could've caused breakage, likely fix vendor code as necessary

00:58 <BtbN> seems like largely pointless effort to me

00:58 <BtbN> I can't recall any of the hwenc ever having been just randomly broken

00:58 <BtbN> They're mostly just thin wrappers with not all that much actual work going on

00:58 <steven-netint> BtbN: my hwcontext was affected by a change in fftools after n7.1 :'(

00:59 <BtbN> in fftools?

00:59 <BtbN> your hwcontext? what?

01:00 <steven-netint> its the netint HW accel codecs/filters that aren't in FFmpeg-master

01:00 <steven-netint> Though, I'm trying to upstream it now on the mailing-list :)

01:00 <BtbN> so you mean some internal API got broken? Cause that can happen at any time.

01:01 <BtbN> There is no guarantee on anything that's not public API, it can change at any moment

01:02 <steven-netint> I fixed the issue in my hwcontext. Basically there was a bug which would not allow HWframes to be free'd after enc instance is closed. It was revealed by a commit on master branch (64f3feb) which closed enc instance before filter session. I fixed it now, but I was just thinking about how i can catch these kinds of issues in the future.

01:03 <steven-netint> *bug in my hwcontext

01:03 <BtbN> that's hard to say for out of tree internal code

01:03 <BtbN> Cause it can break in near unlimited ways

01:05 <steven-netint> yes, that is so. But, i'd think other large HW vendors would want to test FFmpeg master branch frequently to make sure their codecs/filters aren't broken when it comes time to make a FFmpeg release, no?

01:06 <steven-netint> Curious what they're doing about it if anyone knows

01:06 <BtbN> I don't think any hw vendors are directly involved like that

01:07 <BtbN> AMD and Nvidia both have engineers working on FFmpeg, but that's limited to occasional patches for new features and consultation for problems we run into

01:07 <BtbN> Intel probably has the largest team and effort going on, with their ffmpeg cartwheels and everything

01:07 System_Error has quit [Ping timeout: 244 seconds]

01:59 System_Error has joined #ffmpeg-devel

02:33 cone-459 has quit [Quit: transmission timeout]

02:44 rvalue has quit [Read error: Connection reset by peer]

02:45 rvalue has joined #ffmpeg-devel

03:04 <CounterPillow> I don't think a lot of places pay for mailing list toddler tantrums (the only one I know of is the bcachefs Patreon) so I'd say corporate involvement in ffmpeg has a fairly bleak outlook.

03:14 Martchus has joined #ffmpeg-devel

03:14 Martchus_ has quit [Ping timeout: 252 seconds]

03:33 jamrial has quit []

03:40 Mirarora has joined #ffmpeg-devel

03:55 Mirarora has quit [Quit: Mirarora encountered a fatal error and needs to close]

04:21 mkver has joined #ffmpeg-devel

05:00 System_Error has quit [Remote host closed the connection]

05:06 System_Error has joined #ffmpeg-devel

05:38 System_Error has quit [Remote host closed the connection]

05:43 TheVibeCoder has joined #ffmpeg-devel

05:48 TheVibeCoder has quit [Changing host]

05:48 TheVibeCoder has joined #ffmpeg-devel

05:56 <kasper93> BtbN: I don't agree testing hwdec is completely useless, if anything you can monitor status of it and go in-front of user reports that something "doesn't work"

05:56 <kasper93> also "thin wrapper" is maybe true for some hwdec

05:57 <kasper93> but for example Vulkan is huge and over the years had multitude fixes in ffmpeg itself

06:23 <Lynne> at least 2 rewrites

06:30 <beastd> steven-netint,BtbN,kasper93: I would say in general it would be possible to at least test some basic scenarios and check if it runs expected with no errors (or with errors if that is expected) and

06:30 <beastd> no crashes

06:31 <beastd> maybe some more superficial stuff could be checked.

06:35 <steven-netint> thanks for the input. I'm definitely interested in keeping issues from becoming user reports so I'll likely setup internal nightly tests of FFmpeg master branch with my hw codecs/filters once its upstreamed

06:39 <steven-netint> If anyone has input regarding my HW codecs/filters upstream I would appreciate it too :) (https://github.com/netintsteven/NI_FF_upstream/tree/upstream)

06:54 kasper93 has quit [Quit: kasper93]

06:54 kasper93 has joined #ffmpeg-devel

06:59 Coinflipper has quit [Quit: ]

07:03 Coinflipper has joined #ffmpeg-devel

07:13 cone-292 has joined #ffmpeg-devel

07:13 <cone-292> ffmpeg Peter Ross master:0fe9f25e7616: avcodec/adpcm: Sanyo LD-ADPCM decoder

07:18 averne has quit [Ping timeout: 244 seconds]

07:38 kasper93 has quit [Read error: Connection reset by peer]

07:43 ngaullier has joined #ffmpeg-devel

08:02 lemourin has quit [Quit: The Lounge - https://thelounge.chat]

08:04 lemourin has joined #ffmpeg-devel

08:59 <mkver> Lynne: ff_opus_rc_enc_end() can call ff_opus_rc_put_raw() to write 32 bits, which leads to an abort in av_zero_extend() if assert_level >= 2 (and is also not supported otherwise).

09:22 microchip_ has quit [Quit: There is no spoon!]

09:22 kasper93 has joined #ffmpeg-devel

09:23 microchip_ has joined #ffmpeg-devel

09:54 averne has joined #ffmpeg-devel

09:58 Anthony_ZO has quit [Remote host closed the connection]

10:04 averne has quit [Quit: quit]

10:04 averne has joined #ffmpeg-devel

10:13 cone-292 has quit [Quit: transmission timeout]

10:17 quietvoid has quit []

10:19 quietvoid has joined #ffmpeg-devel

10:27 <kierank> 01:33:46 <•jamrial> fate tests don't cover hw or external modules

10:27 <kierank> on the macs we could do that, no?

10:27 <kierank> 04:04:46 <CounterPillow> I don't think a lot of places pay for mailing list toddler tantrums (the only one I know of is the bcachefs Patreon) so I'd say corporate involvement in ffmpeg has a fairly bleak outlook.

10:27 <kierank> LOOOL

10:43 averne has quit [Quit: quit]

10:49 averne has joined #ffmpeg-devel

10:58 <averne> frankplow: "Is there some syntax element which can be used to indicate more than 1 coefficient is insignificant" -> yeah, AC coefficients are run-level encoded, with the runs being all zeroes except for the last element. There is no block subdivision, the transform always operates on 8x8 blocks

11:18 <frankplow> averne: In HEVC/VVC you have these subblock significance flags which sit between the usual run-level significance coding and individual coefficient flags. The transform itself still operates on the entire block, the subblocks are only relevant when coding the coefficients.

11:20 <frankplow> averne: But it doesn't sound like that's the case here. As I understand it the subblocks have two roles: bitrate reduction for large blocks via the significance flags, and they're easier to implement in hardware for varying block sizes (particularly in the case of VVC). For a codec with only 8x8 transform block sizes of course neither apply so I'm not sure why they'd lay out the scan like that.

11:30 microchip_ has quit [Quit: There is no spoon!]

11:31 microchip_ has joined #ffmpeg-devel

11:51 lemourin9 has joined #ffmpeg-devel

11:51 lemourin is now known as Guest7496

11:51 lemourin9 is now known as lemourin

11:57 lemourin has quit [Quit: The Lounge - https://thelounge.chat]

11:59 <averne> frankplow: ah I see, but yeah it doesn't sound like wwhat prores does. In general it's low-complexity and focuses strongly on efficient decoding rather than compression

12:05 Guest7496 has quit [Quit: The Lounge - https://thelounge.chat]

12:12 Guest7496 has joined #ffmpeg-devel

12:12 Guest7496 is now known as lemourin

12:17 lemourin has quit [Client Quit]

12:18 <fflogger> [editedticket] chrisw: Ticket #9281 ([avformat] wrong $Number$ value while reading mpd) updated https://trac.ffmpeg.org/ticket/9281#comment:2

12:23 lemourin has joined #ffmpeg-devel

12:23 jamrial has joined #ffmpeg-devel

12:55 <BtbN> kasper93: hwdec, absolutely. The output is well defined after all

12:55 <BtbN> but I wouldn't know how to test hwencs

12:55 <BtbN> the output can change at a whim of the driver at any time

12:57 <TheVibeCoder> add hwdec tests to fate now!

13:03 rvalue- has joined #ffmpeg-devel

13:03 rvalue has quit [Ping timeout: 244 seconds]

13:12 rvalue- is now known as rvalue

13:32 witchymary has quit [Ping timeout: 265 seconds]

13:33 <kasper93> BtbN: sure, it won't be bitexact. But you can run SSIM or any other similarity metic and check if it's not too low

13:34 <kasper93> if it drops below threshold, something is broken

13:39 <kasper93> fate should have arewefastyet.com style graphs for all workers to track performance regressions in ffmpeg :)

13:44 kasper93 has quit [Quit: kasper93]

13:44 kasper93 has joined #ffmpeg-devel

14:08 ShadowJK has quit [Ping timeout: 265 seconds]

14:19 ShadowJK has joined #ffmpeg-devel

14:20 <BtbN> That reminds me that I wanted to investigate dropping -fPIC for the fully static builds

14:20 <BtbN> Cause iirc PIC over just PIE has a performance penalty

14:20 <BtbN> though I do build with -fPIC -fno-semantic-interposition, which might offset the PIC penalty, but no idea how much

14:35 <BtbN> hm, in udp listen mode, when the incoming stream ends... ffmpeg just sits there. Indefinitely, even hitting q can't make it quit.

14:36 <BtbN> I had kinda hopes that the client shutting down the DTLS session would tear it down, but does not seem like it does

14:50 <fflogger> [newticket] nyanmisaka: Ticket #11655 ([avcodec] Cuda/nvdec hwaccel outputs P016LE instead of P010LE on 10bit video) created https://trac.ffmpeg.org/ticket/11655

15:03 <averne> kasper93: hwdec output should be bit-exact for most codecs

15:04 <fflogger> [editedticket] oromit: Ticket #11655 ([avcodec] Cuda/nvdec hwaccel outputs P016LE instead of P010LE on 10bit video) updated https://trac.ffmpeg.org/ticket/11655#comment:1

15:11 <fflogger> [editedticket] nyanmisaka: Ticket #11655 ([avcodec] Cuda/nvdec hwaccel outputs P016LE instead of P010LE on 10bit video) updated https://trac.ffmpeg.org/ticket/11655#comment:2

15:15 <fflogger> [editedticket] Balling: Ticket #11655 ([avcodec] Cuda/nvdec hwaccel outputs P016LE instead of P010LE on 10bit video) updated https://trac.ffmpeg.org/ticket/11655#comment:3

15:23 <Lynne> averne: all of them, I think

15:23 <Lynne> except the broken hwaccels

15:23 <Lynne> like vdpau

15:25 <averne> I wasn't able to match mpeg2 on nvdec. But yeah all the codecs that matter should be compliant with the conformance test suites

15:25 <fflogger> [editedticket] oromit: Ticket #11655 ([avcodec] Cuda/nvdec hwaccel outputs P016LE instead of P010LE on 10bit video) updated https://trac.ffmpeg.org/ticket/11655#comment:4

15:26 <BtbN> iirc mpeg2 is famously not specced to be bitexact

15:27 <fflogger> [editedticket] nyanmisaka: Ticket #11655 ([avcodec] Cuda/nvdec hwaccel outputs P016LE instead of P010LE on 10bit video) updated https://trac.ffmpeg.org/ticket/11655#comment:5

15:31 <fflogger> [editedticket] Balling: Ticket #11655 ([avcodec] Cuda/nvdec hwaccel outputs P016LE instead of P010LE on 10bit video) updated https://trac.ffmpeg.org/ticket/11655#comment:6

15:31 <Traneptora> I think the earlier DCTbased codecs aren't. og JPEG isn't either

15:34 System_Error has joined #ffmpeg-devel

15:36 cone-351 has joined #ffmpeg-devel

15:36 <cone-351> ffmpeg Timo Rothenpieler master:bf5f3f1f2e6e: avcodec/nvdec: fix 10bit output pixel formats

15:37 <fflogger> [editedticket] Timo Rothenpieler <timo@rothenpieler.org>: Ticket #11655 ([avcodec] Cuda/nvdec hwaccel outputs P016LE instead of P010LE on 10bit video) updated https://trac.ffmpeg.org/ticket/11655#comment:7

15:38 System_Error has quit [Remote host closed the connection]

15:42 <fflogger> [editedticket] oromit: Ticket #11655 ([avcodec] Cuda/nvdec hwaccel outputs P016LE instead of P010LE on 10bit video) updated https://trac.ffmpeg.org/ticket/11655#comment:8

15:43 <BtbN> Hm, I can't just change the output sw pixel format of a decoder, can I? That'd break a lot of commandlines and applications, no?

15:44 <BtbN> Cause nvdecs AV_PIX_FMT_P216 and AV_PIX_FMT_P016 are actually AV_PIX_FMT_P212 and AV_PIX_FMT_P012

15:49 <ePirat> shouldnt applications request specific format if they need that?

15:50 <BtbN> don't think that's an option?

15:50 <BtbN> https://github.com/FFmpeg/FFmpeg/blob/master/libavcodec/nvdec.c#L393 it's about these two

15:51 <BtbN> They're actually P212 and P012, but those didn't exist back whe nvdec (and nvenc) were implemented

15:51 <ePirat> hmm not sure what the usual way to go about this is

15:51 <BtbN> I've just used P216 and P016 back then, cause 12 bit fits in there fine, with just the low bits zero

15:52 <BtbN> But obviously outputting the actual pixel format would be nice

15:52 <BtbN> Linked the wrong lines. It's these two: https://github.com/FFmpeg/FFmpeg/blob/master/libavcodec/nvdec.c#L767

15:53 <BtbN> AV_PIX_FMT_YUV444P16 is also a weird case. It uses that for both 10 and 12 bit, cause there is no equivalent format in FFmpeg

15:54 <BtbN> AV_PIX_FMT_YUV444P10 and AV_PIX_FMT_YUV444P12 expect the data in the LSB iirc, but nvidia puts it into the MSB, like with the Pxxx formats

15:57 secondcreek has quit [Remote host closed the connection]

15:58 secondcreek has joined #ffmpeg-devel

16:00 System_Error has joined #ffmpeg-devel

16:04 <ePirat> BtbN, I think it should be fine at least with a major bump?

16:05 <BtbN> Yeah, gonna have to cook up version guards to it bumps automatically on the next major bump

16:07 <ePirat> mkver, do you have an opinion on my tee refactor?

16:08 <ePirat> I really want to make av_dict_get const and its the only thing in the way

16:08 <fflogger> [editedticket] nyanmisaka: Ticket #11655 ([avcodec] Cuda/nvdec hwaccel outputs P016LE instead of P010LE on 10bit video) updated https://trac.ffmpeg.org/ticket/11655#comment:9

16:13 <BtbN> Is there some magic somewhere, that if you enter just "p010" as a format= filter, that it appends the native endianness?

16:14 <BtbN> There sure is, right in av_get_pix_fmt

16:15 <kasper93> averne: I was responding to "but I wouldn't know how to test hwencs"

16:15 <Lynne> TheVibeCoder: nethier version 0 nor 1 uses each tile's qscale value?

16:15 <Lynne> why do they write that in the bitstream?

16:17 mkver has quit [Ping timeout: 265 seconds]

16:19 <TheVibeCoder> its always fixed value?

16:19 <TheVibeCoder> so not actually qscale?

16:37 mkver has joined #ffmpeg-devel

16:37 ngaullier has quit [Remote host closed the connection]

16:56 <mkver> ePirat: I don't consider the macro to be unreadable and think that your patch does not make it more readable; but I agree that abusing the AVDictionary API should stop.

17:33 <fflogger> [editedticket] oromit: Ticket #11655 ([avcodec] Cuda/nvdec hwaccel outputs P016LE instead of P010LE on 10bit video) updated https://trac.ffmpeg.org/ticket/11655#comment:10

17:33 <fflogger> [editedticket] oromit: Ticket #11655 ([avcodec] Cuda/nvdec hwaccel outputs P016LE instead of P010LE on 10bit video) updated https://trac.ffmpeg.org/ticket/11655#comment:11

17:42 <Lynne> TheVibeCoder: I'm seeing a difference here, between RAW and RAW HQ

17:43 <Lynne> 16429 (raw) vs 16399 (raw hq)

17:43 <Lynne> the raw (not hq) image also looks washed out, which leads me to believe that qscale is used somehow

17:44 <ePirat> mkver, I dont mind keeping the macro stuff if people prefer that, but I really want to get rid of it fiddling with the dict internals, especially for nearly no gain here

17:47 <Lynne> (qscale - 16384) >> 1 as a constant qmat seems to fit both, but this is just a guesswork on my part

17:48 <Lynne> can anyone decode prores raw?

17:48 <ePirat> I guess I can with videotoolbox?

17:50 mkver has quit [Ping timeout: 276 seconds]

17:51 <Lynne> ePirat: thanks, https://files.lynne.ee/prores_raw_hq.mov and https://files.lynne.ee/prores_raw_nonhq.mov

17:52 <TheVibeCoder> on Mac?

17:52 <TheVibeCoder> good luck

17:52 <Lynne> oh, and https://files.lynne.ee/prores_raw_version0.mov

17:52 <Lynne> adobe premier should also support it

17:53 <TheVibeCoder> Lynne: dynamic range between HQ and normal is different

17:56 <ePirat> Lynne, what output you want?

17:56 <Lynne> raw bayer, if possible

17:56 <Lynne> TheVibeCoder: I thought its always 12bits no matter what

17:57 <TheVibeCoder> not here

17:57 <TheVibeCoder> i see in histogram that one uses full range and another only middle one fourth

17:58 <TheVibeCoder> + waveform filter

17:58 <Lynne> it is meant to be half the bitrate exactly of hq

17:59 <Lynne> but I think that, in the absence of any other identification info, the qscale value should be able to compensate for that

18:00 <ePirat> Lynne, ok, will check in a bit

18:00 <ePirat> Lynne, are there already some patches that add the codec id and such?

18:01 <Lynne> yeah, give me a minute

18:04 <TheVibeCoder> in waveform filter i now see "banding"/"stripes" so maybe that needs fixing

18:05 <Lynne> ePirat: https://github.com/cyanreg/FFmpeg/tree/prores_raw

18:06 <fflogger> [editedticket] Balling: Ticket #11655 ([avcodec] Cuda/nvdec hwaccel outputs P016LE instead of P010LE on 10bit video) updated https://trac.ffmpeg.org/ticket/11655#comment:12

18:19 mkver has joined #ffmpeg-devel

18:36 cone-351 has quit [Quit: transmission timeout]

18:37 iive has joined #ffmpeg-devel

18:39 <TheVibeCoder> fps filter is balloning memory

18:39 <kierank> like all of libavfilter

18:40 <TheVibeCoder> in FFmpeg

18:51 <fflogger> [editedticket] Balling: Ticket #11578 ([ffmpeg] Waveform discontinuity in decoded E-AC-3) updated https://trac.ffmpeg.org/ticket/11578#comment:14

18:52 <fflogger> [editedticket] Balling: Ticket #10732 ([avcodec] avcodec_flush_buffers() not resetting E-AC-3 decoder) updated https://trac.ffmpeg.org/ticket/10732#comment:14

18:54 <TheVibeCoder> The Balling of FFmpeg

18:54 kurosu has joined #ffmpeg-devel

18:59 E81l7HT8T7sF9JdA has joined #ffmpeg-devel

19:00 <mkver> kasper93: Do you think it would be good to use plain malloc instead of av_malloc() for allocations in lavu with a dedicated deallocator (like AVBufferRef, AVBuffer, AVFrame, AVDictionary) that don't need to be overaligned?

19:01 <TheVibeCoder> also add cached AVFrame allocations

19:01 <TheVibeCoder> these are performance killer

19:03 <TheVibeCoder> very old bitcoin wallets reactivated?

19:03 <TheVibeCoder> someone cracked them?

19:08 <kasper93> mkver: possibly, can't tell if there would be tangible gain

19:09 E81l7HT8T7sF9JdA has quit [Quit: Leaving]

19:10 <BtbN> oh god... my addition of a "Data in MSB" pixel format completely confuses swscale

19:11 <BtbN> It works if I disable ASM, which makes it infinitely worse

19:17 <BtbN> I don't understand how the x86 assembly path can seemingly set one general yuv2yuvX to yuv2planeX and yuv2plane1, but the C path has a million different functions

19:17 <BtbN> Where does it get all the needed info from

19:18 <TheVibeCoder> haasn: what happened with swscale2?

19:19 minimal has joined #ffmpeg-devel

19:24 <BtbN> This is quite baffling. Does the assembly parts of swscale just assume they can handle all formats?

19:24 <BtbN> I see very little format checks there, it just blindly sets c->yuv2planeX and c->yuv2plane1 if the CPU supports the extensions

19:28 <BtbN> Also, I added debug prints to me C conversion function. And despite it not being used cause of the ASM, the debug print still happened? Does it just scale twice, one pointless C run?

19:28 <jamrial> maybe an autoinserted scaler?

19:29 <BtbN> No, I'm adding a new pixfmt to swscale

19:29 <BtbN> oh, you mean it's scaling twice?

19:29 <BtbN> But why would one instance use the asm function, and the other the c one?

19:29 <jamrial> true

19:29 <BtbN> ./ffmpeg -v debug -f lavfi -i testsrc2=duration=10:size=1920x1080:rate=60 -vf format=yuv444p10msb -an -sn -c:v hevc_nvenc -y out.mp4

19:29 <BtbN> is the commandline

19:30 <BtbN> so it should be only one conversion, from yuv420p to yuv444p10msb

19:30 <TheVibeCoder> nooooooooooooooooooooooooooooooooooo

19:30 <BtbN> I'm a bit baffled that this is such a problem for swscale, given stuff like P010 exists, which already has stuff in the MSB

19:35 bsFFFFFF has joined #ffmpeg-devel

19:36 <jamrial> BtbN: for 10bit, the ASSIGN_VSCALEX_FUNC macro checks that isSemiPlanarYUV(dstfmt) is false before setting anything

19:36 <jamrial> in x86/swscale.c

19:36 <jamrial> so p010 is not covered

19:37 <jamrial> for this new fmt, you may need to add a isDataInHighBits() check

19:37 <BtbN> But ff_sws_init_swscale_x86 near instantly sets c->yuv2planeX, without ANY conditions

19:37 <BtbN> except the CPU supporting the instructions

19:38 <BtbN> Which I think is the relevant function when converting _to_ yuv444p10msb, given I also had to set it in output.c for the C variant?

19:38 <jamrial> only if use_mmx_vfilter is set it seems

19:39 <BtbN> This is such a mess, my god

19:47 <compnn> can you set the testsrc2 output pixfmt ?

19:48 * compnn runs

19:51 <TheVibeCoder> yes, but testsrc2 supports only some formats

19:54 <BtbN> jamrial: adding !isDataInHighBits into there seems to have done it

19:54 <BtbN> but this seems insanely brittle

19:55 <TheVibeCoder> rm -rf libswscale/

20:36 witchymary has joined #ffmpeg-devel

20:38 <jamrial> BtbN: the whole logic for setting those function pointers is madness, yeah

20:46 <BtbN> I think it works now

20:46 <BtbN> but my god

20:47 kasper93 has quit [Quit: kasper93]

20:48 kasper93 has joined #ffmpeg-devel

20:52 bsFFFFFF has quit [Quit: bsFFFFFF]

21:04 kurosu has quit [Quit: Connection closed for inactivity]

21:11 <kierank> what is yuv444p10msb

21:11 <kierank> surely that's yuv444p16?

21:11 <kierank> (kinda)

21:13 <BBB> it shows that ffmpeg is just a wrapper of other things nowadays...

21:13 * BBB runs

21:13 <BtbN> Well, it is yuv444p16, but 10 bit.

21:14 <BtbN> Same as P010 is P016 but 10 bit.

21:15 Mirarora has joined #ffmpeg-devel

21:15 <BtbN> I don't think that format has an actual name anywhere. Better ideas for its name are welcome.

21:16 <jkqxz> Depends on range - it's yuv444p16 * ((2^16-2^6) / (2^16-1)) as full range, so can only be used interchangably if you don't mind a bit of error.

21:17 <jkqxz> (Also things might care about ensuring that the low bits don't contain anything funny.)

21:17 <BtbN> It's been causing issues that nvdec/nvenc use yuv444p16, where it's actually only 10 or 12 bit, with the lowest bits just zeroed out

21:17 <BtbN> So I'd kinda like to get away from that

21:18 <BtbN> AV_PIX_FMT_P012 and AV_PIX_FMT_P212 are the semi-planar equivalent

21:18 <jkqxz> Won't nvenc use a 2:10:10:10 4-byte container anyway? Microsoft mandates that for hardware 4:4:4, so it would mess with all use on windows if nvenc didn't match.

21:19 <jkqxz> For 12, yes, the layout will be the same as 16.

21:19 <BtbN> nvenc only supports 8 or 10 bit, no 12 bit so far

21:20 <BtbN> And for 10 bit, the input format is either P010 or this new YUV444P10MSB

21:20 <BtbN> The bigger problem is how nvdec outputs 10 and 12 bit 444 content exclusively in YUV444P10MSB, which right now is simply pretended to be AV_PIX_FMT_YUV444P16

21:20 <jkqxz> Lol. So it can't do interop with D3D and all microsoft stuff? Great plan.

21:20 <BtbN> hm?

21:21 <BtbN> For interop with D3D stuff, they added AV_PIX_FMT_X2RGB10 support

21:21 <jkqxz> Then just always use that and ignore this new format which nothing else cares about?

21:21 <BtbN> How would I use an RGB format when decoding YUV video?

21:22 <BtbN> And again, nvdec _exclusively_ decodes 10 and 12 bit 4:4:4 content to YUV444P10MSB

21:22 mkver has quit [Ping timeout: 265 seconds]

21:22 <jkqxz> Microsoft mandates that YUV 4:4:4 is decoded to 2:10:10:10, so nvidia must implement that to do D3D. Do they not expose it in nvidialand?

21:24 <BtbN> for decoding? no

21:24 <BtbN> 4:4:4 ends up as AV_PIX_FMT_YUV444P, AV_PIX_FMT_YUV444P10MSB or AV_PIX_FMT_YUV444P12MSB

21:25 <jkqxz> Can you ask them to expose the hardware which gives you Y410 in that API, since the hardware certainly does it for D3D?

21:26 <BtbN> What is Y410?

21:26 <jkqxz> YUV in 2:10:10:10.

21:26 <BtbN> We don't seem to have that either

21:26 <jkqxz> <https://learn.microsoft.com/en-us/windows/win32/medfound/10-bit-and-16-bit-yuv-video-formats>

21:26 <BtbN> Or is that AV_PIX_FMT_XV30?

21:27 <jkqxz> Yes, because we don't want people to think there is an empty alpha channel there.

21:27 <BtbN> Actually, cuviddec seems to have grown a cudaVideoSurfaceFormat_P216 at some point

21:28 <BtbN> So for 4:2:2 it's a non-issue

21:28 <BtbN> It's specifically 4:4:4 where they invented a new format

21:28 <BtbN> And adding anything to that API will have a round trip time of multiple years at best

21:29 <jkqxz> The P216 is still going to mess with you on the range (do you need to multiply by (2^16-2^4)/(2^16-1) or not?).

21:29 <BtbN> multiply?

21:29 <BtbN> It's documented to have to LSB zeroed out

21:30 <BtbN> which matches how our pix_fmts are defined

21:30 <jkqxz> Yes. So if you read a P016 value which was 12-bit at source then you need to correct for the fact that the low bits are zero but you want it to map to 1.

21:30 <jkqxz> (In a GPU sampler, most notably.)

21:30 <BtbN> I don't understand why

21:31 <BtbN> our AV_PIX_FMT_P010LE/BE says "zeros in the low bits". And nvdec delivers zeros in the low bits.

21:31 <BtbN> So mapping their P016 to P010 and P012 seems like it works perfectly

21:31 <jkqxz> If you sample a 16-bit value as UNORM then it maps 16'b1111_1111_1111_1111 -> 1.0. But that's wrong, because you want 16'b1111_1111_1111_0000 -> 1.0.

21:32 <jkqxz> This is well-understood on P010 as well, but since the pixfmt tells you the origin depth you can apply the correction.

21:32 <jkqxz> But if you are pretending all of these are P016 then it goes wrong.

21:32 <BtbN> The nvdec API is explicitly documented to return zeros in the unused low bits when decoding 10 or 12 bit content and selecting P016/P216/YUV444P_16BIT

21:33 <BtbN> So I just set nvdec to return cudaVideoSurfaceFormat_P016, and tell the ffmpeg side it's P010 or P012 respectively. And it's a perfect match.

21:33 <jkqxz> Yes, so sample 16'b1111_1111_1111_0000 as UNORM and you get .99977 rather than the 1.0 which you wanted

21:34 <BtbN> I don't know what you mean

21:34 <BtbN> it's never accessed as 16 bit value. All sides agree on the format.

21:34 <jkqxz> Ok, so you are telling the consumer that it is P010 or P012 so they can sample correctly?

21:34 <jkqxz> If you return a AV_PIX_FMT_P016 then it is wrong because the consumer needs to know the source format.

21:35 <BtbN> Like I said, the cuvid/nvdec API only has P016, P216 and YUV444_16Bit as possible output settings.

21:35 <BtbN> But it's documented to zero out the low bits in each, depending on the contents bit depth

21:36 <BtbN> So when cuvid is set to P016 for 10 bit content, P010 comes out, or P012 for 12 bit content.

21:36 <BtbN> Guess it saves them a few enum values or something.

21:36 <BtbN> The odd one out is cudaVideoSurfaceFormat_YUV444_16Bit...

21:37 <BtbN> Cause they just followed the pattern of all the Pxxx formats, and stuffed the data in the MSB, and zeroed out the unused LSB.

21:37 <BtbN> Which is not a format that exists anywhere else but in nvidia land as far as I can tell

21:37 lemourin has quit [Quit: The Lounge - https://thelounge.chat]

21:38 <iive> if you extend from e.g. 12 to 16 bits, you copy some of the valid bits into the "empty" part.

21:39 <iive> e.g. 0x55a could become 0x55a5 or 0x55aa

21:39 <jkqxz> It sounds like you can't avoid adding a new pixfmt for that in each bit depth if the API is like that.

21:39 <iive> this way 0xfff becomes 0xffff

21:39 <iive> and 0x000 stays 0x0000

21:39 <BtbN> Yeah, which is how yuv444p10msb and yuv444p12msb were born :D

21:39 <jkqxz> iive: Yes, if you have software to do the extension then you copy the high bits into the low bits (xyz0 -> xyzx), but in this case it's all in GPU surfces which can't be easily modified like that.

21:39 <BtbN> it's yuv444p16 with data in the high bits and zeroed out low bits

21:40 lemourin has joined #ffmpeg-devel

21:40 <BtbN> It's not hard to modify it, but even to modify it I need to first get it out of the decoder in that format

21:41 <BtbN> that's kinda the whole idea, I need to get it out of the hwdec in the correct format, so scale_cuda can turn it into something normal with a cuda kernel

21:42 <BtbN> jkqxz: btw, right now nvdec/cuviddec very much have the problem you described: https://github.com/FFmpeg/FFmpeg/commit/e275959146c37adc1cce04993f7a37fb45431333

21:42 <BtbN> Which is exactly what I'm trying to fix with the new formats.

21:43 <BtbN> I just needed an equivalent to P010/P210 and friends for YUV444P16

21:44 <BtbN> At least I think I can't sensibly switch those pixel formats outside of a major bump... Cause it's very much an API break if the decoder suddenly returns a different pixel format.

21:44 lemourin has quit [Client Quit]

21:48 lemourin has joined #ffmpeg-devel

21:51 <iive> BtbN, can't you have both pixel formats? The api used to probe multiple format until one is accepted.

21:52 <BtbN> well, the API for that negotiates the CUDA pix_fmts

21:52 <BtbN> I don't think there is anything to negotiate the sw_format inside of the CUDA one

21:53 <BtbN> I also think it's not really applicable here, since the currently returned formats are just flat out wrong

21:54 <iive> :}

21:54 <BtbN> There just was nothing better when it was originally implemented

21:54 Mirarora has quit [Quit: Mirarora encountered a fatal error and needs to close]

21:54 <BtbN> There is some code in scale_cuda that just assumed YUV444P16 is 10 bit

21:55 <BtbN> cause for the longest time, that'd always hold true cause nothing else was supported

21:55 <BtbN> but now it could be 12 bit as well. Or someone could upload actual 16 bit content from elsewhere

21:58 Mirarora has joined #ffmpeg-devel

22:02 lemourin has quit [Quit: The Lounge - https://thelounge.chat]

22:05 lemourin has joined #ffmpeg-devel

22:14 Mirarora has quit [Quit: Mirarora encountered a fatal error and needs to close]

22:18 <jamrial> BtbN: can we ask nvidia to support outputting p416?

22:19 <jamrial> instead of adding these msb planar formats

22:19 <BtbN> I do plan to ask them to support more sane formats

22:19 <BtbN> but like I said, the turnaround times for that are LONG, and it would also mean people with not even that old hardware will never be able to use it

22:19 <BtbN> Cause legacy drivers won't ever gain those new features

22:20 <jamrial> cards that only support legacy drivers probably can't decode 10 and 12bit 4:4:4 :p

22:20 bwu25 has joined #ffmpeg-devel

22:21 <jamrial> kinda weird that they output semiplanar for everything but 4:4:4

22:21 <BtbN> Well, by the time a feature addition like that would ever see the light of day, they will

22:21 <BtbN> 1000 series are legacy now

22:21 Mirarora has joined #ffmpeg-devel

22:22 <BtbN> yeah, that choice of format is SUPER weird

22:23 MisterMinister has joined #ffmpeg-devel

22:36 Mirarora has quit [Quit: Mirarora encountered a fatal error and needs to close]

22:58 kasper93_ has joined #ffmpeg-devel

22:58 kasper93 is now known as Guest4536

22:58 Guest4536 has quit [Killed (calcium.libera.chat (Nickname regained by services))]

22:58 kasper93_ is now known as kasper93

23:14 foorum has joined #ffmpeg-devel

23:15 <foorum> 𝐢𝐫𝐜.𝐬𝐮𝐩𝐞𝐫𝐧𝐞𝐭𝐬.𝐨𝐫𝐠 #𝐬𝐮𝐩𝐞𝐫𝐛𝐨𝐰𝐥

23:15 <foorum> .''. *''* :_\/_: .

23:15 <foorum> :_\/_: . .:.*_\/_* : /\ : .'.:.'.

23:15 <foorum> .''.: /\ : _\(/_ ':'* /\ * : '..'. -=:o:=-

23:15 <foorum> :_\/_:'.:::. /)\*''* .|.* '.\'/.'_\(/_'.':'.'

23:15 <foorum> : /\ : ::::: '*_\/_* | | -= o =- /)\ ' *

23:15 <foorum> '..' ':::' * /\ * |'| .'/.\'. '._____

23:15 <foorum> * __*..* | | : |. |' .---"|

23:15 <foorum> _* .-' '-. | | .--'| || | _| |

23:15 <foorum> .-'| _.| | || '-__ | | | || |

23:15 <foorum> |' | |. | || | | | | || |

23:15 <foorum> ___| '-' ' "" '-' '-.' '` |____

23:15 <foorum> ~~~~~GOD⁠~~BLE⁠SS⁠~~T⁠HE~~UNI⁠TED~~S⁠TAT⁠ES~~OF~~AMERICA~~~~~~

23:15 <foorum> DOMINAT⁠ING⁠ LAND, SEA, A⁠ND I⁠R⁠C⁠

23:15 <foorum> foorum kasper93 MisterMinister bwu25 lemourin witchymary minimal iive System_Error secondcreek ShadowJK rvalue jamrial microchip_ averne quietvoid Coinflipper TheVibeCoder Martchus Traneptora wbs odrling chainik15 IndecisiveTurtle BradleyS ramiro lexano ^Neo_ HarshK23 marth64_ philipl s55 Xe desmond-netint _whitelogger beastd vriska Chagall ubitux bilboed michaelni markh Thulinma hpkn mateo`

23:15 <foorum> bbbccc khrbtxyz compnn jlj35 pelotron natto17 kepstin Teukka blb realies9 arch1t3cht1 elvis_a_presley MetaNova nevcairiel kylophone CounterPillow gnafu sepro Son_Goku zsoltiv_ zsoltiv j45 wellsakus dariusz novaphoenix DodoGTA paulk av500 keith acryo uau kode54 Flat steven-netint DauntlessOne49 linkmauve galad grillo_0 aaabbb termos RT|AO_ toots5446 LaserEyess wyatt8740 klaxa fflogger Lynne APic

23:15 <foorum> Gramner BtbN tmatth bossjones Dmitri_Ovch Nightrose jluthra Kwiboo ocrete1 ocrete vjaquez faidz haasn drv JEEB jannau tortoise mindfreeze sdc Manouchehri termos__ englishm jkhsjdhjs pal welder another| signalhunter stazthebox SuperFashi thardin marcj emersion frankplow Hassan01 rodgort jdarnley kwizart pross abdo sm2n bcheng hbbs Labnan any1 nitroxis j-b nto bencoh BBB Moon_Rabbit xvaclav

23:16 foorum has left #ffmpeg-devel [USA #1]

23:16 compnn is now known as Compn

23:17 <Compn> too bad i dont display unicode or i might see what channel he was spamming

23:17 <kode54> it's that stupor nets shit again

23:18 <Compn> remind me to -r later

23:18 <kode54> if I'm here later

23:18 <kode54> and remember

23:19 <Compn> +r is account to join, +R is account to talk

23:20 <kode54> what I meant was, in case I remember to mention it later

23:20 <kode54> I'll still be here

23:20 <Compn> no i know what you meaqnt

23:20 <Compn> was trying to remember what +r and +R did

23:20 <Compn> respectively

23:22 <System_Error> Oh wow. Someone still bothers to advertise IRC nets and channels.

23:23 <kode54> it's always that one

23:23 <kode54> and they have a special treat for people who join that channel

23:23 <kode54> the network force joins you to like 1000 gibberish named channels upon join

23:24 <kode54> or so I have heard, since I've never done it myself

23:24 <kode54> it was the reason some curious people inspired irccloud to invent the mass-close command

23:27 <System_Error> Traditional IRC mass close been /join 0. Libera somehow removed this though.

23:30 markh has quit [Remote host closed the connection]