stikonas has quit [Quit: Konversation terminated!]
naoki has quit [Quit: naoki]
hexdump02 has joined #linux-rockchip
naoki has joined #linux-rockchip
hexdump02 has quit [Ping timeout: 244 seconds]
hexdump02 has joined #linux-rockchip
_whitelogger has joined #linux-rockchip
cbeznea has joined #linux-rockchip
naoki has quit [Quit: naoki]
franoosh has joined #linux-rockchip
eballetbo has joined #linux-rockchip
ldevulder has joined #linux-rockchip
franoosh has quit [Remote host closed the connection]
franoosh has joined #linux-rockchip
raster has joined #linux-rockchip
mripard has joined #linux-rockchip
raster has quit [Ping timeout: 256 seconds]
raster has joined #linux-rockchip
<Ermine>
Is it good idea to use 16K page size on Rockchip hardware?
<CounterPillow>
is it a good idea on any hardware?
<Ermine>
Asahi uses 16K pages, and iirc performance was among the reasons they use them
<Ermine>
(the other reason being Mac hardware expecting 16K pages or something like that)
<Dex2x>
the rpi as well?
<CounterPillow>
it's arguable whether it's a good idea there because now when they want to do x86 emulation they also have to translate page sizes, which is an additional layer
<CounterPillow>
I'm also not convinced it's a good idea for performance on low memory system because I'd imagine having a larger page size will result in a lot more fragmentation
<Dex2x>
low memory being < 4G ?
<CounterPillow>
but these are things you can measure for your specific workload I'd imagine, and then do actual statistical analysis on the results so you're not just going off vibes
chewitt has joined #linux-rockchip
raster has quit [Quit: Gettin' stinky!]
<diederik>
I've read somewhere (phoronix?) that it gave a slight performance boost ... on systems with 192GB RAM ...
<diederik>
"the other reason" is AFAICT the whole reason they went with 16K page size ... because the HW only supports 16K page size
dsimic has quit [Ping timeout: 255 seconds]
dsimic has joined #linux-rockchip
naoki has joined #linux-rockchip
naoki has quit [Client Quit]
<Ermine>
CounterPillow: indeed
chewitt_ has joined #linux-rockchip
chewitt has quit [Ping timeout: 248 seconds]
chewitt_ is now known as chewitt
<Dex2x>
wait does the RPiOS not default to 16k for the rpi5 ?
<Dex2x>
and if I am reading this datasheet correctly the RK3588 support 4k and 64k page size MMU's
<chewitt>
I'm also seeing a performance difference between RK3588 and RK3576
<chewitt>
3588 can play Jellyfish test files <= 300mbps without issues
<chewitt>
(both HEVC, HEVC 10-bit and H264)
<chewitt>
3576 manages <= 15mbps
<chewitt>
356X implemented by borrowing large amounts of the your vdpu381 decoder, manages <= 10mbps
<chewitt>
that seems rather low compared to 3588
<chewitt>
so I'm wondering if that's a known thing, or if there's any ideas on what the issue might be?
raster has joined #linux-rockchip
<detlevc>
chewitt That branch is probably not completely ready yet (And mostly tested on 3588 only so far). But yes, that is a big performance difference and I don't think there is that much more setup code. I can do some tracing later this week and see what the hardware does
<chewitt>
detlevc I did notice that the vendor kernel has 'hacks' for 3576 (and 356X)
<chewitt>
there's no mention of anything similar for 3576 or 3588 so perhaps it's just a leftover from older code
Danct12 has joined #linux-rockchip
asriel has joined #linux-rockchip
psydroid2 has joined #linux-rockchip
<ndufresne>
chewitt: on 356X, the hacks was about quick transition from H.264 to another supported CODEC, I don't think there has been any look to that upstream yet
<chewitt>
ahh, interesting
<ndufresne>
so typically use case, of a random file playlist with transition quick enough so the IP isn't being suspended in between
<ndufresne>
hopefully I understood that one correct
<chewitt>
doesn't sound like it would be the source of any performance issues (as I've seen)
franoosh has quit [Remote host closed the connection]
walter has quit [Quit: Konversation terminated!]
<ndufresne>
No, for that I would start by checking if all the clocks are at their expected speed
<ndufresne>
(note that mbps does not mean much in general)
<chewitt>
the assigned-clock-rate(s) for 3576 match the vendor dts
<chewitt>
same for 356x
<chewitt>
the assumption is that the vendor dts is correct :)
<ndufresne>
then check the AXI clocks, RAM speed, burst configuration
<ndufresne>
I'm assuming you have measure the speed on vendor BSP to be better ?
<chewitt>
I've never booted any RK board with a vendor image
<chewitt>
I'm not sure if that's a badge of honour or not? :)
<ndufresne>
chewitt: one thing special about rk3588 is that ram is fast, could be a difference, and that get solved by enabling AFBC (we aren't there yet though)
<ndufresne>
chewitt: well, to develop these, whenever we can, we uplift vendor driver, so its easier to switch from one to another
<chewitt>
I did spot a mention of 300Mhz 'max' somewhere in the 3568 TRM though
<chewitt>
so you'd hope the vendor driver came somewhere near that
<ndufresne>
RAM is slow on 356x, I once had a project that aborted because of that
<chewitt>
even ye olde Amlogic S905 with DDR3 is able to play higher bitrate Jellyfish files than a 3576 board with LPDDR5
<chewitt>
i'll need to learn some tricks to do actual scientific tests, but that alone suggests something is off..
raster has quit [Ping timeout: 256 seconds]
<ndufresne>
But you are testing performance with the display path, how can you know this isn't display related ?
<ndufresne>
I can tell you how to isolate and performance test the codecs using GStreamer, but I don't know exactly how that works with ffmpeg (should be feasible)
raster has joined #linux-rockchip
raster has quit [Ping timeout: 245 seconds]
ldevulder has quit [Ping timeout: 240 seconds]
chewitt has quit [Quit: Zzz..]
walter has joined #linux-rockchip
walter has joined #linux-rockchip
ldevulder has joined #linux-rockchip
franoosh has joined #linux-rockchip
godvino has quit [Ping timeout: 264 seconds]
stikonas has joined #linux-rockchip
eballetbo has quit [Quit: Connection closed for inactivity]