rpcsx

mirror of https://github.com/RPCSX/rpcsx.git synced 2026-01-27 19:04:28 +01:00

Author	SHA1	Message	Date
Megamouse	a3eb5c2d63	More Header cleanup	2020-11-06 22:14:05 +01:00
kd-11	d012abd924	vk: Improve image transfer and scaling - Handle typeless src and dst with aliased typeless format - Optimize typeless transfers by only dealing with affected texels. * Eliminates redundant dst->typeless transfer of full image (very expensive) * Eliminates full src->typeless transfer of full image and replaces with only affected region * Requires significantly smaller output buffers, saving on VRAM cost	2020-09-22 12:19:54 +03:00
kd-11	85dd1b4ea9	vk: Fix fconvert job issues - Fix compilation bug caused by typo - Invert to/from for consistent declarations - Fix dst_swap when From == 2	2020-09-07 18:25:54 +03:00
kd-11	af9e217fa4	vk: Improve D16F handling - Adds upload and download routines. Mostly untested, which is why the error message exists	2020-08-30 09:26:37 +03:00
kd-11	e8274d5a59	vk: Fix depth format mismatch detection in copy_image	2020-08-29 02:03:09 +01:00
kd-11	d257ba5156	vk: Add some more diagnostic messages for unoptimized image transfer setups	2020-08-27 12:52:28 +03:00
kd-11	65ead08880	rsx: Refactor and improve image memory manipulation routines	2020-08-27 12:52:28 +03:00
kd-11	f6c6c04648	vk: Implement transport for D24S8_FLOAT data	2020-08-27 12:52:28 +03:00
kd-11	faaf28b41d	rsx: Basic support for creating depth float formats	2020-08-27 12:52:28 +03:00
kd-11	b41349546c	rsx: Proper support for typeless transform of ABGR framebuffers using the RGBA8 format	2020-08-12 20:19:19 +03:00
kd-11	b437794e92	vk: Improve nvidia speedhack for non-turing cards - Inverts the chip family check to skip any unidentified GPUs altogether	2020-06-28 22:54:58 +03:00
kd-11	d25ba03e82	vk: Lazy evaluate renderpass scope - Spamming the driver with renderpass open/close cycles is bad for performance.	2020-03-15 18:39:40 +03:00
Nekotekina Aux1	f2f3321952	Fix warnings in VKGSRender	2020-03-04 21:23:34 +03:00
gamerforEA	93552a5958	Apply some Clang-Tidy fixes	2020-02-27 00:38:55 +03:00
Nekotekina	c0f80cfe7a	Use attributes for LIKELY/UNLIKELY Remove LIKELY/UNLIKELY macro.	2020-02-05 10:42:34 +03:00
Nekotekina	15391f45d0	Modernize RSX logging (rsx_log variable)	2020-02-01 11:52:22 +03:00
kd-11	0a2b6a290d	vk: Fixup - Scaling is not needed for a direct typeless transfer!	2020-01-17 14:31:14 +03:00
kd-11	9b34f00241	vk: Optimize image transfers - Adds the same optimization/simplification steps to complex image transfer routines. Whenever possible, multi-step transfers are collapsed into a single operation.	2020-01-16 22:29:26 +03:00
kd-11	621fab2ad9	vk: Fix D32S8 interpolation by using integer interpolation instead of floating point - Interpolating floats is not the same as interpolating their bits! Use integer format to interpolate linearly for D32F formats instead of using R32F as intermediary	2020-01-16 11:12:08 +03:00
kd-11	086ecf4ba6	vk: Add some missing image memory barriers causing artifacting on AMD cards - There needs to be a memory barrier after each step. - TODO: Optimize scale_typeless_safe function	2020-01-16 11:12:08 +03:00
kd-11	3d96fe79cc	vk: Implement dynamic sized compute heap - Implements a dynamically sized compute heap to allow growing up the size if it is too small.	2020-01-15 15:42:36 +03:00
Nekotekina	377e7d2a73	C-style cast cleanup VI	2019-12-04 17:56:22 +03:00
kd-11	fd751e3e7b	rsx: Improve blit format mismatch detection	2019-11-19 13:18:15 +03:00
kd-11	4a0e1c79ed	rsx: Improve format validation for blit engine - Check all possible cases where format mismatch is possible. - Warn if a slow path is going to be taken. Should help with future optimizations.	2019-11-18 13:17:00 +03:00
kd-11	c415578e79	vk: Clamp buffer row length to never be less than declared width - Fixes some games with broken textures	2019-11-18 13:17:00 +03:00
Emmanuel Gil Peyrot	f76720ceb0	Remove extraneous ::narrow<int>() calls GSL’s gsl::span didn’t use the correct type for its index_type, which is why they were needed.	2019-11-09 19:30:06 +01:00
Emmanuel Gil Peyrot	ef368c5171	rsx: Replace gsl::byte with C++17’s std::byte	2019-11-09 19:30:05 +01:00
kd-11	99d71fdc2a	vk: Implement layer batching for the GPU swizzle decoder - Handles all LODs per layer meaning cubemaps are now fully handled in 6 passes instead of 6 * (log2(width)) passes. - Handles all LODs of a 3D texture in one pass as well. - The improvements do warrant dropping down the number of allowed compute invocations a bit	2019-11-05 22:07:22 +03:00
kd-11	1266b63135	vk: Enable gpu deswizzling	2019-11-05 22:07:22 +03:00
kd-11	9cd3530c98	rsx: Set up framework for hw deswizzle	2019-11-05 22:07:22 +03:00
kd-11	aa3eeaa417	rsx: Separate subresource_layout::dim_in_block and subresource_layout::dim_in_texel - These two are not always linked when working with compressed textures. The actual texels extend past the actual size of the image if the size is not aligned. e.g. if height is 1, the real height is 4, but it's not possible to determine this from the aligned size. It could be 1, 2, 3 or 4 for example. - Fixes image out-of-bounds writes when uploading from CPU	2019-10-29 20:03:54 +03:00
kd-11	ee0633f43a	vk: Add turing workaround - Turing crashes if using the depth->color transfer hack	2019-09-26 20:12:25 +03:00
kd-11	cc313b052f	rsx: Improve hit testing when scanning for overlapping surfaces - Calculate exact sizes when doing hit tests to avoid false negatives - Defer page checking until actually required to do memory setup - Introduce align2 helper to do non-pow2 alignments	2019-09-12 23:32:21 +03:00
kd-11	858014b718	rsx: Experiments with nul sink	2019-09-12 23:32:21 +03:00
kd-11	d1603fbb0b	vk: Crop malformed image descriptors - Some image descriptors (lle vdec?) are malformed with pitch being smaller than width - Crop these for now pending hardware tests	2019-09-08 18:22:27 +03:00
kd-11	cbce309199	vk: Fix depth_stencil scaling	2019-09-08 13:56:41 +03:00
kd-11	440d58f2ff	vk: Batch compute jobs when doing texture upload - Reduces overall number of invocations	2019-09-07 16:23:20 +03:00
kd-11	6aa0b49dbc	vk: Prefer using native alignment when uploading. - Allows using fast copy paths and reduces memory and compute footprint	2019-09-07 16:23:20 +03:00
kd-11	99fb6d6a5d	rsx: Allow GPU-accelerated stream manipulation when doing texture uploads	2019-08-30 21:46:19 +03:00
kd-11	141072023b	rsx: Fix handling of ARGB8 memory - Load into memory as straightforward BGRA - Fixes a bug in vulkan caused by byte shuffling in blit engine vs shader access - Removes the need for memory shuffling when transferring into a rendertarget	2019-08-21 21:17:15 +03:00
kd-11	dfe709d464	rsx: Surface cache restructuring - Further improve aliased data preservation by unconditionally scanning. It is possible for cache aliasing to occur when doing memory split. - Also sets up for RCB/RDB implementation	2019-08-18 20:45:48 +03:00
JohnHolmesII	23094b48bb	Fix warnings related to -Wswitch Add default cases. Move default breaks to newline Add proper handling in some instances. Add missing enums to switches	2019-06-28 01:40:52 +03:00
kd-11	a245d9fb24	vk: Double general-purpose heap allocation to 128M and add a better diagnostic message for OOM	2019-05-19 17:33:21 +03:00
kd-11	e3cf3ab6b8	rsx: Minor fixes - Fix transfer scaling (inverted) - Fix under-estimated typeless acquisition when doing depth format scaling	2019-05-16 19:25:26 +03:00
kd-11	1c439f6198	vk: Fix some spec violations	2019-05-16 19:25:26 +03:00
kd-11	12dc3c1872	vk: Dynamic heap management to potentially fix ring buffer overflows - Allows checking one heap type at a time, on demand - Should avoid OOM situations unless inside an uninterruptible block	2019-04-09 13:40:54 +03:00
kd-11	a5ed30a8c0	rsx: Fixups for data cast operations via typeless transfer	2019-04-09 13:40:54 +03:00
kd-11	3249000511	rsx: Improvements to texture scanning - Removes CPU-only transforms that broke GPU-side code. -- Channels in GPU compute are laid out in cell-order, but CPU was uploading in favorable order and compensating with swizzles. -- This leads to 2 different layouts depending on the location of the data (CPU vs GPU) - Implement R8G8_R8B8 interleaved format decode - General improvements	2019-04-09 13:40:54 +03:00
kd-11	0f7af391d7	vk: Implement copy-to-buffer and copy-from-buffer for depth_stencil formats - Allows D24S8 and D32S8 transport via typeless channels - Allows uploading and downloading D24S8 data easily - TODO: Implement optional byteswapping to fix flushed readbacks with the same method	2019-04-09 13:40:54 +03:00
kd-11	bb65e45614	rsx: Implement GPU acceleration for rotated images	2019-03-17 21:50:11 +03:00

95 commits