rpcsx

mirror of https://github.com/RPCSX/rpcsx.git synced 2026-02-10 09:44:33 +01:00

Author	SHA1	Message	Date
kd-11	b957eac6e8	rsx: Avoid calling any blocking callbacks from threads that are not rsx::thread - Defers on_notity_memory_unmapped to only run from within rsx context - Avoids passive_lock + writer_lock deadlock	2018-05-23 19:07:08 +03:00
pauls-gh	f8a0be8c3e	Performance enhancement - Vulkan memory allocator (#4635 ) * Incorporates the vulkan memory allocator from the AMD GPUOpen project	2018-05-23 17:02:35 +03:00
kd-11	c9669818eb	Facepalm - overlays: Do not free self handle!!!!	2018-05-21 15:55:25 +03:00
kd-11	f6f45b8699	Native UI refactored (#4623 ) Refactor and improve native overlays	2018-05-20 23:05:00 +03:00
scribam	04ad49de4d	typos	2018-05-14 21:14:39 +04:00
kd-11	eccb57d4b8	vk: AMD primitive restart bug workaround - Emulate primitive restart with degenerate triangles	2018-05-13 14:44:14 +03:00
kd-11	b7979d3f57	rsx/vk: Improvements and minor optimizations - Improve dirty state tracking affecting program state - vk: Refactor out transform constants upload into a separate channel to avoid if possible transform data uploads are quite expensive	2018-05-13 14:44:14 +03:00
kd-11	440a31ef18	rsx: Optimizations for program management	2018-05-13 14:44:14 +03:00
kd-11	a52ea7f870	rsx: Improve fragment and vertex program usage - Introduces a gpu program analyser step to examine shader contents before attempting compilation or cache search - Avoids detecting shader as being different because of unused textures having state changes - Adds better program size detection for vertex programs - Improved vertex program decompiler - Properly support CAL type instructions - Support jumping over instructions marked with a termination marker with BRA/CAL class opcodes - Fix SRC checks and abort - Fix CC register initialization - NOTE: Even unused SRC registers have to be valid (usually referencing in.POS)	2018-05-13 14:44:14 +03:00
kd-11	7e32e7343a	vk: Reorganize handling of formats support - Formats support is linked to the physical device and by extension the logical device derived from it It therefore makes no sense to track this as a separate object. Simplifies parameter passing and template specialization. Also avoids corner cases with AMD hardware (where D24S8 is not supported)	2018-04-25 19:14:36 +03:00
kd-11	91a6091d26	rsx: Minor fixes - vk: Clear dirty textures before copying 'old contents' in case the old data does not fill the new region - rsx: Properly decode border color - seems to be in BGRA format - vk: better approximation of border color to better choose between the presets - vk: Individually clear color images outside render pass and without scissor - vk: Fix renderpass selection for clear overlay pass - vk: Include scissor region when emulating clear mask NOTES: - vk: Completely avoid using vkClearXXXXimage - its 'broken' on nvidia drivers Spec is vague about the function so its not an actual bug ClearAttachment is clearly defined as bypassing bound state which works correctly - TODO: Implement memory sampling to simulate loading precleared memory if cell used memset to preinitialize the framebuffer Autoclear depth to 1\|255 and color to 0 is hacky!	2018-04-25 19:14:36 +03:00
kd-11	a42b00488d	rsx: Texture fixes - gl/vk: Fix subresource copy/blit - gl/vk: Fix default_component_map reading - vk: Reimplement cell readback path and improve software channel decoder - Properly name the subresource layout field - its in blocks not bytes! - Implement d24s8 upload from memory correctly - Do not ignore DEPTH_FLOAT textures - they are depth textures and abide by the depth compare rules - NOTE: Redirection of 16-bit textures is not implemented yet	2018-04-25 19:14:36 +03:00
kd-11	63d9cb37ec	rsx: Framebuffer fixes Primary: - Fix SET_SURFACE_CLEAR channel mask - it has been wrong for all these years! Layout is RGBA not ARGB/BGRA like other registers Other Fixes: - vk: Implement subchannel clears using overla pass - vk: Simplify and clean up state management - gl: Fix nullptr deref in case of failed subresource copy - vk/gl: Ignore float buffer clears as hardware seems to do	2018-04-25 19:14:36 +03:00
kd-11	c5cd758700	rsx: Workaround for G8B8 render targets - Mainly affected are colormasks and read swizzles NOTES: - Writes to G write to the second and fourth component (YW) - Writes to B write to first and third component (XZ) - This means the actual format layout is BGBG (RGBA) making RG mapping actually GR - Clear does not seem to have any intended effect on this format (TLOU)	2018-04-25 19:14:36 +03:00
Talkashie	64992f758d	Fix typos (#4410 ) * MASSIVE TYPO FIX part 1 * ANOTHER HUUUUGE TYPO FIX part 2 * thank you :hcorion: for all of your help. I could not have done this without you	2018-04-08 01:01:39 +01:00
pauls-gh	a17025c465	Strict Rendering Mode (SRM) fix. Move old surface copy before texture upload. Fixes the following issues on Tales of Vesperia which requires SRM. - Blacked out scene after the sleeping dog now renders correctly - Ghosting effect. The ghosting was most noticeable as a delay between the character rendering and the cell shading around the character. This appears to be gone with this change.	2018-03-29 11:01:58 +03:00
kd-11	321c360dcb	rsx: Overhaul rendertarget sampling/shuffles - Reimplements render target views used for sampling - Optimizes access using an encoded control token - Adds proper encoding for 24-bit textures (DRGB8 -> ORGB/OBGR) - Adds proper encoding for ABGR textures (ABGR8 -> ARGB8) - Silence some compiler warnings as well - TODO: Real texture views for OGL current method is a hack	2018-03-25 13:31:06 +03:00
kd-11	aeebeed0f2	vk: Fix AMD primitive restart emulation when strict mode is active The restart emulation is there to keep the proprietary drivers from randomly crashing when using primitive restart	2018-03-25 13:31:06 +03:00
Megamouse	9d961f620b	rsx/Qt: add option to disable the shader compilation hint	2018-03-22 16:33:37 +04:00
kd-11	d13584f858	rsx: fixups gl/vk: Bump shader cache version gl/vk: Disable anisotropic override when strict mode enabled as it is proven to alter some games negatively gl: Clamp buffer view range to not exceed the backing buffer size. Also add assert for the same condition	2018-03-19 12:13:34 +03:00
kd-11	910fc54ee2	vk: Implement reading from cell if swap image isn't found	2018-03-13 18:55:03 +03:00
kd-11	f00d9a7c7f	rssx" Halfplement alpha-to-coverage AA transparency	2018-03-13 18:55:03 +03:00
kd-11	315798b1f4	rsx: ZCULL rewrite and other improvements - ZCULL unit emulation rewritten - ZCULL reports are now deferred avoiding pipeline stalls - Minor optimizations; replaced std::mutex with shared_mutex where contention is rare - Silence unnecessary error message - Small improvement to out of memory handling for vulkan and slightly bump vertex buffer heap	2018-03-13 18:55:03 +03:00
kd-11	a19ffba8e8	rsx: Simplify MRT blend setup; Enable separable MRT blend on vulkan and fix corner cases for GL	2018-03-13 18:55:03 +03:00
kd-11	e230867492	rsx: Properly implement raster window offsets	2018-03-13 18:55:03 +03:00
kd-11	84b8a08d26	rsx: Basic performance counters	2018-03-13 18:55:03 +03:00
kd-11	20d4c09a1c	rsx/vk/gl: Enforce format matching for render target resources. Fall back to raw data copy if match fails - Forces Bitcast of texture data if input format cannot possibly be the same as the existing texture format - rsx: Other minor improvements to texture cache :- - remove obsolete blit engine incompatibility warning. The texture will be re-uploaded if it is indeed incompatible - Implement warn_once and err_once to avoid spamming the log with systemic errors - Track mispredicted flushes - Reswizzle bitcasted texture data to native layout TODO: Also needs reshuffle according to input remap vector	2018-03-13 18:55:03 +03:00
kd-11	705820c430	rsx: Nvidia driver compatibility workarounds - Sanitize NaN values before they reach the driver. On nvidia (X * NaN = X)	2018-03-13 18:55:03 +03:00
kd-11	af1b13550b	rsx/vk: More optimizations - Do not bother rechecking the dirty sampler pool for hits. Its faster to create new sampler than to search the pool - Reserve some memory on vertex layout struct to reduce reallocation penalty	2018-03-13 18:55:03 +03:00
kd-11	8ccaabb502	vulkan: Optimize vertex data upload - Reuse buffer views as much as possible, vkCreateBufferView is slow on NV Implemented as a large sliding window, reuseable until it is filled	2018-03-13 18:55:03 +03:00
kd-11	77f2b521e1	vulkan: Swapchains reimplemented - Adds support for abstract implementations - Adds native windowing implementations for WIN32 and X11 as fallbacks when present support is lacking (headless configs)	2018-02-21 14:59:46 +03:00
kd-11	a8ab408f64	rsx: Account for null blit ops (memcpy) - Do not perform extra memory tasks if no actual image copy was performed	2018-02-16 16:14:54 +03:00
kd-11	661b8b006f	rsx: Add texture readback statistics to the texture cache and debug overlay	2018-02-16 16:14:54 +03:00
kd-11	1bd77c2f51	rsx: Add cache pattern checking to blit engine resources - Feature was implemented long ago but was not functional due to bugs	2018-02-16 16:14:54 +03:00
kd-11	c191a98ec3	vulkan API fixes - Fix for texture barriers - vulkan: Rework texture cache handling of depth surfaces - Support for scaled depth blit using overlay pass - Support proper readback of D24S8 in both D32F_S8 and D24U_S8 variants - Optimize the depth conversion routines with SSE - vulkan: Replace slow single element copy with std::memcpy - Check heap status before attempting blit operations - Bump guard size on upload buffer as well	2018-02-16 16:14:54 +03:00
kd-11	bd297d079d	rsx: Minor optimizations	2018-02-16 16:14:54 +03:00
kd-11	89c548b5d3	rsx: fbo fixes 2.5 - Implement flush-always behaviour to partially fix readback from a currently bound fbo - Without this, only the first read is correct, as more draws are added the results become 'wrong' - Fixes WCB and cpublit behviour - Synchronize blit_dst surfaces to avoid data loss when gpu texture scaling is used - Its still faster in such cases to disable gpu texture scaling but some types cannot be disabled without force cpu blit (e.g framebuffer transfers) - Memory management tuning - rsx: on-demand texture cache rescanning for unprotected sections - rsx: Only framebuffer resources are upscaled - Do not resize regular blit engine resources - Lazy initialize readback buffer when using opengl -- These measures should help minimize vram usage	2018-02-16 16:14:54 +03:00
kd-11	f20fd217f8	rsx: Reorganize framebuffer setup code - Fixes some fast paths for framebuffer creation and binding	2018-02-16 16:14:54 +03:00
kd-11	e7537cded5	vk: Also discard background if window is too small in vertical axis	2018-02-02 10:07:55 +03:00
kd-11	ea8bdda9a3	rsx/gl/vk: Support for swizzled? context surfaces - For some surfaces, dimensions are passed via the log2 bits rather than surface pitch -- This is similar to the setup for nv406e and probably means the surfaces are padded and swizzled	2018-02-02 10:07:55 +03:00
kd-11	4f7d3e5dc1	vk: Stuff - Remove subpass dependencies; transitions are handled via exicit imagememrybarriers - Reuse sampler objects whenever possible; create/delete cycles are not free	2018-01-30 21:16:43 +03:00
ZeroZero2018	cd8e97a7c6	Fix to B8 format render target swizzling (#4123 )	2018-01-29 21:58:25 +03:00
kd-11	4f01794713	Minor fixes - vulkan: Do not assume an aux frame context must exist in a well defined state as set in init_buffers() since the request might be external (via overlays path) - gl: Do not bother waiting for idle before servicing external flip requests - gl: Queue overlay cleanup requests to ensure only glthread attempts touching the context - overlays: Do not compute size metrics for invalid/unsupported glyphs	2018-01-22 11:43:35 +03:00
kd-11	3d9e3a16f1	rsx/gl/vk: Fixes and optimizations - opengl driver optimization for nvidia. On nvidia glTextureBufferRange performance is horrendous -- Initialize texture buffer to whole buffer at startup and use absolute offsets to read data instead -- Over 2x performance in some cases (Resogun, TNT racers) - gl/vk: Do not flip non-existent display buffers. Fixes spec violation at boot in TNT racers demo - whitespace fixes for sys_rsx	2018-01-22 11:43:35 +03:00
kd-11	0a2992839b	rsx/gl/vk: Simulate z clipping with selective depth clamp - The scale offset matrix is fine but on real hardware the z results seem to be independent of near/far clipping distances -- If depth falls within near/far, clamp depth value to [0,1]	2018-01-19 12:03:57 +03:00
kd-11	9ec2337192	rsx: Synchronization improvements - Always flush the primary queue and wait if not involking readback from rsx thread -- Should fix some instances of device_lost when using WCB -- Marked remaining case as TODO -- TODO: optimize amount of time rsx waits for external threads trying to read	2018-01-19 12:03:57 +03:00
kd-11	71f69d1d48	rsx/overlays: Introduce 'native' HUD UI and implement some common dialogs (#4011 )	2018-01-17 19:14:00 +03:00
Greg V	fbceec47b8	Add support for Vulkan on Wayland The variable VK_USE_PLATFORM_WAYLAND_KHR is actually used by the Vulkan header, so use it here too.	2018-01-11 12:26:41 +03:00
kd-11	d496dbecad	rsx: Implement depth clamping	2017-12-31 12:43:40 +03:00
kd-11	b1a1c0251f	rsx: Implement variable point size	2017-12-18 10:45:37 +03:00

1 2 3 4 5 ...

272 commits