rpcsx

mirror of https://github.com/RPCSX/rpcsx.git synced 2025-12-06 07:12:14 +01:00

Author	SHA1	Message	Date
kd-11	6a9f234dc7	rsx: Fixup flip behaviour - handle_emu_flip is very heavy, only fire	2018-09-26 19:41:50 +03:00
kd-11	f72157bcec	rsx: Fix vertex attrib parsing	2018-09-25 22:03:35 +03:00
kd-11	a3d44b5e1f	rsx: Cleanup changes for the flip patch	2018-09-24 16:44:02 +03:00
Jake	699eadc84f	rsx: Move render flip from rsx queue command to flip command	2018-09-24 16:44:02 +03:00
Rui Pinheiro	35139ebf5d	Texture cache cleanup, refactoring and fixes	2018-09-24 15:26:40 +03:00
Rui Pinheiro	f3029b2b42	Change Cell->RSX map/unmap notifications This allows for further flexibility on the RSX side, allowing us to fix some bugs and crashes in later commits.	2018-09-24 15:26:40 +03:00
eladash	e0a676a3fe	rsx: Fix vertex arrays fetch with inlined draws	2018-09-24 13:25:05 +03:00
eladash	e6b68b260a	rsx: Improve FIFO mem faults handling increase the delay between faults, reduce log spam by allowing the messages to stack up	2018-09-20 01:05:40 +03:00
eladash	a8ea576b22	rsx/cellgcm: Implemet initialization registers reset	2018-09-20 01:05:40 +03:00
elad	d24f9194f7	typo fix shader's location is decremented by one to match cellGcm's constants.	2018-09-13 16:49:58 +03:00
eladash	b9ad578b00	rsx: Add a default shader address state	2018-09-13 16:49:58 +03:00
scribam	4cb98014a2	rsx: tiny zcull optimizations	2018-09-13 12:43:40 +03:00
eladash	efbd77deb4	rsx: dont silently ignore null shader address	2018-09-12 00:40:20 +03:00
Nekotekina	ca5158a03e	Cleanup semaphore<> (sema.h) and mutex.h (shared_mutex) Remove semaphore_lock and writer_lock classes, replace with std::lock_guard Change semaphore<> interface to Lockable (+ exotic try_unlock method)	2018-09-03 23:00:36 +03:00
kd-11	2e0ecb556c	rsx: Possible fix for UB data type consistency	2018-09-03 18:24:20 +03:00
kd-11	6399833182	rsx: Fix endianness order when immediate mode register is updated, but used as register lookup - Simplify the code by unifying all the register-backed memory	2018-09-03 18:24:20 +03:00
kd-11	c6e35706a3	vk: Support sw component swizzle decode because metal sucks	2018-08-23 22:54:56 +03:00
eladash	56d553f10d	Rsx: fix cmd jump over put register	2018-08-22 12:20:31 +03:00
kd-11	dd21e43ed5	rsx: Force disable draw reordering when capturing a frame	2018-08-18 16:14:30 +03:00
kd-11	0f36e87010	zcull: Improve the delay algorithm to be more consistent - Use proper time checking; depending on what is being done one 'tick' can be almost a millisecond long or several nanoseconds - Avoid spamming the system timer unless necessary	2018-08-18 16:14:30 +03:00
kd-11	38191c3013	rsx: Avoid acquiring the vm lock; deadlock evasion - A possible deadlock is still present if rsx is trying to get a super_ptr whilst the vm lock holder is in an access violation This patch makes this scenario very unlikely since each block need only be touched once	2018-08-18 16:14:30 +03:00
kd-11	373e02e91c	rsx: Timestamp accuracy workaround	2018-08-18 16:14:30 +03:00
kd-11	1200ca8172	rsx: Optimize hash_struct; vk cleanup	2018-08-18 16:14:30 +03:00
kd-11	d0165290b6	rsx: Refactor and fix framebuffer layout checks - Refactors shared code back into rsx core - Adds extra check to avoid contest confusion	2018-08-18 16:14:30 +03:00
kd-11	9f0fada17a	ZCULL: lower notice severity to avoid spam	2018-08-18 16:14:30 +03:00
kd-11	8800c10476	zcull synchronization tweaks - Implement forced reading when calling update method to sync partial lists - Defer conditional render evaluation and use a read barrier to avoid extra work - Fix HLE gcm library when binding tiles & zcull RAM	2018-08-18 16:14:30 +03:00
kd-11	3b47e43380	rsx: Synchronization rewritten - Do not do a full sync on a texture read barrier - Avoid calling zcull sync in FIFO spin wait - Do not flush memory to cache from the renderer side; this method is now obsolete	2018-08-18 16:14:30 +03:00
eladash	f349695a75	Rsx: rewrite address translation	2018-08-13 16:16:34 +03:00
eladash	9e380a4a4a	Rsx: avoid invalid cmds execution	2018-08-12 23:37:24 +03:00
eladash	c80eb1ba02	Rsx: fix CALL and RET cmd	2018-08-12 23:37:24 +03:00
Megamouse	eecb984689	RSX/Qt: add more performance overlay options to the gui	2018-07-28 23:10:45 +02:00
kd-11	19d808d378	rsx/gl: Minor cleanup and optimization - Track register change status - Remove unused gl classes	2018-07-22 17:19:59 +03:00
kd-11	e7f30640ef	rsx: Async shader compilation - Defer compilation process to worker threads - vulkan: Fixup for graphics_pipeline_state. Never use struct assignment operator on vk** structs due to padding after sType member (4 bytes)	2018-07-14 15:19:56 +03:00
kd-11	fa55a8072c	rsx: Improve vertex textures support - Adds proper support for vertex textures, including dimensions other than 2D textures - Minor analyser fixup, removes spurious 'analyser failed' errors - Minor optimizations for program state tracking	2018-07-12 18:02:28 +03:00
kd-11	4d40ed9dbd	rsx: Silence harmless warning	2018-07-07 16:20:33 +03:00
kd-11	2ca935a26b	vp: Improve vertex program analyser - Adds dead code elimination - Fix absolute branch target addresses to take base address into account - Patch branch targets relative to base address to improve hash matching - Bumps shader cache version - Enables shader logging option to write out vertex program binary, helpful when debugging problems.	2018-07-07 16:20:33 +03:00
eladash	345f92ab0a	rsx: more efficient command reading	2018-06-27 21:59:34 +03:00
kd-11	1730708f47	rsx: Rework memory protection management for framebuffer access - Avoid re-locking memory if there is no reason to do so (no draws issued) - Actively bound regions should always get written to the backing cache - Forcefully read memory during download if writes to the target have occured since last sync event	2018-06-26 20:07:20 +03:00
eladash	b456955688	rsx: fix hardcoded rsx allocation address	2018-06-24 10:57:30 +03:00
VelocityRa	dd0684b58a	overlays/perf_overlay: Make pos, font, opacity, margin configurable - Also some perf overlay refactoring	2018-06-18 22:34:26 +03:00
kd-11	6362942928	rsx: Avoid semaphore acquire deadlock	2018-05-30 13:30:23 +03:00
VelocityRa	c8d8a81ccd	overlays: Performance Overlay	2018-05-30 12:35:41 +03:00
eladash	23b380eb41	allow deallocations to unmap rsx mapped memory	2018-05-29 19:57:28 +03:00
kd-11	83f9be2524	rsx: Promote FIFO optimizations outside of strict mode - The benefits of FIFO optimizations are huge in some cases. The optimizations also do not break any tested applications so no need to disable with strict mode - A debug option is provided to disable this behaviour for testing	2018-05-29 13:54:30 +03:00
kd-11	493d4e8613	fixup - Improve invalidated region checks for performance	2018-05-24 10:36:04 +03:00
kd-11	92b5a705d8	fixup - locking	2018-05-23 19:07:08 +03:00
kd-11	b957eac6e8	rsx: Avoid calling any blocking callbacks from threads that are not rsx::thread - Defers on_notity_memory_unmapped to only run from within rsx context - Avoids passive_lock + writer_lock deadlock	2018-05-23 19:07:08 +03:00
kd-11	8fcd5c1e5a	rsx: Texture cache fixes 1. rsx: Rework section synchronization using the new memory mirrors 2. rsx: Tweaks - Simplify peeking into the current rsx::thread instance. Use a simple rsx::get_current_renderer instead of asking fxm for the same - Fix global rsx super memory shm block management 3. rsx: Improve memory validation. test_framebuffer() and tag_framebuffer() are simplified due to mirror support 4. rsx: Only write back confirmed memory range to avoid overapproximation errors in blit engine 5. rsx: Explicitly mark clobbered flushable sections as dirty to have them removed 6. rsx: Cumulative fixes - Reimplement rsx::buffered_section management routines - blit engine subsections are not hit-tested against confirmed/committed memory range Not all applications are 'honest' about region bounds, making the real cpu range useless for blit ops	2018-05-23 19:07:08 +03:00
kd-11	f6f45b8699	Native UI refactored (#4623) Refactor and improve native overlays	2018-05-20 23:05:00 +03:00
scribam	04ad49de4d	typos	2018-05-14 21:14:39 +04:00
kd-11	bff6060bd6	rsx: Improve puller state management - Properly identify puller spin primitives - Add a small wake delay after exiting a spin delay. Fixes desynchronization It seems real hw has a small delay between cell edits to commandbuffer memory at the GET address and the changes becoming visible to the DMA puller Simulated with a short busy_wait, large values will improve sync but degrade performance	2018-05-13 14:44:14 +03:00
kd-11	b7979d3f57	rsx/vk: Improvements and minor optimizations - Improve dirty state tracking affecting program state - vk: Refactor out transform constants upload into a separate channel to avoid if possible transform data uploads are quite expensive	2018-05-13 14:44:14 +03:00
kd-11	440a31ef18	rsx: Optimizations for program management	2018-05-13 14:44:14 +03:00
kd-11	a52ea7f870	rsx: Improve fragment and vertex program usage - Introduces a gpu program analyser step to examine shader contents before attempting compilation or cache search - Avoids detecting shader as being different because of unused textures having state changes - Adds better program size detection for vertex programs - Improved vertex program decompiler - Properly support CAL type instructions - Support jumping over instructions marked with a termination marker with BRA/CAL class opcodes - Fix SRC checks and abort - Fix CC register initialization - NOTE: Even unused SRC registers have to be valid (usually referencing in.POS)	2018-05-13 14:44:14 +03:00
Jake	75b40931fc	rsx: initial capture/replay functionality (#4510) * rsx: initial capture/replay functionality	2018-05-13 12:18:05 +03:00
kd-11	c5d1f30e82	rsx: Fix performance counters - Detect jump-to-self type idling	2018-04-25 19:14:36 +03:00
kd-11	a42b00488d	rsx: Texture fixes - gl/vk: Fix subresource copy/blit - gl/vk: Fix default_component_map reading - vk: Reimplement cell readback path and improve software channel decoder - Properly name the subresource layout field - its in blocks not bytes! - Implement d24s8 upload from memory correctly - Do not ignore DEPTH_FLOAT textures - they are depth textures and abide by the depth compare rules - NOTE: Redirection of 16-bit textures is not implemented yet	2018-04-25 19:14:36 +03:00
kd-11	cfd0b8a975	rsx: Fix alphakill	2018-04-05 01:06:50 +03:00
kd-11	93b2776604	rsx: Fix vertex input detection - Properly detect inline array registers vs constant value registers - Silence needless spam, 306E is 2D surface engiine, the assumption that y is multiplied by 306E pitch is not crazy	2018-04-05 01:06:50 +03:00
Jake	6d6d6fa827	dx12/vk/gl: implement use of vertex_data_base_index when calculating index	2018-03-30 13:30:04 +03:00
kd-11	7627ad04f1	rsx: Disable gamma control on WZYX textures - Gamma is seemingly used for (D/X/A)RGB only. Data textures are unaffected	2018-03-29 13:52:11 +03:00
kd-11	321c360dcb	rsx: Overhaul rendertarget sampling/shuffles - Reimplements render target views used for sampling - Optimizes access using an encoded control token - Adds proper encoding for 24-bit textures (DRGB8 -> ORGB/OBGR) - Adds proper encoding for ABGR textures (ABGR8 -> ARGB8) - Silence some compiler warnings as well - TODO: Real texture views for OGL current method is a hack	2018-03-25 13:31:06 +03:00
kd-11	9fc1740608	rsx/fp: Fragment program overhaul - Separate TXB from TXL: They are completely different! - Properly perform TMU emulation in the fragment shader. Implemens SRGB conversion and alphakill at the moment - Properly perform ROP emulation in the fragment shader. Implements FRAMEBUFFER_SRGB. While support on the chip looks to be incomplete (and wierd), it does work - Document some more bits in SHADER_CONTROL register	2018-03-25 13:31:06 +03:00
kd-11	27552891ad	rsx/fp: Improvements - Export some debug information in the free texture register space components zw Very useful when analysing renderdoc captures - Enable shadow comparison on depth as long as compare function is active and texture is uploaded for depth read Some engines (UE3) read all the components in the shader and use mul/mad with the result	2018-03-25 13:31:06 +03:00
kd-11	5f047034ae	rsx: Disable async count verification to avoid lockup due to zombie reports in ZCULL	2018-03-13 18:55:03 +03:00
kd-11	f00d9a7c7f	rssx" Halfplement alpha-to-coverage AA transparency	2018-03-13 18:55:03 +03:00
kd-11	2dce55d036	rsx: ZCULL synchronization fixes - Track asynchronous operations in RSX core - Add read barriers to force pending writes to finish. Fixes zcull delay flicker in all UE3 titles without forcing hard stall - Increase zcull latency as all writes should be synchronized now	2018-03-13 18:55:03 +03:00
kd-11	315798b1f4	rsx: ZCULL rewrite and other improvements - ZCULL unit emulation rewritten - ZCULL reports are now deferred avoiding pipeline stalls - Minor optimizations; replaced std::mutex with shared_mutex where contention is rare - Silence unnecessary error message - Small improvement to out of memory handling for vulkan and slightly bump vertex buffer heap	2018-03-13 18:55:03 +03:00
kd-11	dece1e01f4	rsx: Improve transform constants management - Removes the duplicate local_transform_constants - Resets the transform constants on every context reset - Simplifies the code abit which should make it faster - NOTE: Transform constants are persistent across context re-init events (VF5)	2018-03-13 18:55:03 +03:00
kd-11	0c8e4c0887	rsx: Improve FIFO commandlist flattening - TODO: Alot of work is still needed to execute draw commands out of order Thats the only solution to games sending many draw calls with high frequency of state changes	2018-03-13 18:55:03 +03:00
kd-11	84b8a08d26	rsx: Basic performance counters	2018-03-13 18:55:03 +03:00
kd-11	af1b13550b	rsx/vk: More optimizations - Do not bother rechecking the dirty sampler pool for hits. Its faster to create new sampler than to search the pool - Reserve some memory on vertex layout struct to reduce reallocation penalty	2018-03-13 18:55:03 +03:00
Jake	7233640cf0	rsx: add vertex data base to offset and mask before translating address	2018-03-07 16:57:20 +03:00
kd-11	bd297d079d	rsx: Minor optimizations	2018-02-16 16:14:54 +03:00
kd-11	a5500ebfa4	rsx: Fix disjoint draw range splitting - Fixes flickering and missing draws in R&C and other games such as Motorstorm Apocalypse and Okami HD when strict mode is disabled	2018-02-16 16:14:54 +03:00
Nekotekina	cce0ad0c35	Clean vm::ps3 namespace use	2018-02-09 17:49:37 +03:00
Jake	2f414f96bf	rsx: fix potential hang during thread close	2018-01-24 16:28:09 +00:00
kd-11	3d9e3a16f1	rsx/gl/vk: Fixes and optimizations - opengl driver optimization for nvidia. On nvidia glTextureBufferRange performance is horrendous -- Initialize texture buffer to whole buffer at startup and use absolute offsets to read data instead -- Over 2x performance in some cases (Resogun, TNT racers) - gl/vk: Do not flip non-existent display buffers. Fixes spec violation at boot in TNT racers demo - whitespace fixes for sys_rsx	2018-01-22 11:43:35 +03:00
kd-11	0a2992839b	rsx/gl/vk: Simulate z clipping with selective depth clamp - The scale offset matrix is fine but on real hardware the z results seem to be independent of near/far clipping distances -- If depth falls within near/far, clamp depth value to [0,1]	2018-01-19 12:03:57 +03:00
kd-11	cbc8bf01a1	cell/scheduler: Manage thread placement depending on cpu hardware	2018-01-19 12:03:57 +03:00
kd-11	71f69d1d48	rsx/overlays: Introduce 'native' HUD UI and implement some common dialogs (#4011)	2018-01-17 19:14:00 +03:00
Jake	0477f8ed3c	rsx: add log for potential source of error	2018-01-14 20:50:55 +03:00
Jake	7ca2c444cc	rsx: Fix depth clipping	2018-01-14 20:50:55 +03:00
kd-11	ee009ec99c	rsx: Robustness fixes - Track last working state and reset to it if RSX starts to desync -- This is especially useful when running vulkan since the renderer will easily outpace the rest of the system when merely recording draw commands - Ignore empty sets -- Mark empty/invalid IB sets as having 0 element counts.	2018-01-02 21:17:56 +03:00
kd-11	0d0821e914	rsx: Pause FIFO queue when changing ctrl registers	2017-12-18 10:45:37 +03:00
kd-11	90c2324e47	rsx: Program cache fixes - Reorganize storage hash vs ucode hash - Scan for actual fragment program start in case leading NOPed code precedes the actual instructions -- e.g FEAR2 Demo has over 32k of padding before actual program code that messes up hashes	2017-12-04 18:22:18 +03:00
kd-11	90a3f3af30	rsx: Discard queue if RET is found without CALL	2017-12-01 21:00:50 +03:00
kd-11	de5a4fe083	rsx: Reimplement depth <-> RGBA reinterpretation code - Implements proper channel order for fp24-ARGB8 conversion - Takes swizzle remap into account when reconstructing source bytes	2017-12-01 21:00:50 +03:00
kd-11	ccc0383f75	vulkan: Implement overlay shader passes - Implements vk::overlay_pass and vk::depth_convert_pass - Also added a sanity check in RSX core for depth replace shaders	2017-12-01 21:00:50 +03:00
kd-11	680ca1d12a	rsx: Zcull refactoring and vulkan implementation	2017-12-01 21:00:50 +03:00
kd-11	0aaae000b3	rsx: Minor improvements	2017-12-01 21:00:50 +03:00
Zion Nimchuk	3a9ae2df9e	silence warnings in RSX stuff	2017-11-30 18:07:19 +03:00
Nekotekina	dbc9bdfe02	Implement set_ideal_processor_core (linux)	2017-11-15 21:00:02 +03:00
kd-11	1fa18757fc	rsx: Implement render-to-cubemap; Also simplify unnormalized samplers [WIP, DELETE SHADER CACHE, VERY SLOW] - Enables real-time cubemap reflections - TODO: Vulkan is broke; rsx is very slow with this feature	2017-11-08 13:15:34 +03:00
kd-11	0961a43997	rsx: Implement 1D<->2D image type casts	2017-11-08 13:15:34 +03:00
kd-11	bbcb6b6851	rsx: Fbo fixes 2 - Use AA mode to predict surface compression. Compression mode is useless without AA activated - Rewrites most image subresource fetch routines to use the new heuristic - Fix rsx::thread::find_tile. FEED000(X) can be substituted for (X) in the code -- Fixes alot of failures when looking for tiled regions rsx: Fix antialiased unnormalized coords - scaling factors are inverse to allow proper coordinates to be computed in fs	2017-11-08 13:15:34 +03:00
kd-11	173d05b54f	rsx: Optimizations - Reimplement fragment program fetch and rewrite texture upload mechanism -- All of these steps should only be done at most once per draw call -- Eliminates continously checking the surface store for overlapping addresses as well addenda - critical fixes - gl: Bind TIU before starting texture operations as they will affect the currently bound texture - vk: Reuse sampler objects if possible - rsx: Support for depth resampling for depth textures obtained via blit engine vk/rsx: Minor fixes - Fix accidental imageview dereference when using WCB if texture memory occupies FB memory - Invalidate dirty framebuffers (strict mode only) - Normalize line endings because VS is dumb	2017-11-08 13:15:34 +03:00
Jake	e0d1ac676e	rsx: invalidate surface store address when tile is unbound	2017-10-28 12:46:20 +03:00
Jake	626b9f93c4	rsx: make dmactrl get 'readonly'	2017-10-28 12:46:20 +03:00
kd-11	9c9495621c	rsx: Fix critical bug concerning transient data layout in memory	2017-10-26 00:35:45 +03:00

456 commits