rpcsx

mirror of https://github.com/RPCSX/rpcsx.git synced 2026-03-23 13:45:18 +01:00

Author	SHA1	Message	Date
kd-11	eed32cf3a4	rsx: Decompiler fixups and improvements - Fix 2D coordinate sampling of W coordinate. W is actually HPOS.w and not 1. Z is however always 0. - Optimize register usage a bit Disassembling compiled SPV shows that global declaration results in less ops than using inout modifiers. Modifiers generate extra mov instructions.	2019-08-26 20:03:31 +03:00
kd-11	3e28e4b1e0	rsx/decompiler: Restructure program register behavior - Fix reading of varying registers in FP Different registers have different behavior - Always write to varying registers. If a register is not written to, it is initialized to (0, 0, 0, 1) - Reimplements two-sided lighting correctly without hacks - Also bumps shader cache version	2019-08-26 20:03:31 +03:00
kd-11	fe6ff8622a	rsx: Decompiler fixups for conditional execution - Cond actually obeys vector mask	2019-08-26 20:03:31 +03:00
kd-11	f9aea076ae	rsx: Implement depth_buffer_float support. - Since this is transparent to the application at all time, it only becomes a problem when doing memory transfer or DEPTH->RGBA conversion in shaders.	2019-08-26 20:03:31 +03:00
kd-11	c67c97844e	rsx: Fixup for blit engine range calculations	2019-08-21 21:17:15 +03:00
kd-11	5d1b7eb945	rsx: Fix reference leaks in texture_cache<->surface_cache communication - Properly commit orphaned blocks not invalidating existing cache structures - Do not ignore overwritten objects when committing as unprotected fbo. Avoids stale references to invalidated surface objects.	2019-08-21 21:17:15 +03:00
kd-11	141072023b	rsx: Fix handling of ARGB8 memory - Load into memory as straightforward BGRA - Fixes a bug in vulkan caused by byte shuffling in blit engine vs shader access - Removes the need for memory shuffling when transferring into a rendertarget	2019-08-21 21:17:15 +03:00
kd-11	9cd5325962	rsx: Free memory 'held hostage' by storage sections in the surface cache - Once the memory has been captured by another surface, release the allocation	2019-08-21 21:17:15 +03:00
kd-11	be98554b40	rsx: Fix surface split logic - Calculations are supposed to be done based on the properties of the outgoing surface	2019-08-21 21:17:15 +03:00
kd-11	67dac94704	rsx/fp: Zero-initialize FragDepth register to match hw	2019-08-21 21:17:15 +03:00
kd-11	dca29def5e	rsx: Temporary workaround for race condition in blit engine	2019-08-18 20:45:48 +03:00
kd-11	5e299111cc	rsx/vk: Restructure surface access barriers and implement RCB/RDB - Implements render target data load (aka Read Color Buffer/Read Depth Buffer) - Refactors vulkan surface barrier to be much cleaner. - Removes redundant surface barrier invocations after doing a merged load from surface cache. - Adds explicit access modes when gathering surfaces from cache.	2019-08-18 20:45:48 +03:00
kd-11	dfe709d464	rsx: Surface cache restructuring - Further improve aliased data preservation by unconditionally scanning. It is possible for cache aliasing to occur when doing memory split. - Also sets up for RCB/RDB implementation	2019-08-18 20:45:48 +03:00
kd-11	1de90bdb1f	rsx: Improve aliased data preservation - Carve out inherited region if any - Perform pitch compatibility test before assigning old_surface	2019-07-27 16:09:21 +03:00
kd-11	e2574ff100	rsx: Support CSAA transparency without multiple rasterization samples enabled	2019-07-19 15:49:08 +03:00
kd-11	ea2f4d57fa	rsx: Fixups	2019-07-17 13:29:42 +03:00
kd-11	113a49e00c	rsx: Handle cyclic references when doing memory inheritance	2019-07-17 13:29:42 +03:00
kd-11	34b06453f9	rsx: Handle lost data due to unused data sections - After splitting, the sections may not be referenced at all for anything other than just pixel storage - In such cases, either merge down or sample from the upstream source instead	2019-07-17 13:29:42 +03:00
kd-11	009e01a347	rsx: Set up for multi-section inheritance	2019-07-17 13:29:42 +03:00
kd-11	fc09572648	rsx: Implement texel border decode - Texel borders are no longer actually supported in modern APIs - Removes the border texels and uses border color instead which is incorrect but should work fine	2019-07-11 13:22:13 +03:00
kd-11	d8f753f1e8	rsx: Do not allow framebuffer surfaces that exceed their allocated pitch dimensions - Truncate surfaces to forcefully fit inside the declared region	2019-07-11 13:22:13 +03:00
kd-11	c072c511a1	rsx: Add support for slice padding rows when gathering slices for cubemap/3d	2019-07-09 16:27:59 +03:00
kd-11	ad10eb391e	vk: Reuse discarded memory whenever possible instead of recreating new objects - Memory allocations are surprisingly expensive when spammed	2019-07-03 15:52:16 +03:00
kd-11	71e809a78b	rsx: Implement dma abort in case of a reset after misprediction	2019-07-03 15:52:16 +03:00
Eladash	43f919c04b	Fixup after #6143 (#6146) vm::spu max address was overflowing resulting in issues, so cast to u64 where needed. Fixes #6145. Use vm::get_addr instead of manually subtracting vm::base(0) from pointer in texture cache code. Prefer std::atomic_thread_fence over _mm_?fence(), adjust usage to be more correct. Used sequentially consistent ordering in semaphore_release for TSX path as well. Improved memory ordering for sys_rsx_context_iounmap/map. Fixed sync bugs in HLE gcm because of not using atomic instructions. Use release memory barrier in lwsync for PPU LLVM, according to this xbox360 programming guide lwsync is a hw release memory barrier. Also use release barrier where lwsync was originally used in liblv2 sys_lwmutex and cellSync. Use acquire barrier for isync instruction, see https://devblogs.microsoft.com/oldnewthing/20180814-00/?p=99485	2019-06-29 18:48:42 +03:00
Eladash	1ee7b91646	Refactoring (#6143) Prefer vm::ptr<>::ptr over vm::get_addr. Prefer vm::_ptr/base over vm::g_base_addr with offset. Added methods atomic_t<>::bts and atomic_t<>::btr. Removed obsolete rsx::thread::Read/WriteIO32 methods. Removed wrong check in semaphore_release. Added handling for PUTRx commands for RawSPU MFC proxy. Prefer overloaded methods of v128 instead of _mm_... in VPKSHUS ppu interpreter precise. Fixed more potential overflows that may result in wrong behaviour. Added io/size alignment check for sys_rsx_context_iounmap. Added rsx::constants::local_mem_base which represents RSX local memory base address. Removed obsolete rsx::thread::main_mem_addr/ioSize/ioAddress members.	2019-06-29 01:27:49 +03:00
JohnHolmesII	ebb1ae6408	Properly ignore SIMD macros to avoid warning	2019-06-28 01:40:52 +03:00
JohnHolmesII	be521ff0ab	Fix warnings related to parentheses	2019-06-25 20:36:32 -07:00
kd-11	6a32f716db	rsx: Reimplement vertex layout streaming - Remove string comparisons from the hot-path! - Use attribute streaming and push constants to avoid forcing a descriptor block copy every other draw call/pass. While this isn't so bad on nvidia cards, it makes AMD cards a slideshow.	2019-06-25 20:50:54 +03:00
kd-11	358169507c	rsx: Use SSE to accelerate index buffer uploads	2019-06-25 20:50:54 +03:00
kd-11	c9501b60ab	rsx: Use explicit fma for MAD emulation	2019-06-25 20:50:54 +03:00
kd-11	6be7c58fa4	glsl: Refactoring, cleanup and optimizations - Avoid generating unused code - Reduce GPR usage in emitted code	2019-06-25 20:50:54 +03:00
Lassi Hämäläinen	c963c51a60	Remove unnecessary header includes - Manually removed a lot of unneeded #includes to clean code and reduce compilation time - Reordered some of the #includes to be in a more logical order	2019-06-25 17:11:10 +03:00
Lassi Hämäläinen	e9e87b8bd9	Add missing #includes to header files - Multiple header files were missing #includes to other headers that were used in the header. Correct header was included in correct order in source files which caused everything to compile. - Added missing #includes so header files correctly include all their dependencies and fixes problems with IDEs being unable to parse headers correctly due to missing symbols	2019-06-25 17:11:10 +03:00
kd-11	86119f58d6	rsx: Typo fix	2019-06-14 16:19:52 +03:00
kd-11	9d166c5bed	rsx: Force invalidate of children by issuing a resolve notification whenever the parent is written to - Fixes successive reads of an antialiased surface that is still bound between reads	2019-06-14 16:19:52 +03:00
kd-11	8a1cf2c913	rsx: Attempt to reduce stencil load overhead for nvidia cards	2019-06-14 16:19:52 +03:00
kd-11	c655036920	rsx/fp: Ease pressure on fragment shaders when emulating clamp16 - TODO: Option to completely skip clamping in some architectures as it is not needed in most games - Mostly affects older GPUs that do not have access to native fp16	2019-06-14 16:19:52 +03:00
kd-11	bca5f94b3f	rsx: Add option to toggle MSAA	2019-06-14 16:19:52 +03:00
kd-11	ea8409dcfd	rsx: Re-enable optional sample-to-pixel transformation	2019-06-14 16:19:52 +03:00
kd-11	acb14320da	rsx: Fixup for resolution scaling support	2019-06-14 16:19:52 +03:00
kd-11	4a5bbba277	rsx: Enable MSAA - vk: Enable depth buffer resolve+unresolve - vk: Add AMD stenciling extension support - rsx: Temporarily disables MSAA-compatible hacks such as transparency AA - TODO: Add paths to optionally disable MSAA	2019-06-14 16:19:52 +03:00
kd-11	f6f3b40ecc	rsx: Fix AA coordinate transforms - Requires native_pitch value to take samples into account	2019-06-14 16:19:52 +03:00
kd-11	655eff29e8	rsx: Refactoring and cleanup after d3d12 separation - Remove deprecated functionality - Refactor to share code between common routines	2019-06-14 16:19:52 +03:00
kd-11	0d906d6974	rsx: Remove surface aa_mode hacks	2019-06-14 16:19:52 +03:00
scribam	13671d9684	rsx: Apply Clang-Tidy fix "modernize-loop-convert" + const when relevant	2019-06-12 15:11:52 +03:00
scribam	1e327ad31b	rsx: Apply Clang-Tidy fix "readability-avoid-const-params-in-decls"	2019-06-12 15:11:52 +03:00
scribam	44265aa27d	rsx: Apply Clang-Tidy fix "modernize-use-equals-default"	2019-06-12 15:11:52 +03:00
scribam	635695ac78	rsx: Apply Clang-Tidy fix "modernize-use-emplace"	2019-06-12 15:11:52 +03:00
scribam	cba828384d	rsx: Apply Clang-Tidy fix "modernize-pass-by-value"	2019-06-12 15:11:52 +03:00

573 commits