rpcsx

mirror of https://github.com/RPCSX/rpcsx.git synced 2025-12-06 07:12:14 +01:00

Author	SHA1	Message	Date
kd-11	2985a39d2e	rsx: Rewrite async decompiler	2020-03-09 14:59:25 +03:00
Nekotekina	92e3eaf3ff	Fix signed-unsigned comparisons and mark warning as error (part 2).	2020-02-19 22:54:58 +03:00
Nekotekina	c0f80cfe7a	Use attributes for LIKELY/UNLIKELY Remove LIKELY/UNLIKELY macro.	2020-02-05 10:42:34 +03:00
Nekotekina	15391f45d0	Modernize RSX logging (rsx_log variable)	2020-02-01 11:52:22 +03:00
Maksim Derbasov	1abdee242a	small improvement (#7288 ) * small improvement * comments addressed Co-authored-by: kd-11 <15904127+kd-11@users.noreply.github.com>	2020-01-22 12:28:48 +00:00
kd-11	0b2f9f0f17	rsx: Add support for delayed shader discard. - Noticed a glitch on AMD hw and windows drivers where discard seems to affect entire 4x4 cells. - Dead fragments (outside the primitive boundary) could have their discards trigger as they do not have proper access to variables. - This introduces dead fragments along triangle edges, causing a diagonal line pattern across the screen that is very annoying.	2019-10-22 13:44:49 +03:00
kd-11	901942f24a	rsx: Replace pointless f32[4] restriction on texture parameters. - Use a struct instead to improve readability and remove pointless OpBitCast	2019-10-22 13:44:49 +03:00
kd-11	3c44065684	gl: Fix copypasta - MSAA is still unimplemented in OGL	2019-10-20 21:38:40 +03:00
kd-11	7aed9c3f13	gl: Add missing input declarations for 2-sided lighting	2019-09-30 21:52:43 +03:00
kd-11	eed32cf3a4	rsx: Decompiler fixups and improvements - Fix 2D coordinate sampling of W coordinate. W is actually HPOS.w and not 1. Z is however always 0. - Optimize register usage a bit Disassembling compiled SPV shows that global declaration results in less ops than using inout modifiers. Modifiers generate extra mov instructions.	2019-08-26 20:03:31 +03:00
kd-11	3e28e4b1e0	rsx/decompiler: Restructure program register behavior - Fix reading of varying registers in FP Different registers have different behavior - Always write to varying registers. If a register is not written to, it is initialized to (0, 0, 0, 1) - Reimplements two-sided lighting correctly without hacks - Also bumps shader cache version	2019-08-26 20:03:31 +03:00
kd-11	6be7c58fa4	glsl: Refactoring, cleanup and optimizations - Avoid generating unused code - Reduce GPR usage in emitted code	2019-06-25 20:50:54 +03:00
Lassi Hämäläinen	c963c51a60	Remove unnecessary header includes - Manually removed lot of unneeded #includes to clean code and reduce compilation time - Reordered some of the #includes to be in more logical order	2019-06-25 17:11:10 +03:00
scribam	370dcd9d6e	rsx: Apply Clang-Tidy fix "readability-simplify-subscript-expr"	2019-06-12 15:11:52 +03:00
scribam	44265aa27d	rsx: Apply Clang-Tidy fix "modernize-use-equals-default"	2019-06-12 15:11:52 +03:00
scribam	635695ac78	rsx: Apply Clang-Tidy fix "modernize-use-emplace"	2019-06-12 15:11:52 +03:00
scribam	8f2647555a	rsx: Apply Clang-Tidy fix "readability-redundant-string-init"	2019-06-12 15:11:52 +03:00
scribam	c4667133c4	gl/vk: Add constexpr to varying_registers and sync functions between the two backends	2019-06-12 10:59:31 +01:00
kd-11	60f3059d22	rsx: Compensate for nvidia's low precision attribute interpolation - The hw generates inaccurate values when doing perspective-correct interpolation of vertex output attributes and makes the comparison (a == b) fail even when they are a fixed constant value. - Increase equality tolerance when doing comparisons in fragment shaders for NV cards only to work around this issue. - Teepo fix	2019-04-25 16:23:05 +03:00
kd-11	463b1b220d	rsx: Improve accuracy of shadow compare Ops when non-integer depth formats are used - The fixed-point D24S8 format does special Z clamping during compare which matches PS3 behaviour - D32S8 is a floating point format and comparison with Dref > 1 always fails causing black edges/borders	2019-04-25 16:23:05 +03:00
kd-11	06a85f00d1	rsx: Shader decompiler cleanup and improvements - Improve support for float16_t by minimizing mixed inputs to functions (ambiguous overloads) - Minimize amount of downcasts in code by using opcode flags - Re-enable float16_t support for vulkan	2019-04-25 16:23:05 +03:00
kd-11	a668560c68	rsx: Use native half float types if available - Emulating f16 with f32 is not ideal and requires a lot of value clamping - Using native data type can significantly improve performance and accuracy - With openGL, check for the compatible extensions NV_gpu_shader5 and AMD_gpu_shader_half_float - With Vulkan, enable this functionality in the deviceFeatures if applicable. (VK_KHR_shader_float16_int8 extension) - Temporarily disable hw fp16 for vulkan	2019-04-25 16:23:05 +03:00
Nekotekina	bd9131ae1c	Implement fs::get_cache_dir Win32: equal to config dir for now Linux: respect XDG_CACHE_HOME if specified OSX: possibly incomplete	2019-01-13 14:45:36 +03:00
kd-11	4b79ef1ad9	rsx: Implement stencil mirror views - Implements a mirror view of D24S8 data that accesses the stencil components. Finishes the implementation of TEX2D_DEPTH_RGBA as the stencil component was previously missing from the reconstructed data - Add a few missing destructors Image classes are inherited a lot and I forgot to make the dtors virtual	2018-12-24 09:05:19 +03:00
kd-11	1ad76ad331	rsx: Restructure programs - Also re-enable pipeline optimizations	2018-11-30 23:51:25 +03:00
scribam	d7bb59cd99	c++17: use std::size	2018-09-06 13:15:59 +03:00
eladash	f349695a75	Rsx: rewrite address translation	2018-08-13 16:16:34 +03:00
eladash	3e433ef05c	create the shaderlog dir in Emu.Init()	2018-06-21 22:54:08 +04:00
scribam	04ad49de4d	typos	2018-05-14 21:14:39 +04:00
kd-11	321c360dcb	rsx: Overhaul rendertarget sampling/shuffles - Reimplements render target views used for sampling - Optimizes access using an encoded control token - Adds proper encoding for 24-bit textures (DRGB8 -> ORGB/OBGR) - Adds proper encoding for ABGR textures (ABGR8 -> ARGB8) - Silence some compiler warnings as well - TODO: Real texture views for OGL current method is a hack	2018-03-25 13:31:06 +03:00
kd-11	9fc1740608	rsx/fp: Fragment program overhaul - Separate TXB from TXL: They are completely different! - Properly perform TMU emulation in the fragment shader. Implemens SRGB conversion and alphakill at the moment - Properly perform ROP emulation in the fragment shader. Implements FRAMEBUFFER_SRGB. While support on the chip looks to be incomplete (and wierd), it does work - Document some more bits in SHADER_CONTROL register	2018-03-25 13:31:06 +03:00
kd-11	4804efc17d	rsx: Clear up confusion on depth writes. According to the NV_fragment_program spec, its not feasible to have 16-bit depth wries NOTE: NV_fragement_program precedes NV_fragment_program2 which is very close to what RSX consumes. It is hardware from that era afterall	2018-03-13 18:55:03 +03:00
Jake	3a9428a88e	rsx: Fix alpha test on VK/GL Moving alpha test outside of fp_main. This enforces alpha test is still done even if shader asm has early return	2018-03-09 18:06:20 +03:00
kd-11	33bcdd476c	glsl/fp/vp: Avoid shader clutter - Do not add unused subroutines in shaders unless necessary -- makes shaders easier to read and disassembled spir-v has less clutter - glsl: Replace switch block with lookup table	2018-01-30 21:16:43 +03:00
kd-11	320fb5f94c	rsx/fp: Fix alphakill	2017-12-22 20:08:14 +03:00
kd-11	cdd4fd9867	rsx/fp: Explicitly insert global functions. - Functions such as pack/unpack ops must exist before the shared gather functions are declared	2017-12-04 18:22:18 +03:00
kd-11	4e9160104a	rsx/vk/gl: Cleanup and refector glsl::getFunctionImpl - Both backends now generate very similar code	2017-11-08 13:15:34 +03:00
kd-11	1fa18757fc	rsx: Implement render-to-cubemap; Also simplify unnormalized samplers [WIP, DELETE SHADER CACHE, VERY SLOW] - Enables real-time cubemap reflections - TODO: Vulkan is broke; rsx is very slow with this feature	2017-11-08 13:15:34 +03:00
kd-11	fbb7186e66	rsx/gl: Addendum - Fix fragment shader to consume texture scale parameters	2017-11-08 13:15:34 +03:00
kd-11	12ab03b0b5	rsx/gl: Implement resolution scaling rsx: Revise wpos calculation to take resolution scale into account	2017-10-09 20:25:41 +03:00
kd-11	571dbfb7b1	rsx: Texture cache improvements - Limits buffer size to min 720 in the Y axis (1024 section causes conflicts in some cases - TODO) rsx: Fixups to allow large textures for blit operation - Also includes checks for both leaking sections and blit regions for vulkan hotfix for hanging when using WCB addendum - unlock both ro and no blocks before attempting to copy memory blocks gl: Fixups for ARB_explicit_uniform_location - Forces glsl v 430 to make use of the extension rsx/vk: Rework texture cache to minimize recursive access violations - Also modifies the vulkan commandbuffer begin/end/submit mechanism gl: Fix cached_texture_section::is_flushable to take memory protection into account rsx: Fix blit dst offset calculation	2017-09-21 16:17:06 +03:00
kd-11	07c83f6e44	gl: cleanup; fix program linkage on mesa using GL_ARB_explicit_uniform_location, also make use of ARB_multidraw	2017-09-21 16:17:06 +03:00
kd-11	73312fc363	rsx: Several fixes and improvements - Do not ignore non-centered pixel blitting - Register method ac00+16 - Bump texture memory heap to account for GPU texture scaling requirements (vulkan) - Explicit MRT location index output to better convey intent (openGL)	2017-09-21 16:17:06 +03:00
kd-11	f71f67c4ff	rsx: Make fragment state dynamic to reduce shader permutations	2017-08-26 21:53:54 +03:00
kd-11	6a707f515e	vk/gl: Factorize shared GLSL code - prep vulkan for shared glsl backend	2017-08-16 23:58:30 +03:00
kd-11	a69f19a9ab	rsx/decompilers: Avoid std::endl like the plague. It adds extra processing overhead	2017-06-29 13:13:19 +03:00
kd-11	b2e906f4cc	rsx: Code cleanup. Fixes several dozen warnings - Wrap unused parameters as comments to prevent C1400 - Fix sized variable conversions with explicit casts	2017-06-22 23:36:15 +03:00
kd-11	110974af0b	vk/gl: Fix sampling of shadow2D textures	2017-06-22 23:36:15 +03:00
kd-11	6a9eef0382	rsx/gl/vk: Enable use of native PCF shadows	2017-06-22 23:36:15 +03:00
shameful	e93c4c42b6	fix for shader fog mode exponential2_abs (#2781 ) * Update VKFragmentProgram.cpp added missing exponent parameter * fixed misplaced exponent in VKFragmentProgram.cpp parameter that belonged to pow() was being passed to exp() instead, causing the shader compilation to fail * fix for opengl fog_mode exponential2 same fix as the vulkan version * directx fog_mode exponential2 fix misplaced parameter * directx fog_mode exponential2_abs fix * vulkan fog_mode exponential2_abs fix * opengl fog_mode exponential2 fix	2017-05-14 12:56:24 +08:00

1 2 3

111 commits