rpcsx

mirror of https://github.com/RPCSX/rpcsx.git synced 2026-01-06 16:49:59 +01:00

Author	SHA1	Message	Date
kd-11	60f3059d22	rsx: Compensate for nvidia's low precision attribute interpolation - The hw generates inaccurate values when doing perspective-correct interpolation of vertex output attributes and makes the comparison (a == b) fail even when they are a fixed constant value. - Increase equality tolerance when doing comparisons in fragment shaders for NV cards only to work around this issue. - Teepo fix	2019-04-25 16:23:05 +03:00
kd-11	463b1b220d	rsx: Improve accuracy of shadow compare Ops when non-integer depth formats are used - The fixed-point D24S8 format does special Z clamping during compare which matches PS3 behaviour - D32S8 is a floating point format and comparison with Dref > 1 always fails causing black edges/borders	2019-04-25 16:23:05 +03:00
kd-11	06a85f00d1	rsx: Shader decompiler cleanup and improvements - Improve support for float16_t by minimizing mixed inputs to functions (ambiguous overloads) - Minimize amount of downcasts in code by using opcode flags - Re-enable float16_t support for vulkan	2019-04-25 16:23:05 +03:00
kd-11	a668560c68	rsx: Use native half float types if available - Emulating f16 with f32 is not ideal and requires a lot of value clamping - Using native data type can significantly improve performance and accuracy - With openGL, check for the compatible extensions NV_gpu_shader5 and AMD_gpu_shader_half_float - With Vulkan, enable this functionality in the deviceFeatures if applicable. (VK_KHR_shader_float16_int8 extension) - Temporarily disable hw fp16 for vulkan	2019-04-25 16:23:05 +03:00
eladash	6f76e34104	rsx: Fix race on clearing native_ui vs emu_requested flag	2019-04-20 01:04:41 +03:00
kd-11	3249000511	rsx: Improvements to texture scanning - Removes CPU-only transforms that broke GPU-side code. -- Channels in GPU compute are laid out in cell-order, but CPU was uploading in favorable order and compensating with swizzles. -- This leads to 2 different layouts depending on the location of the data (CPU vs GPU) - Implement R8G8_R8B8 interleaved format decode - General improvements	2019-04-09 13:40:54 +03:00
kd-11	74eeacd091	vk/gl: Improve memory tag sync and test - Properly pass parameters such as rsx-pitch to the surface store - Do not crash if a surface fails verification in flip, use fall-back instead	2019-03-17 21:50:11 +03:00
kd-11	04dda44225	rsx: Properly generate render target data with all parameters provided - Build-up to variable-sized framebuffers and AA implementation - Also allows accurate range calculation for our hit testing	2019-03-10 16:09:05 +03:00
kd-11	417a2e6731	rsx: Refactor index buffers - Index offset is ignored anyway and only used to calculate vertex attribute divisor index - Specialized optimization for untouched xfer without primitive restart	2019-01-25 14:34:22 +03:00
Nekotekina	bd9131ae1c	Implement fs::get_cache_dir Win32: equal to config dir for now Linux: respect XDG_CACHE_HOME if specified OSX: possibly incomplete	2019-01-13 14:45:36 +03:00
kd-11	15488eb247	rsx: Avoid unnecessarily touching framebuffer memory - Do not bind companion framebuffer when clearing single aspect; let the contest mechanism sort it out instead - Do not prematurely tag framebuffers, instead only do so at write-confirmation time. Should avoid false tagging if setup does not allow a render to occur.	2019-01-06 10:44:40 +03:00
kd-11	696b91cb9b	rsx: Reimplement conditional execution in shaders - Per-channel conditional execution introduces RAW hazards all over the place - Its cheaper to process both branches and select between the two - Also improves ShaderVariable functionality to allow functionality such as match_size and taking complex variables as inputs	2018-12-24 09:05:19 +03:00
kd-11	677b16f5c6	rsx: Fixups - Also fix visual corruption when using disjoint indexed draws - Refactor draw call emit again (vk) - Improve execution barrier resolve - Allow vertex/index rebase inside begin/end pair - Add ALPHA_TEST to list of excluded methods [TODO: defer raster state] - gl bringup - Simplify - using the simple_array gets back a few more fps :)	2018-11-30 23:51:25 +03:00
kd-11	e01d2f08c9	rsx: Refactor FIFO - Removes fifo structures from common RSXThread - Sets up a dedicated FIFO controller - Allows for configurable queue optimizations	2018-11-30 23:51:25 +03:00
Nekotekina	1b37e775be	Migration to named_thread<> Add atomic_t<>::try_dec instead of fetch_dec_sat Add atomic_t<>::try_inc GDBDebugServer is broken (needs rewrite) Removed old_thread class (former named_thread) Removed storing/rethrowing exceptions from thread Emu.Stop doesn't inject an exception anymore task_stack helper class removed thread_base simplified (no shared_from_this) thread_ctrl::spawn simplified (creates detached thread) Implemented overrideable thread detaching logic Disabled cellAdec, cellDmux, cellFsAio SPUThread renamed to spu_thread RawSPUThread removed, spu_thread used instead Disabled deriving from ppu_thread Partial support for thread renaming lv2_timer... simplified, screw it idm/fxm: butchered support for on_stop/on_init vm: improved allocation structure (added size)	2018-10-19 22:22:35 +03:00
eladash	83b6c98563	rsx: Fix u16 index arrays overflow Force u32 index array destinations to avoid overflows when adding vertex base index.	2018-10-08 16:39:47 +03:00
Lassi Hämäläinen	7aef811ff7	CMake: Refactor CMake build (#5032 ) * CMake: Refactor build to multiple libraries - Refactor CMake build system by creating separate libraries for different components - Create interface libraries for most dependencies and add 3rdparty::* ALIAS targets for ease of use and use them to try specifying correct dependencies for each target - Prefer 3rdparty:: ALIAS when linking dependencies - Exclude xxHash subdirectory from ALL build target - Add USE_SYSTEM_ZLIB option to select between using included ZLib and the ZLib in CMake search path * Add cstring include to Log.cpp * CMake: Add 3rdparty::glew interface target * Add Visual Studio CMakeSettings.json to gitignore * CMake: Move building and finding LLVM to 3rdparty/llvm.cmake script - LLVM is now built under 3rdparty/ directory in the binary directory * CMake: Move finding Qt5 to 3rdparty/qt5.cmake script - Script has to be included in rpcs3/CMakeLists.txt because it defines Qt5::moc target which isn't available in that folder if it is included in 3rdparty directory - Set AUTOMOC and AUTOUIC properties for targets requiring them (rpcs3 and rpcs3_ui) instead of setting CMAKE_AUTOMOC and CMAKE_AUTOUIC so those properties are not defined for all targets under rpcs3 dir * CMake: Remove redundant code from rpcs3/CMakeLists.txt * CMake: Add BUILD_LLVM_SUBMODULE option instead of hardcoded check - Add BUILD_LLVM_SUBMODULE option (defaults to ON) to allow controlling usage of the LLVM submodule. - Move option definitions to root CMakeLists * CMake: Remove separate Emu subtargets - Based on discussion in pull request #5032, I decided to combine subtargets under Emu folder back to a single rpcs3_emu target * CMake: Remove utilities, loader and crypto targets: merge them to Emu - Removed separate targets and merged them into rpcs3_emu target as recommended in pull request (#5032) conversations. Separating targets probably later in a separate pull request * Fix relative includes in pad_thread.cpp * Fix Travis-CI cloning all submodules needlessly	2018-09-18 13:07:33 +03:00
scribam	343656f66d	cleanup: remove unnecessary return and namespace declaration	2018-09-06 13:15:59 +03:00
scribam	d7bb59cd99	c++17: use std::size	2018-09-06 13:15:59 +03:00
Nekotekina	ca5158a03e	Cleanup semaphore<> (sema.h) and mutex.h (shared_mutex) Remove semaphore_lock and writer_lock classes, replace with std::lock_guard Change semaphore<> interface to Lockable (+ exotic try_unlock method)	2018-09-03 23:00:36 +03:00
kd-11	85d38b5751	d3d12: restore working graphics	2018-09-03 18:24:20 +03:00
kd-11	6399833182	rsx: Fix endianness order when immediate mode register is updated, but used as register lookup - Simplify the code by unifying all the register-backed memory	2018-09-03 18:24:20 +03:00
Lassi Hämäläinen	79cf2832ae	Remove Utilities/variant.hpp and use C++17 variant - Remove also Utilities/variant_visitor.hpp - Fix variant and variant_visitor usages and #includes	2018-08-31 17:49:59 +04:00
Nekotekina	1c6c24f8ac	Update GSL and yaml-cpp submodules	2018-08-25 01:15:47 +03:00
eladash	f349695a75	Rsx: rewrite address translation	2018-08-13 16:16:34 +03:00
kd-11	e7f30640ef	rsx: Async shader compilation - Defer compilation process to worker threads - vulkan: Fixup for graphics_pipeline_state. Never use struct assignment operator on vk** structs due to padding after sType member (4 bytes)	2018-07-14 15:19:56 +03:00
kd-11	fa55a8072c	rsx: Improve vertex textures support - Adds proper support for vertex textures, including dimensions other than 2D textures - Minor analyser fixup, removes spurious 'analyser failed' errors - Minor optimizations for program state tracking	2018-07-12 18:02:28 +03:00
kd-11	d78957d1cf	rsx/vp: CodeGen improvements - Fix double destination writes on conditional write masking - Fix codegen to simplify simple scalar comparisons vs vector functions	2018-07-07 16:20:33 +03:00
kd-11	2c34195954	rsx/vp: Discard broken vertex programs with no writes to POS register	2018-07-07 16:20:33 +03:00
kd-11	2ca935a26b	vp: Improve vertex program analyser - Adds dead code elimination - Fix absolute branch target addresses to take base address into account - Patch branch targets relative to base address to improve hash matching - Bumps shader cache version - Enables shader logging option to write out vertex program binary, helpful when debugging problems.	2018-07-07 16:20:33 +03:00
kd-11	1730708f47	rsx: Rework memory protection management for framebuffer access - Avoid re-locking memory if there is no reason to do so (no draws issued) - Actively bound regions should always get written to the backing cache - Forcefully read memory during download if writes to the target have occured since last sync event	2018-06-26 20:07:20 +03:00
kd-11	3150619320	rsx: Preserve read AA state separate from write AA state - Some applications (e.g Backbreaker) use an evil hack to resolve MSAA. The application respecifies a formerly AA region as a region with no AA then performs a framebuffer feedback lookup. The old memory keeps AA during read, but writes back to itself with AA resolved. This is evil on several levels but it just happens to work on PS3	2018-06-08 22:17:50 +03:00
kd-11	6362942928	rsx: Avoid semaphore acquire deadlock	2018-05-30 13:30:23 +03:00
scribam	04ad49de4d	typos	2018-05-14 21:14:39 +04:00
kd-11	b7979d3f57	rsx/vk: Improvements and minor optimizations - Improve dirty state tracking affecting program state - vk: Refactor out transform constants upload into a separate channel to avoid if possible transform data uploads are quite expensive	2018-05-13 14:44:14 +03:00
Jake	6d6d6fa827	dx12/vk/gl: implement use of vertex_data_base_index when calculating index	2018-03-30 13:30:04 +03:00
kd-11	9fc1740608	rsx/fp: Fragment program overhaul - Separate TXB from TXL: They are completely different! - Properly perform TMU emulation in the fragment shader. Implemens SRGB conversion and alphakill at the moment - Properly perform ROP emulation in the fragment shader. Implements FRAMEBUFFER_SRGB. While support on the chip looks to be incomplete (and wierd), it does work - Document some more bits in SHADER_CONTROL register	2018-03-25 13:31:06 +03:00
pauls-gh	fd8d2ecbf4	Remove Volume Texture Compression (VTC) tiling for Vulkan, DX12 and ATI (OpenGL).	2018-03-23 12:01:30 +03:00
kd-11	315798b1f4	rsx: ZCULL rewrite and other improvements - ZCULL unit emulation rewritten - ZCULL reports are now deferred avoiding pipeline stalls - Minor optimizations; replaced std::mutex with shared_mutex where contention is rare - Silence unnecessary error message - Small improvement to out of memory handling for vulkan and slightly bump vertex buffer heap	2018-03-13 18:55:03 +03:00
kd-11	4804efc17d	rsx: Clear up confusion on depth writes. According to the NV_fragment_program spec, its not feasible to have 16-bit depth wries NOTE: NV_fragement_program precedes NV_fragment_program2 which is very close to what RSX consumes. It is hardware from that era afterall	2018-03-13 18:55:03 +03:00
kd-11	87741141f1	rsx/vulkan: Add post-compilation key validation and dynamically determine attachment write maks based on decompiled shader - A new step is added between decompilation and pipeline object creation allowing for properties to be updated based on shader contents - Allos masking off attachment writes that are unmodified in the shader	2018-03-13 18:55:03 +03:00
Nekotekina	cce0ad0c35	Clean vm::ps3 namespace use	2018-02-09 17:49:37 +03:00
kd-11	71f69d1d48	rsx/overlays: Introduce 'native' HUD UI and implement some common dialogs (#4011 )	2018-01-17 19:14:00 +03:00
Jake	7ca2c444cc	rsx: Fix depth clipping	2018-01-14 20:50:55 +03:00
Jake	c5074ba81f	d3d12: fix invalid framebuffer crash and shader compile	2018-01-14 20:50:55 +03:00
kd-11	f5145943b2	d3d12: Fix fragment shader compile	2017-12-04 18:22:18 +03:00
kd-11	cdd4fd9867	rsx/fp: Explicitly insert global functions. - Functions such as pack/unpack ops must exist before the shared gather functions are declared	2017-12-04 18:22:18 +03:00
kd-11	fe9090bd39	rsx/fp: Implement register gather (only for UP(X) instructions) - Workaround for temp register aliasing between H and R variants - TODO: Implement temp regs as 128 bit-blocks with r/w as pack/unpack	2017-12-01 21:00:50 +03:00
kd-11	a18ae0f6ac	rsx/fp: Reimplement PK(X) and UP(X) opcodes. The read back values are obviously in normalized range - Confirmed with a GOW shader which writes result of UP8 to BGRA8 output	2017-12-01 21:00:50 +03:00
kd-11	de5a4fe083	rsx: Reimplement depth <-> RGBA reinterpretation code - Implements proper channel order for fp24-ARGB8 conversion - Takes swizzle remap into account when reconstructing source bytes	2017-12-01 21:00:50 +03:00

1 2 3 4 5 ...

735 commits