rpcsx

mirror of https://github.com/RPCSX/rpcsx.git synced 2026-03-01 19:14:05 +01:00

Author	SHA1	Message	Date
kd-11	d9a9766e41	gl: Refactoring and fallback support for compute acceleration	2019-10-13 19:00:05 +03:00
kd-11	b39bfa02a6	gl: Windows bringup	2019-10-13 19:00:05 +03:00
kd-11	105d4b51e6	gl: Use compute shaders for typeless texture decode	2019-10-13 19:00:05 +03:00
kd-11	7a6e2e716f	gl: Add a framework for compute shaders	2019-10-13 19:00:05 +03:00
kd-11	4a19a2dd24	rsx: Explicity describe transfer regions for both source and destination blocks	2019-10-04 18:10:46 +03:00
kd-11	7aed9c3f13	gl: Add missing input declarations for 2-sided lighting	2019-09-30 21:52:43 +03:00
kd-11	88229f4716	gl: Remember to unbind attachments from active framebuffer after clear - If a stale reference is left lying around (e.g the texture bound to depth has been deleted and we attach a color image) no operations actually take place. glCheckFramebufferStatus also does not catch this problem.	2019-09-30 21:52:43 +03:00
kd-11	bcf8799079	rsx: Fix missing point size export - Sometimes program-point-size is enabled, but the vs does not actually write to the point size register. In this case, pass the incoming point size along instead of the default register init.	2019-09-30 01:40:04 +03:00
kd-11	2275259bf5	rsx: Properly scale overlay passes to match drawable area	2019-09-28 13:24:14 +03:00
kd-11	28534e8833	gl: Remove a debug print	2019-09-28 13:24:14 +03:00
Jan Beich	5ec35c7daa	rsx: unbreak build with Clang 9 ld: error: rpcs3/CMakeFiles/rpcs3.dir/main_application.cpp.o: unable to find library from dependent library specifier: opengl32.lib ld: error: rpcs3/Emu/librpcs3_emu.a(GLGSRender.cpp.o): unable to find library from dependent library specifier: opengl32.lib ld: error: rpcs3/Emu/librpcs3_emu.a(GLRenderTargets.cpp.o): unable to find library from dependent library specifier: opengl32.lib ld: error: rpcs3/Emu/librpcs3_emu.a(GLVertexBuffers.cpp.o): unable to find library from dependent library specifier: opengl32.lib	2019-09-24 01:00:45 +03:00
kd-11	e0005ec347	rsx: Refactoring and improvement - Separate displayed statistics from actual backend statistics. Allows asynchronous flipping to work correctly as it just uses display stats. The real stats are used by the frame scope marker to determine behavior like engaging the FIFO optimizer or skipping draw calls correctly.	2019-09-19 23:10:09 +03:00
kd-11	858014b718	rsx: Experiments with nul sink	2019-09-12 23:32:21 +03:00
kd-11	75fcfac00e	rsx: Modify find_cached_texture to respect gcm_format. Can pass 0 for "dont care"	2019-09-10 16:54:02 +03:00
kd-11	48a5cd545f	gl: Do not byteswap uint24_8 as it needs a custom 8_24 decoder	2019-09-08 13:56:41 +03:00
kd-11	a7b9ff33d8	gl: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	99fb6d6a5d	rsx: Allow GPU-accelerated stream manipulation when doing texture uploads	2019-08-30 21:46:19 +03:00
kd-11	e334a43169	rsx: Fix surface cache hit tests - Avoid silly broken tests due to queue_tag being called before pitch is initialized. - Return actual memory range covered and exclude trailing padding. - Coordinates in src are to be calculated with src_pitch, not required_pitch.	2019-08-28 14:54:51 +03:00
kd-11	2962e05f26	rsx: Implement per-RTT color masks - Also refactors and simplifies some common code in surface store and rsx core	2019-08-27 21:59:02 +03:00
kd-11	27aeaf66bc	gl: Restructure buffer objects to give more control over usage - This allows creating buffers with no MAP bits set which should ensure they are created for VRAM usage only - TODO: Implement compute kernels to avoid software fallback mode for pack/unpack operations	2019-08-27 21:59:02 +03:00
Nekotekina	d2eba2387b	Use g_fxo for display_manager	2019-08-27 03:50:15 +03:00
Nekotekina	928719b658	Use g_fxo for rsx::avconf	2019-08-27 03:50:15 +03:00
kd-11	3317e13b64	rsx: Hotfix for semaphore timeout bug - Add pending flip requests as a reason to invoke the RSX local task handler and release the vblank semaphore	2019-08-26 22:33:29 +03:00
kd-11	eed32cf3a4	rsx: Decompiler fixups and improvements - Fix 2D coordinate sampling of W coordinate. W is actually HPOS.w and not 1. Z is however always 0. - Optimize register usage a bit Disassembling compiled SPV shows that global declaration results in less ops than using inout modifiers. Modifiers generate extra mov instructions.	2019-08-26 20:03:31 +03:00
kd-11	3e28e4b1e0	rsx/decompiler: Restructure program register behavior - Fix reading of varying registers in FP Different registers have different behavior - Always write to varying registers. If a register is not written to, it is initialized to (0, 0, 0, 1) - Reimplements two-sided lighting correctly without hacks - Also bumps shader cache version	2019-08-26 20:03:31 +03:00
kd-11	f9aea076ae	rsx: Implement depth_buffer_float support. - Since this is transparent to the application at all time, it only becomes a problem when doing memory transfer or DEPTH->RGBA conversion in shaders.	2019-08-26 20:03:31 +03:00
kd-11	9d981de96d	rsx: Fix offloader deadlock - Do not allow offloader to handle its own faults. Serialize them on RSX instead. This approach introduces a GPU race condition that should be avoided with improved synchronization. - TODO: Use proper GPU-side synchronization to avoid this situation	2019-08-25 22:09:20 +03:00
kd-11	7c5bde4aeb	rsx: Update tag timestamp to match newest inherited data - Avoids memory appearing older when used for depth test without depth write The write_barrier before the call will inherit new data but the tag will not update as no new information is added.	2019-08-21 21:17:15 +03:00
kd-11	5d1b7eb945	rsx: Fix reference leaks in texture_cache<->surface_cache communication - Properly commit orphaned blocks not invalidating existing cache structures - Do not ignore overwritten objects when commiting as unprotected fbo. Avoids stale references to invalidated surface objects.	2019-08-21 21:17:15 +03:00
kd-11	ca8b0da141	gl: Invalidate range before reading to prevent deadlock	2019-08-21 21:17:15 +03:00
kd-11	141072023b	rsx: Fix handling of ARGB8 memory - Load into memory as straightforward BGRA - Fixes a bug in vulkan caused by byte shuffling in blit engine vs shader access - Removes the need for memory shuffling when transferring into a rendertarget	2019-08-21 21:17:15 +03:00
kd-11	35e61c77e0	gl: Fixup for D24S8 readback	2019-08-21 21:17:15 +03:00
kd-11	5e299111cc	rsx/vk: Restructure surface access barriers and implement RCB/RDB - Implements render target data load (aka Read Color Buffer/Read Depth Buffer) - Refactors vulkan surface barrier to be much cleaner. - Removes redundant surface barrier invocations after doing a merged load from surface cache. - Adds explicit access modes when gathering surfaces from cache.	2019-08-18 20:45:48 +03:00
kd-11	dfe709d464	rsx: Surface cache restructuring - Further improve aliased data preservation by unconditionally scanning. Its is possible for cache aliasing to occur when doing memory split. - Also sets up for RCB/RDB implementation	2019-08-18 20:45:48 +03:00
kd-11	a0f0c418d7	gl:Implement proper support for packed 16-bit rendertargets - Also some minor refactoring	2019-08-15 14:00:17 +03:00
kd-11	7f85b18b46	gl: Add support for 4444 typeless texture	2019-08-15 14:00:17 +03:00
RipleyTom	87bf0386c4	Screenshot function	2019-08-14 19:24:42 +02:00
Nekotekina	f63e89f9b4	Implement waitable atomics Moved Atomic.h to util/atomic.hpp List source files in CMakeLists.txt	2019-07-29 03:04:55 +03:00
kd-11	9a7c2784f0	rsx: Do not clip scissor to viewport when doing buffer clear	2019-07-20 16:39:32 +03:00
kd-11	e2574ff100	rsx: Support CSAA transparency without multiple rasterization samples enabled	2019-07-19 15:49:08 +03:00
kd-11	b5a2f0df68	rsx: Implement separate viewport raster clipping - Merge viewport raster window and scissor into one clipping region - Viewport raster clip is different from viewport geometry clipping in hardware as the latter is configurable separately	2019-07-19 14:21:19 +03:00
kd-11	ea2f4d57fa	rsx: Fixups	2019-07-17 13:29:42 +03:00
kd-11	998717659f	rsx: Fix reference leak when cloning surfaces	2019-07-17 13:29:42 +03:00
kd-11	009e01a347	rsx: Set up for multi-section inheritance	2019-07-17 13:29:42 +03:00
kd-11	956270d9be	gl: Add readback/writeback config for format GL_R16	2019-07-09 16:27:59 +03:00
kd-11	50736263d2	gl: Fix native pitch computation	2019-07-08 18:04:56 +03:00
Malcolm Jestadt	b5d5113803	gl: Workaround slow PBO usage with Mesa -Mesa is currently fastest with GL_STREAM_COPY -See `a338dc0186` -Also see https://bugs.freedesktop.org/show_bug.cgi?id=111043	2019-07-03 11:28:29 +03:00
Eladash	43f919c04b	Fixup after #6143 (#6146 ) vm::spu max address was overflowing resulting in issues, so cast to u64 where needed. Fixes #6145. Use vm::get_addr instead of manually substructing vm::base(0) from pointer in texture cache code. Prefer std::atomic_thread_fence over _mm_?fence(), adjust usage to be more correct. Used sequantially consistent ordering in semaphore_release for TSX path as well. Improved memory ordering for sys_rsx_context_iounmap/map. Fixed sync bugs in HLE gcm because of not using atomic instructions. Use release memory barrier in lwsync for PPU LLVM, according to this xbox360 programming guide lwsync is a hw release memory barrier. Also use release barrier where lwsync was originally used in liblv2 sys_lwmutex and cellSync. Use acquire barrier for isync instruction, see https://devblogs.microsoft.com/oldnewthing/20180814-00/?p=99485	2019-06-29 18:48:42 +03:00
JohnHolmesII	a124ec4a26	Remove braces around shader source strings (warnings)	2019-06-28 01:45:29 +03:00
JohnHolmesII	23094b48bb	Fix warnings related to -Wswitch Add default cases. Move default breaks to newline Add proper handling in some instances. Add missing enums to switches	2019-06-28 01:40:52 +03:00
kd-11	d26b25816d	rsx: Improve profiling setup - Avoid spamming QPC when not needed - Free performance when debug overlay is not enabled	2019-06-25 20:50:54 +03:00
kd-11	c32c1b0a62	gl: Minor API tweaks - Avoid spamming the driver with samplerParameter calls unless the parameters have actually changed	2019-06-25 20:50:54 +03:00
kd-11	59ee74a275	rsx: Disable vertex cache if multithreaded memory access is enabled - When multithreaded RSX is enabled, the vertex cache just lowers performance - The small cost of upload is paid by the asynchronous thread, allowing RSX to work optimally	2019-06-25 20:50:54 +03:00
kd-11	6be7c58fa4	glsl: Refactoring, cleanup and optimizations - Avoid generating unused code - Reduce GPR usage in emitted code	2019-06-25 20:50:54 +03:00
Lassi Hämäläinen	c963c51a60	Remove unnecessary header includes - Manually removed lot of unneeded #includes to clean code and reduce compilation time - Reordered some of the #includes to be in more logical order	2019-06-25 17:11:10 +03:00
Lassi Hämäläinen	a070a414a6	Move rsx::constants and rsx::limits to rsx_utils.h	2019-06-25 17:11:10 +03:00
Lassi Hämäläinen	e9e87b8bd9	Add missing #includes to header files - Multiple header files where missing #includes to other headers that where used in the header. Correct header was included in correct order in source files which caused everything to compile. - Added missing #includes so header files correctly include all their dependencies and fixes problems with IDEs being unable to parse headers correctly due to missing symbols	2019-06-25 17:11:10 +03:00
scribam	185fd3d257	rsx: Minor cleanup after #6055	2019-06-17 00:31:38 +03:00
kd-11	e4671c29a6	rsx: Fix typo - Arguments to the transform function are xxyy not xyxy	2019-06-14 16:19:52 +03:00
kd-11	8a1cf2c913	rsx: Attempt to reduce stencil load overhead for nvidia cards	2019-06-14 16:19:52 +03:00
kd-11	4a5bbba277	rsx: Enable MSAA - vk: Enable depth buffer resolve+unresolve - vk: Add AMD stenciling extension support - rsx: Temporarily disables MSAA-compatible hacks such as transparency AA - TODO: Add paths to optionally disable MSAA	2019-06-14 16:19:52 +03:00
kd-11	f6f3b40ecc	rsx: Fix AA coordinate transforms - Requires native_pitch value to take samples into account	2019-06-14 16:19:52 +03:00
kd-11	655eff29e8	rsx: Refactoring and cleanup after d3d12 separation - Remove deprecated functionality - Refactor to share code between common routines	2019-06-14 16:19:52 +03:00
kd-11	0d906d6974	rsx: Remove surface aa_mode hacks	2019-06-14 16:19:52 +03:00
scribam	13671d9684	rsx: Apply Clang-Tidy fix "modernize-loop-convert" + const when relevant	2019-06-12 15:11:52 +03:00
scribam	370dcd9d6e	rsx: Apply Clang-Tidy fix "readability-simplify-subscript-expr"	2019-06-12 15:11:52 +03:00
scribam	f1e939936a	rsx: Apply Clang-Tidy fix "modernize-use-override"	2019-06-12 15:11:52 +03:00
scribam	44265aa27d	rsx: Apply Clang-Tidy fix "modernize-use-equals-default"	2019-06-12 15:11:52 +03:00
scribam	635695ac78	rsx: Apply Clang-Tidy fix "modernize-use-emplace"	2019-06-12 15:11:52 +03:00
scribam	a02a8642b0	rsx: Apply Clang-Tidy fix "modernize-make-unique"	2019-06-12 15:11:52 +03:00
scribam	b91bcdbbca	rsx: Apply Clang-Tidy fix "modernize-use-bool-literals"	2019-06-12 15:11:52 +03:00
scribam	349e7c8708	rsx: Apply Clang-Tidy fix "readability-non-const-parameter"	2019-06-12 15:11:52 +03:00
scribam	ac7e89660f	rsx: Apply Clang-Tidy fix "readability-redundant-smartptr-get"	2019-06-12 15:11:52 +03:00
scribam	801fa0113f	rsx: Apply Clang-Tidy fix "readability-inconsistent-declaration-parameter-name"	2019-06-12 15:11:52 +03:00
scribam	8f2647555a	rsx: Apply Clang-Tidy fix "readability-redundant-string-init"	2019-06-12 15:11:52 +03:00
scribam	331fe01762	rsx: Apply Clang-Tidy fix "performance-for-range-copy"	2019-06-12 15:11:52 +03:00
scribam	db926ee671	rsx: Apply Clang-Tidy fix "performance-unnecessary-value-param"	2019-06-12 15:11:52 +03:00
scribam	c4667133c4	gl/vk: Add constexpr to varying_registers and sync functions between the two backends	2019-06-12 10:59:31 +01:00
kd-11	d361eedbec	rsx: Clean up window management code - Removes a lot of wm_event code that was used to perform window management and is no longer needed. - Significantly simplifies the vulkan code. - Implements resource management when vulkan window is minimized to allow resources to be freed.	2019-06-10 14:57:03 +03:00
Nekotekina	dfd50d0185	Implement std::bit_cast<> Partial implementation of std::bit_cast from C++20. Also fix most strict-aliasing rule break warnings (gcc).	2019-06-02 23:22:16 +03:00
scribam	09c9996f31	Use empty() instead of comparing size() with 0 Recommendation from Clang-Tidy: https://clang.llvm.org/extra/clang-tidy/checks/readability-container-size-empty.html	2019-06-01 22:59:23 +03:00
scribam	78c7ef3039	rsx: Use clear() instead of resize(0) The result is the same but clear [1] has slightly less code than resize [2] and signals better the intent IMHO. [1] `fb7fb646fa/libstdc%2B%2B-v3/include/bits/stl_vector.h (L1495)` [2] `fb7fb646fa/libstdc%2B%2B-v3/include/bits/stl_vector.h (L934)`	2019-06-01 22:59:23 +03:00
msuih	ef587d4cdc	Limit shaderlog writing behind log_programs setting	2019-05-31 19:49:32 +03:00
kd-11	c3b234f972	gl: Fix staging buffer size calculation	2019-05-22 01:18:46 +03:00
kd-11	05eb1e9193	rsx: Fix zombie image references from inside the texture cache - Do not add locked orphans to the flush_always cache! They will not remove their cache entries as they are not bound	2019-05-16 19:25:26 +03:00
kd-11	214bb3ec87	rsx: Always initialize memory unless it is guaranteed to be wiped	2019-05-16 19:25:26 +03:00
kd-11	88290d9fab	rsx: Hack around using data regions as transfer targets	2019-05-16 19:25:26 +03:00
kd-11	4182f9984d	rsx: Propagate split section information back to the texture cache	2019-05-16 19:25:26 +03:00
kd-11	4b443be881	rsx: Fix self-intersection with previous occupant of the address being replaced	2019-05-16 19:25:26 +03:00
kd-11	b840f6da28	[WIP] rsx: Use a sane reference counting model	2019-05-16 19:25:26 +03:00
kd-11	88c20afd3a	rsx: Implement unaligned surface inheritance with hierachial contribution - Allows render targets to behave like stacked 3D views same as shader inputs are resolved - Basically implements most of 'Read Color/Depth Buffers" option for 'free'. - Allows splitting RTV/DSV resources if they are superceded by a partial surface - Also allows intersecting new resources through the surface cache for proper inheritance from other scattered data - TODO: Refactor bind_surface_as_rtt and bind_surface_as_ds to reduce asinine code duplication	2019-05-16 19:25:26 +03:00
scribam	22f61caf9f	GLTexture: add missing #pragma once directive	2019-05-12 18:32:11 +03:00
scribam	6c5ea068c9	Remove redundant semicolons Fix "-Wextra-semi" warnings	2019-05-12 18:32:11 +03:00
scribam	3623f4343f	gl/vk: clear scissor_setup_invalid bit along with scissor_config_state_dirty bit	2019-05-11 13:13:49 +03:00
kd-11	9c346c92f3	gl: undo an accidental deletion	2019-05-05 13:37:55 +03:00
kd-11	1d5c52f476	rsx: Ignore stencil clear flag if the stencil write mask is disabled	2019-05-01 15:36:21 +03:00
kd-11	63f9b8e0c6	gl/vk: Minor cleanup	2019-05-01 15:36:21 +03:00
kd-11	4e3ec162e2	rsx: Fix broken texture cache search when flipping	2019-05-01 15:36:21 +03:00
kd-11	f56a6548b0	gl: Remove workaround for AMD driver bug fixed in driver 19.4.3	2019-05-01 15:36:21 +03:00
kd-11	60f3059d22	rsx: Compensate for nvidia's low precision attribute interpolation - The hw generates inaccurate values when doing perspective-correct interpolation of vertex output attributes and makes the comparison (a == b) fail even when they are a fixed constant value. - Increase equality tolerance when doing comparisons in fragment shaders for NV cards only to work around this issue. - Teepo fix	2019-04-25 16:23:05 +03:00
kd-11	463b1b220d	rsx: Improve accuracy of shadow compare Ops when non-integer depth formats are used - The fixed-point D24S8 format does special Z clamping during compare which matches PS3 behaviour - D32S8 is a floating point format and comparison with Dref > 1 always fails causing black edges/borders	2019-04-25 16:23:05 +03:00
kd-11	06a85f00d1	rsx: Shader decompiler cleanup and improvements - Improve support for float16_t by minimizing mixed inputs to functions (ambiguous overloads) - Minimize amount of downcasts in code by using opcode flags - Re-enable float16_t support for vulkan	2019-04-25 16:23:05 +03:00
kd-11	a668560c68	rsx: Use native half float types if available - Emulating f16 with f32 is not ideal and requires a lot of value clamping - Using native data type can significantly improve performance and accuracy - With openGL, check for the compatible extensions NV_gpu_shader5 and AMD_gpu_shader_half_float - With Vulkan, enable this functionality in the deviceFeatures if applicable. (VK_KHR_shader_float16_int8 extension) - Temporarily disable hw fp16 for vulkan	2019-04-25 16:23:05 +03:00
eladash	6f76e34104	rsx: Fix race on clearing native_ui vs emu_requested flag	2019-04-20 01:04:41 +03:00
kd-11	a5ed30a8c0	rsx: Fixups for data cast operations via typeless transfer	2019-04-09 13:40:54 +03:00
kd-11	f04a0a2bb6	rsx: Remove some old restrictions affecting memory persistence	2019-04-09 13:40:54 +03:00
kd-11	cc3809fbfe	gl: Register a few more missing formats for conversion	2019-04-09 13:40:54 +03:00
kd-11	e4e86455f2	rsx: Fix temporary subresource caching behaviour - Do not cache if a gathered subresource contains a bound RTT - Change op to dynamic copy if parent is still bound	2019-04-09 13:40:54 +03:00
kd-11	3249000511	rsx: Improvements to texture scanning - Removes CPU-only transforms that broke GPU-side code. -- Channels in GPU compute are laid out in cell-order, but CPU was uploading in favorable order and compensating with swizzles. -- This leads to 2 different layouts depending on the location of the data (CPU vs GPU) - Implement R8G8_R8B8 interleaved format decode - General improvements	2019-04-09 13:40:54 +03:00
kd-11	366e4c2422	rsx: Preliminary support for format conversions using typeless resolve	2019-04-09 13:40:54 +03:00
kd-11	dbc8e70ddd	rsx: Silence some compiler noise	2019-03-22 21:27:15 +03:00
kd-11	adc59f9810	rsx: Fix blit transfers when texel sizes mismatch - Also refactors some bpp handling code - Simplify texture intersection test to use a normalized/uniform coordinate space - Fix broken bounds checking as well	2019-03-22 21:27:15 +03:00
kd-11	b879b32271	rsx: Fix bpp calculation taking resolution scaling into account - Do not rely on image->width(), use surface_width() instead for unscaled values - Refactor/clean GL rendertarget class a bit	2019-03-20 10:05:54 +03:00
kd-11	bb65e45614	rsx: Implement GPU acceleration for rotated images	2019-03-17 21:50:11 +03:00
kd-11	5260f4b47d	rsx: Improvements to memory flush mechanism - Batch dma transfers whenever possible and do them in one go - vk: Always ensure that queued dma transfers are visible to the GPU before they are needed by the host Requires a little refactoring to allow proper communication of the commandbuffer state - vk: Code cleanup, the simplified mechanism makes it so that its not necessary to pass tons of args to methods - vk: Fixup - do not forcefully do dma transfers on sections in an invalidation zone! They may have been speculated correctly already	2019-03-17 21:50:11 +03:00
kd-11	385485204b	vk/gl: Omit unlocked data when grabbing flip sources from texture cache	2019-03-17 21:50:11 +03:00
kd-11	74eeacd091	vk/gl: Improve memory tag sync and test - Properly pass parameters such as rsx-pitch to the surface store - Do not crash if a surface fails verification in flip, use fall-back instead	2019-03-17 21:50:11 +03:00
kd-11	a49a0f2a86	vk/gl: Synchronization improvements - Properly wait for the buffer transfer operation to finish before map/readback! - Change vkFence to vkEvent which works more like a GL fence which is what is needed. - Implement supporting methods and functions - Do not destroy fence by immediately waiting after copying to dma buffer	2019-03-17 21:50:11 +03:00
kd-11	3a4083263e	rsx: Fix texture transfer when pitch does not match exactly	2019-03-17 21:50:11 +03:00
kd-11	1875dc3f18	gl: Fix buffer size calculations	2019-03-10 16:09:05 +03:00
kd-11	04dda44225	rsx: Properly generate render target data with all parameters provided - Build-up to variable-sized framebuffers and AA implementation - Also allows accurate range calculation for our hit testing	2019-03-10 16:09:05 +03:00
kd-11	9d4d3d9443	rsx: Reimplement render target intersection tests when using hw accelerated blit engine - Properly collapse memory tree when scanning in case of overlaps!	2019-03-10 16:09:05 +03:00
kd-11	7c379432dd	rsx: Implement proper pitch compatibility lookup - When a single row is required or is all that is available, pitch has no meaning as the coordinate space changed to 1D	2019-03-10 16:09:05 +03:00
kd-11	a80f1a6ed4	gl: Fix memory tag sampling - Also fixes a bad arg passed to glClearBuffer	2019-03-10 16:09:05 +03:00
kd-11	0395fb9955	rsx/tecture_cache: Addendum - fix data cast with scaling conversion (AA emulation) - Blit operations do format conversion automatically which is NOT what we want! - Scale onto temp buffer with similar format before performing data cast.	2019-03-10 16:09:05 +03:00
kd-11	10dc3dadee	rsx/texture_cache: Improve framebuffer memory locking when WCB/WDB is not enabled - Adds a new mode that removes non-framebuffer stuff inside framebuffer range	2019-03-10 16:09:05 +03:00
kd-11	563e205a72	rsx/texture_cache: Fix 'AA' scaling hack and restore collection template selection	2019-03-10 16:09:05 +03:00
kd-11	3a071a9c07	rsx: Texture search rewrite - Perform a full search across all resource types as needed without taking too many shortcuts/hacks	2019-03-10 16:09:05 +03:00
kd-11	ef071ebb6b	rsx: Synchronize surface cache and texture cache data - TODO: The whole upload_texture thing is a big hack, fix it properly	2019-03-10 16:09:05 +03:00
kd-11	38887bc03e	gl/vk: Improvements to overlay rendering - gl: Properly initialize and manage sampler states - gl/vk: Snap overlay elements to pixel grid by aligning to pixel centers - overlays: Disable grid snapping in stb since its now handled in the backend	2019-02-05 12:15:12 +03:00
kd-11	9e39e2d2c4	gl/vk: Fix clip region scaling for overlay elements	2019-02-02 11:54:01 +03:00
kd-11	9ed9d7e947	overlays/osk: Implement native osk interface	2019-02-02 11:54:01 +03:00
kd-11	660bfeabae	gl: Fixup - inline arrays	2019-01-25 14:34:22 +03:00
kd-11	fa9b448686	vk: Spec fixups - Disable DEPTH<->RGBA typeless transfers for now as they require a lot more work to work for all vendors - Do not allow switching layouts to UNDEFINED/PREINITIALIZED formats	2019-01-25 14:34:22 +03:00
kd-11	521969bcc3	gl: Remove GL_R 'format'. There is no GL_R format, it part of the S-T-Q-R enums for texture coordinate space	2019-01-25 14:34:22 +03:00
kd-11	5a4bea8c4f	gl: Blit fixup - Typo fix. I meant to disable scissor test, not stencil test - Also clean up and simplify/optimize the core logic	2019-01-25 14:34:22 +03:00
kd-11	fb778e4821	rsx: Reimplement attrib divisor	2019-01-25 14:34:22 +03:00
kd-11	6fdc0fd7f0	rsx: Reimplement MSAA transparency - Apply dither to edges that almost fail the straight-up alpha test - Significantly improves alpha tested geometry far from the camera - Also removes blend factor overrides/hacks as they give incorrect results due to background bleeding	2019-01-25 14:34:22 +03:00
kd-11	7eec702c6d	gl: Fix silly regression with blit dst resource readback	2019-01-25 14:34:22 +03:00
kd-11	8093c9b573	rsx: Disable rtt side-effects when async compilation is ongoing. Only real renders should promote buffer state from underined to drawn, otherwise keep previous contents intact.	2019-01-25 14:34:22 +03:00
kd-11	417a2e6731	rsx: Refactor index buffers - Index offset is ignored anyway and only used to calculate vertex attribute divisor index - Specialized optimization for untouched xfer without primitive restart	2019-01-25 14:34:22 +03:00
Nekotekina	bd9131ae1c	Implement fs::get_cache_dir Win32: equal to config dir for now Linux: respect XDG_CACHE_HOME if specified OSX: possibly incomplete	2019-01-13 14:45:36 +03:00
kd-11	52ac0a901a	rsx: improve memory coherency - Avoid tagging and rely on read/write barriers and the dirty flag mechanism. Testing is done with a weak 8-byte memory test - Introducing new data when tagging breaks applications with race conditions where tags can overwrite flushed data	2019-01-06 10:44:40 +03:00
kd-11	95245bdd83	rsx: Improve ARGB8->D24S8 casting - Set up partial transfers - Force clear of target before starting the transfer	2019-01-06 10:44:40 +03:00
kd-11	475cc99117	rsx: Fix dirty flag reset after a partial attachment initialization - D24S8 targets have 2 aspects that are dealt with separately; Forcefully initialize the remaining data if a partial init is done. Its 'free' anyway - It seems that the stencil mask matters when clearing unlike the depth mask and color mask	2019-01-06 10:44:40 +03:00
kd-11	c80c7f06bb	rsx: Typo fix - This silly typo broke the flip improvements in the GT fixes PR	2019-01-06 10:44:40 +03:00
kd-11	2a62fa892b	rsx: Texture cache refactor - gl: Include an execution state wrapper to ensure state changes are consistent. Also removes a lot of required 'cleanup' for helper methods - texture_cache: Make execition context a mandatory field as it is required for all operations. Also removes a lot of situations where duplicate argument is added in for both fixed and vararg fields - Explicit read/write barrier for framebuffer resources depending on usage. Allows for operations like optional memory initialization before reading	2019-01-06 10:44:40 +03:00
kd-11	1ffadbe086	rsx: Reorganize write barrier implementation (either clear or memory barrier)	2019-01-06 10:44:40 +03:00
kd-11	a95a44cf66	rsx: Strictness cleanups - Also account for variable pitch textures (swizzled scan)	2019-01-06 10:44:40 +03:00
kd-11	474d0f61a2	minor typo fix	2019-01-06 10:44:40 +03:00
kd-11	362eea09a1	whitespace fix only	2019-01-06 10:44:40 +03:00
kd-11	15d5507154	rsx: Rewrite memory inheritance transfers - Implicitly invoke a memory barrier if actively reading from an unsynchronized texture - Simplify memory transfer operations - Should allow more games to work without strict mode	2019-01-06 10:44:40 +03:00
kd-11	97704d1396	rsx: Fix texture size calculations	2019-01-06 10:44:40 +03:00
kd-11	15488eb247	rsx: Avoid unnecessarily touching framebuffer memory - Do not bind companion framebuffer when clearing single aspect; let the contest mechanism sort it out instead - Do not prematurely tag framebuffers, instead only do so at write-confirmation time. Should avoid false tagging if setup does not allow a render to occur.	2019-01-06 10:44:40 +03:00
kd-11	9c46386dd4	rsx: Check av configuration when selecting display buffers! - Some applications have mismatch between video output configuration and display buffer sizes	2018-12-24 09:05:19 +03:00
kd-11	4b79ef1ad9	rsx: Implement stencil mirror views - Implements a mirror view of D24S8 data that accesses the stencil components. Finishes the implementation of TEX2D_DEPTH_RGBA as the stencil component was previously missing from the reconstructed data - Add a few missing destructors Image classes are inherited a lot and I forgot to make the dtors virtual	2018-12-24 09:05:19 +03:00
kd-11	c75749f8ce	rsx: fix flip logic when grabbing output from the surface cache	2018-12-24 09:05:19 +03:00
Rui Pinheiro	bcdf91edbb	Misc. Texture Cache fixes	2018-12-11 22:37:10 +03:00
Rui Pinheiro	9d1cdccb1a	Implement dedicated texture cache predictor	2018-12-11 22:37:10 +03:00
Rui Pinheiro	af360b78f2	Texture cache section management fixups Fixes VRAM leaks and incorrect destruction of resources, which could lead to drivers crashes. Additionally, lock_memory_region is now able to flush superseded sections. However, due to the potential performance impact of this for little gain, a new debug setting ("Strict Flushing") has been added to config.yaml	2018-12-11 22:37:10 +03:00
kd-11	504ab5a6d4	rsx: Minor cleanup to silence stupid compiler warnings	2018-12-03 20:01:23 +03:00
kd-11	2168159d03	gl: Fix flip regression - Restore graphics state after flip (including active fbo) because flip can be made through a syscall	2018-11-30 23:51:25 +03:00
kd-11	b96ed5cd4e	gl: Do not rely on driver statistics for s3TC textures; they are inconsistent.	2018-11-30 23:51:25 +03:00
kd-11	5b6e1420f3	rsx: Pipeline barriers fixed up - Ensure barriers are invoked even if no draw occurs! -- Ensures that deferred commands are executed eventually	2018-11-30 23:51:25 +03:00
kd-11	8a186bb97e	rsx: Fix insertion of execution barriers - Ignore barriers inserted after BEGIN but before any draw commands are emitted - Properly process tail barriers inserted before END but after draw commands are submitted - Ignore execution barriers with no effect (same register value written)	2018-11-30 23:51:25 +03:00
kd-11	5193c99973	rsx: Enable dynamic FIFO preprocessing - Tries to detect when FIFO preprocessing is beneficial and only enables optimizations if the benefit outweighs the cost - Current threshold is at least 500 draw calls saved at over 2000 draw calls to justify the overhead - TODO: More tuning for other CPUs	2018-11-30 23:51:25 +03:00
kd-11	7b065d7781	rsx: Fixup; input attributes blob decoding - Use an unstructured blob and index into the vec4 structures to extract the real data	2018-11-30 23:51:25 +03:00
kd-11	846daadd5d	rsx: Fixups - Improve vertex attribute layout format. Allows for full 16-bit attribute divisor - Use actual pitch when declaring framebuffer rsx pitch instead of register value in case of swizzle? rendering	2018-11-30 23:51:25 +03:00
kd-11	d6b4440ef9	gl: Separate vertex env from program env	2018-11-30 23:51:25 +03:00
kd-11	54ec363e88	rsx: Critical pipeline fixes - Fix scissor and viewport binding behavior - Fixes recovery if empty scissor is specified and then 'fixed' later - Optimizes state binding a bit	2018-11-30 23:51:25 +03:00
kd-11	1ad76ad331	rsx: Restructure programs - Also re-enable pipeline optimizations	2018-11-30 23:51:25 +03:00
kd-11	b0a6b72ce8	rsx: Optimizations - Replace a few more vectors with simple_array<T> - Avoid unnecessary string comparisons in backends. We already know referenced textures from the program analysers!	2018-11-30 23:51:25 +03:00
kd-11	677b16f5c6	rsx: Fixups - Also fix visual corruption when using disjoint indexed draws - Refactor draw call emit again (vk) - Improve execution barrier resolve - Allow vertex/index rebase inside begin/end pair - Add ALPHA_TEST to list of excluded methods [TODO: defer raster state] - gl bringup - Simplify - using the simple_array gets back a few more fps :)	2018-11-30 23:51:25 +03:00
kd-11	e01d2f08c9	rsx: Refactor FIFO - Removes fifo structures from common RSXThread - Sets up a dedicated FIFO controller - Allows for configurable queue optimizations	2018-11-30 23:51:25 +03:00
Nekotekina	1b37e775be	Migration to named_thread<> Add atomic_t<>::try_dec instead of fetch_dec_sat Add atomic_t<>::try_inc GDBDebugServer is broken (needs rewrite) Removed old_thread class (former named_thread) Removed storing/rethrowing exceptions from thread Emu.Stop doesn't inject an exception anymore task_stack helper class removed thread_base simplified (no shared_from_this) thread_ctrl::spawn simplified (creates detached thread) Implemented overrideable thread detaching logic Disabled cellAdec, cellDmux, cellFsAio SPUThread renamed to spu_thread RawSPUThread removed, spu_thread used instead Disabled deriving from ppu_thread Partial support for thread renaming lv2_timer... simplified, screw it idm/fxm: butchered support for on_stop/on_init vm: improved allocation structure (added size)	2018-10-19 22:22:35 +03:00
eladash	83b6c98563	rsx: Fix u16 index arrays overflow Force u32 index array destinations to avoid overflows when adding vertex base index.	2018-10-08 16:39:47 +03:00
eladash	fa723f6dc4	rsx: Fix texture depth read	2018-10-03 20:57:46 +03:00
eladash	a92ae827c1	rsx: Remove texture mipmap hack	2018-10-03 20:57:46 +03:00
Nekotekina	da6ce80f4f	Make vm::get_super_ptr return contiguous memory Cleanup RSX code complexity	2018-09-27 23:37:13 +03:00
kd-11	6a9f234dc7	rsx: Fixup flip behaviour - handle_emu_flip is very heavy, only fire	2018-09-26 19:41:50 +03:00
kd-11	f72157bcec	rsx: Fix vertex attrib parsing	2018-09-25 22:03:35 +03:00
kd-11	a3d44b5e1f	rsx: Cleanup changes for the flip patch	2018-09-24 16:44:02 +03:00
Jake	699eadc84f	rsx: Move render flip from rsx queue command to flip command	2018-09-24 16:44:02 +03:00
Rui Pinheiro	35139ebf5d	Texture cache cleanup, refactoring and fixes	2018-09-24 15:26:40 +03:00
eladash	06572c6011	rsx: Fix vertex count if all the streams are disabled	2018-09-24 13:25:05 +03:00
kd-11	dafc914bcc	rsx: temporary hack - Removes all use of valid_count as a metric until the new refactor is merged	2018-09-21 16:32:23 +03:00
kd-11	2b6e6a9ae9	gl: Fix problems with framebuffer reuse - Matching attachments with resource id fails because drivers are reusing handles! - Properly sets up stale fbo ref counting and removal - Properly sets up resource reference test with subsequent removal to avoid using a broken fbo entry	2018-09-21 16:32:23 +03:00
kd-11	fc486a1bac	rsx: Preserve memory order when doing flush - Orders flushing to preserve memory at all cost - Avoids false positive where flushing overlapping sections can falsely invalidate another with head/tail test	2018-09-21 16:32:23 +03:00
kd-11	23dc9d54e3	rsx: Fix flip source selector	2018-09-21 16:32:23 +03:00
kd-11	a21bdb9f45	rsx; blit engine fixes - Forcefully downloads and reuploads data from the CPU in case of unexpected overlaps - Properly detect correct size of newly created blit targets - Remember to clear any existing views when changing the default component map!	2018-09-21 16:32:23 +03:00
kd-11	d6dc1493cb	rsx/overlays: Implement blur, darkening and ability to disable custom background	2018-09-18 16:24:13 +03:00
kd-11	9f61fb5a78	overlays: Allow custom background for message dialog	2018-09-18 16:24:13 +03:00
Lassi Hämäläinen	7aef811ff7	CMake: Refactor CMake build (#5032 ) * CMake: Refactor build to multiple libraries - Refactor CMake build system by creating separate libraries for different components - Create interface libraries for most dependencies and add 3rdparty::* ALIAS targets for ease of use and use them to try specifying correct dependencies for each target - Prefer 3rdparty:: ALIAS when linking dependencies - Exclude xxHash subdirectory from ALL build target - Add USE_SYSTEM_ZLIB option to select between using included ZLib and the ZLib in CMake search path * Add cstring include to Log.cpp * CMake: Add 3rdparty::glew interface target * Add Visual Studio CMakeSettings.json to gitignore * CMake: Move building and finding LLVM to 3rdparty/llvm.cmake script - LLVM is now built under 3rdparty/ directory in the binary directory * CMake: Move finding Qt5 to 3rdparty/qt5.cmake script - Script has to be included in rpcs3/CMakeLists.txt because it defines Qt5::moc target which isn't available in that folder if it is included in 3rdparty directory - Set AUTOMOC and AUTOUIC properties for targets requiring them (rpcs3 and rpcs3_ui) instead of setting CMAKE_AUTOMOC and CMAKE_AUTOUIC so those properties are not defined for all targets under rpcs3 dir * CMake: Remove redundant code from rpcs3/CMakeLists.txt * CMake: Add BUILD_LLVM_SUBMODULE option instead of hardcoded check - Add BUILD_LLVM_SUBMODULE option (defaults to ON) to allow controlling usage of the LLVM submodule. - Move option definitions to root CMakeLists * CMake: Remove separate Emu subtargets - Based on discussion in pull request #5032, I decided to combine subtargets under Emu folder back to a single rpcs3_emu target * CMake: Remove utilities, loader and crypto targets: merge them to Emu - Removed separate targets and merged them into rpcs3_emu target as recommended in pull request (#5032) conversations. Separating targets probably later in a separate pull request * Fix relative includes in pad_thread.cpp * Fix Travis-CI cloning all submodules needlessly	2018-09-18 13:07:33 +03:00
scribam	4cb98014a2	rsx: tiny zcull optimizations	2018-09-13 12:43:40 +03:00
eladash	efbd77deb4	rsx: dont silently ignore null shader address	2018-09-12 00:40:20 +03:00
kd-11	f413996362	rsx: Minor texture cache fixes - Retag resources reprotected under flush_always rules - Properly check for blit resource fitting taking into account format mismatch, pitch mismatch and typeless transfers	2018-09-10 15:43:28 +03:00
scribam	343656f66d	cleanup: remove unnecessary return and namespace declaration	2018-09-06 13:15:59 +03:00
scribam	2834c88de7	cleanup: remove intermediate const char* variables	2018-09-06 13:15:59 +03:00
scribam	f83d381e1e	clang-tidy: use nullptr	2018-09-06 13:15:59 +03:00
scribam	c4cff9b543	clang-tidy: remove redundant "apply_swizzle_remap" declaration	2018-09-06 13:15:59 +03:00
scribam	d7bb59cd99	c++17: use std::size	2018-09-06 13:15:59 +03:00
Nekotekina	ca5158a03e	Cleanup semaphore<> (sema.h) and mutex.h (shared_mutex) Remove semaphore_lock and writer_lock classes, replace with std::lock_guard Change semaphore<> interface to Lockable (+ exotic try_unlock method)	2018-09-03 23:00:36 +03:00
kd-11	5a08b690d5	gl: always clean up the heap when using legacy buffers	2018-09-03 18:24:20 +03:00
scribam	bf89b709cb	Remove useless #include	2018-08-31 20:13:54 +04:00
scribam	7d0e94ab0a	Compilation fixes for optional on osx	2018-08-31 20:13:54 +04:00
Lassi Hämäläinen	79cf2832ae	Remove Utilities/variant.hpp and use C++17 variant - Remove also Utilities/variant_visitor.hpp - Fix variant and variant_visitor usages and #includes	2018-08-31 17:49:59 +04:00
kd-11	c6e35706a3	vk: Support sw component swizzle decode because metal sucks	2018-08-23 22:54:56 +03:00
kd-11	ec31157bc7	gl: Avoid unnecessary scissor state change every draw call.	2018-08-18 16:14:30 +03:00
kd-11	8c93db342f	gl: Reuse framebuffer resources - WIP optimizations for GL backend	2018-08-18 16:14:30 +03:00
kd-11	f8a9b1fa30	[WIP] rsx: Improve memory inheritance hierachy - Cascade memory writes by invalidating 'downstream' subsurfaces - Fixup; always resolve for overlapping surfaces before sampling (force atlas gather test)	2018-08-18 16:14:30 +03:00
kd-11	ba5b59dc59	gl: Do not create secondary context if async is disabled - Some third party programs fall apart when multiple contexts are created	2018-08-18 16:14:30 +03:00
kd-11	741ee9ac41	rsx: Allow linear filtering when reading back GPU-resident memory	2018-08-18 16:14:30 +03:00
kd-11	d0165290b6	rsx: Refactor and fix framebuffer layout checks - Refactors shared code back into rsx core - Adds extra check to avoid contest confusion	2018-08-18 16:14:30 +03:00
kd-11	0267221586	Minor optimizations and fixes - FIFO: avoid multiline spam - VK: Fix program setup counter - FS: Precalculate fragment constants buffer size during analysis step	2018-08-18 16:14:30 +03:00
kd-11	3b47e43380	rsx: Synchronization rewritten - Do not do a full sync on a texture read barrier - Avoid calling zcull sync in FIFO spin wait - Do not flush memory to cache from the renderer side; this method is now obsolete	2018-08-18 16:14:30 +03:00
eladash	f349695a75	Rsx: rewrite address translation	2018-08-13 16:16:34 +03:00
kd-11	19d808d378	rsx/gl: Minor cleanup and optimization - Track register change status - Remove unused gl classes	2018-07-22 17:19:59 +03:00
kd-11	8695f95267	rsx: Reimplement cached textures and their views	2018-07-22 17:19:59 +03:00
kd-11	e7f30640ef	rsx: Async shader compilation - Defer compilation process to worker threads - vulkan: Fixup for graphics_pipeline_state. Never use struct assignment operator on vk** structs due to padding after sType member (4 bytes)	2018-07-14 15:19:56 +03:00
kd-11	fa55a8072c	rsx: Improve vertex textures support - Adds proper support for vertex textures, including dimensions other than 2D textures - Minor analyser fixup, removes spurious 'analyser failed' errors - Minor optimizations for program state tracking	2018-07-12 18:02:28 +03:00
kd-11	1ddcad4fa4	facepalm - Fix openGL regression	2018-07-09 13:06:00 +03:00
kd-11	d78957d1cf	rsx/vp: CodeGen improvements - Fix double destination writes on conditional write masking - Fix codegen to simplify simple scalar comparisons vs vector functions	2018-07-07 16:20:33 +03:00
kd-11	2c34195954	rsx/vp: Discard broken vertex programs with no writes to POS register	2018-07-07 16:20:33 +03:00
kd-11	2ca935a26b	vp: Improve vertex program analyser - Adds dead code elimination - Fix absolute branch target addresses to take base address into account - Patch branch targets relative to base address to improve hash matching - Bumps shader cache version - Enables shader logging option to write out vertex program binary, helpful when debugging problems.	2018-07-07 16:20:33 +03:00
kd-11	24f4c92759	rsx: Improve texture cache read speculation	2018-06-26 20:07:20 +03:00
kd-11	1e375e5210	gl: Fixup	2018-06-26 20:07:20 +03:00
kd-11	1730708f47	rsx: Rework memory protection management for framebuffer access - Avoid re-locking memory if there is no reason to do so (no draws issued) - Actively bound regions should always get written to the backing cache - Forcefully read memory during download if writes to the target have occured since last sync event	2018-06-26 20:07:20 +03:00
kd-11	f45dcfe18a	rsx: Fix texture readback - gl: Fix up the calculation for internal image pitch - vk: Implement GPU-side resizing for read back textures (fixes WCB zoom)	2018-06-26 20:07:20 +03:00
eladash	3e433ef05c	create the shaderlog dir in Emu.Init()	2018-06-21 22:54:08 +04:00
kd-11	8f1c36d79f	rsx: Fix region pitch inaccuracy - Region pitch of 64 (disabled) can be used to indicate packed contents - do not assume it is the actual pitch! - Also fixes interaction of AA factors with lockable_region size	2018-06-21 13:08:50 +03:00
VelocityRa	44449dd9e9	overlays: Refactoring - Use names for overlay command config and vertex data instead of std::pair. - Make a couple of compiled_resource constructors explicitly named functions.	2018-06-18 22:34:26 +03:00
kd-11	d77e62c94e	rsx: Improve GPU resource read prediction	2018-06-18 17:32:22 +03:00
Megamouse	a8f19fbfae	RSX: fix shader cache progress bar exit state shenanigans	2018-06-11 22:41:38 +03:00
Megamouse	4003aacc6a	RSX: add taskbar progress to native ui progress dialogs	2018-06-08 23:41:56 +03:00
kd-11	c9e367befd	rsx/debug: Fix rendering when FIFO reordering is disabled	2018-06-08 22:17:50 +03:00
kd-11	1b9c9267f0	rsx: Update memory flags after memory transfer	2018-06-08 22:17:50 +03:00
kd-11	fc18e17ba6	vk: Implement depth scaling using hardware blit/copy engines - Removes the old depth scaling using an overlay. It was never going to work properly due to per-pixel stencil writes being unavailable - TODO: Preserve stencil buffer during ARGB8->D32S8 shader conversion pass	2018-06-08 22:17:50 +03:00
kd-11	3150619320	rsx: Preserve read AA state separate from write AA state - Some applications (e.g Backbreaker) use an evil hack to resolve MSAA. The application respecifies a formerly AA region as a region with no AA then performs a framebuffer feedback lookup. The old memory keeps AA during read, but writes back to itself with AA resolved. This is evil on several levels but it just happens to work on PS3	2018-06-08 22:17:50 +03:00
kd-11	0f24379c0e	rsx: Obey MSAA resolve during memory persistence transfer - Ugh. This is a bandaid on a festering wound, AA badly needs a rewrite Also silence some warnings	2018-06-08 22:17:50 +03:00
Dravonic	400079a006	Parallel shader cache loading (#4677 ) * Parallel shader cache loading	2018-06-01 19:49:29 +03:00
kd-11	f543fb0243	vk/gl: Fix flush synchronization to be kinder to weaker CPUs but not harm higher end CPUs	2018-05-30 13:30:23 +03:00
kd-11	6362942928	rsx: Avoid semaphore acquire deadlock	2018-05-30 13:30:23 +03:00
VelocityRa	33b01d9306	overlays: Allow for non-interactable UI components * Also fix a few warnings in overlay_controls	2018-05-30 12:35:41 +03:00
kd-11	83f9be2524	rsx: Promote FIFO optimizations outside of strict mode - The benefits of FIFO optimizations are huge in some cases. The optimizations also do not break any tested applications so no need to disable with strict mode - A debug option is provided to disable this behaviour for testing	2018-05-29 13:54:30 +03:00
kd-11	2adb2ebb00	overlays: Avoid race condition on remove-on-update views - Improves cleanup code to consist of 2 parts, remove then dispose. Remove does not deallocate the item until dispose is called on it, allowing the backends to first deallocate external references. - Caller is responsible for managing list locking and tracking disposable list of items when external references have been cleaned up before using dispose method.	2018-05-29 13:54:30 +03:00
kd-11	0fc67aa2f6	gl: fix wcb regression - Partial framebuffers and blit targets are possible!	2018-05-24 10:36:04 +03:00
kd-11	b957eac6e8	rsx: Avoid calling any blocking callbacks from threads that are not rsx::thread - Defers on_notity_memory_unmapped to only run from within rsx context - Avoids passive_lock + writer_lock deadlock	2018-05-23 19:07:08 +03:00
kd-11	d2bf04796f	Optimized cached write-through - Allows grabbing an unsynchronized memory block if overwriting contents anyway - Allows flushing only specified range of memory	2018-05-23 19:07:08 +03:00
kd-11	fbf6581249	rsx: Fix segmented memory access for rsx::super_ptr	2018-05-23 19:07:08 +03:00
kd-11	f2a3167193	rsx: Lower format compatibility severity since it confuses some people	2018-05-23 19:07:08 +03:00
kd-11	8fcd5c1e5a	rsx: Texture cache fixes 1. rsx: Rework section synchronization using the new memory mirrors 2. rsx: Tweaks - Simplify peeking into the current rsx::thread instance. Use a simple rsx::get_current_renderer instead of asking fxm for the same - Fix global rsx super memory shm block management 3. rsx: Improve memory validation. test_framebuffer() and tag_framebuffer() are simplified due to mirror support 4. rsx: Only write back confirmed memory range to avoid overapproximation errors in blit engine 5. rsx: Explicitly mark clobbered flushable sections as dirty to have them removed 6. rsx: Cumulative fixes - Reimplement rsx::buffered_section management routines - blit engine subsections are not hit-tested against confirmed/committed memory range Not all applications are 'honest' about region bounds, making the real cpu range useless for blit ops	2018-05-23 19:07:08 +03:00
kd-11	c9669818eb	Facepalm - overlays: Do not free self handle!!!!	2018-05-21 15:55:25 +03:00
kd-11	f6f45b8699	Native UI refactored (#4623 ) Refactor and improve native overlays	2018-05-20 23:05:00 +03:00
scribam	04ad49de4d	typos	2018-05-14 21:14:39 +04:00
kd-11	1aa44ede31	gl: Improve AMD multidraw workaround - Reimplements the AMD workaround using an identity buffer to avoid the performance hit of doing multiple glDrawArrays for every single compiled set - Reimplements first/count allocation using a scratch buffer to reduce allocation overhead when large number of draw calls is used	2018-05-13 14:44:14 +03:00
kd-11	b7979d3f57	rsx/vk: Improvements and minor optimizations - Improve dirty state tracking affecting program state - vk: Refactor out transform constants upload into a separate channel to avoid if possible transform data uploads are quite expensive	2018-05-13 14:44:14 +03:00
kd-11	440a31ef18	rsx: Optimizations for program management	2018-05-13 14:44:14 +03:00
kd-11	a52ea7f870	rsx: Improve fragment and vertex program usage - Introduces a gpu program analyser step to examine shader contents before attempting compilation or cache search - Avoids detecting shader as being different because of unused textures having state changes - Adds better program size detection for vertex programs - Improved vertex program decompiler - Properly support CAL type instructions - Support jumping over instructions marked with a termination marker with BRA/CAL class opcodes - Fix SRC checks and abort - Fix CC register initialization - NOTE: Even unused SRC registers have to be valid (usually referencing in.POS)	2018-05-13 14:44:14 +03:00
kd-11	98b715d8c8	gl: Workaround for AMD driver bug	2018-04-25 19:14:36 +03:00
kd-11	ffa62918aa	gl: Improve pixel transfer code and notify on AMD driver bug - Readback does not work at all with float textures on AMD openGL Driver throws a bogus OUT_OF_MEMORY error regardless of amount of VRAM and system RAM available	2018-04-25 19:14:36 +03:00
kd-11	58035697d5	rsx: Restore component mapping override for depth textures	2018-04-25 19:14:36 +03:00
kd-11	91a6091d26	rsx: Minor fixes - vk: Clear dirty textures before copying 'old contents' in case the old data does not fill the new region - rsx: Properly decode border color - seems to be in BGRA format - vk: better approximation of border color to better choose between the presets - vk: Individually clear color images outside render pass and without scissor - vk: Fix renderpass selection for clear overlay pass - vk: Include scissor region when emulating clear mask NOTES: - vk: Completely avoid using vkClearXXXXimage - its 'broken' on nvidia drivers Spec is vague about the function so its not an actual bug ClearAttachment is clearly defined as bypassing bound state which works correctly - TODO: Implement memory sampling to simulate loading precleared memory if cell used memset to preinitialize the framebuffer Autoclear depth to 1\|255 and color to 0 is hacky!	2018-04-25 19:14:36 +03:00
kd-11	a42b00488d	rsx: Texture fixes - gl/vk: Fix subresource copy/blit - gl/vk: Fix default_component_map reading - vk: Reimplement cell readback path and improve software channel decoder - Properly name the subresource layout field - its in blocks not bytes! - Implement d24s8 upload from memory correctly - Do not ignore DEPTH_FLOAT textures - they are depth textures and abide by the depth compare rules - NOTE: Redirection of 16-bit textures is not implemented yet	2018-04-25 19:14:36 +03:00
kd-11	63d9cb37ec	rsx: Framebuffer fixes Primary: - Fix SET_SURFACE_CLEAR channel mask - it has been wrong for all these years! Layout is RGBA not ARGB/BGRA like other registers Other Fixes: - vk: Implement subchannel clears using overla pass - vk: Simplify and clean up state management - gl: Fix nullptr deref in case of failed subresource copy - vk/gl: Ignore float buffer clears as hardware seems to do	2018-04-25 19:14:36 +03:00
kd-11	9abbbb79ae	rsx: Blit engine fixes - Ignore unlocked blit sections [TODO] - Do not attempt blit on hw if bytesize is unsupported - gl: Implement typeless memory transfers Uses pbo to handle type-agnostic memory transfer	2018-04-25 19:14:36 +03:00
kd-11	bb5622401c	overlays/gl: minor fixes - fix ogl color map for overlay resources - fix label background for save dialog	2018-04-25 19:14:36 +03:00
kd-11	6d46ac1ad6	gl: Reimplement textures - Separate texture data from texture views	2018-04-25 19:14:36 +03:00
kd-11	c5cd758700	rsx: Workaround for G8B8 render targets - Mainly affected are colormasks and read swizzles NOTES: - Writes to G write to the second and fourth component (YW) - Writes to B write to first and third component (XZ) - This means the actual format layout is BGBG (RGBA) making RG mapping actually GR - Clear does not seem to have any intended effect on this format (TLOU)	2018-04-25 19:14:36 +03:00
Talkashie	64992f758d	Fix typos (#4410 ) * MASSIVE TYPO FIX part 1 * ANOTHER HUUUUGE TYPO FIX part 2 * thank you :hcorion: for all of your help. I could not have done this without you	2018-04-08 01:01:39 +01:00
kd-11	e291494282	rsx: Texture cache updates - Properly implement section gather for 3d and cubemaps Implements render-to-3d and fixes some corner cases for render-to-cubemap	2018-04-05 01:06:50 +03:00
Jake	6d6d6fa827	dx12/vk/gl: implement use of vertex_data_base_index when calculating index	2018-03-30 13:30:04 +03:00
pauls-gh	a17025c465	Strict Rendering Mode (SRM) fix. Move old surface copy before texture upload. Fixes the following issues on Tales of Vesperia which requires SRM. - Blacked out scene after the sleeping dog now renders correctly - Ghosting effect. The ghosting was most noticeable as a delay between the character rendering and the cell shading around the character. This appears to be gone with this change.	2018-03-29 11:01:58 +03:00
kd-11	887ea43e39	rsx: Fix some texture cache problems - gl/vk: Properly handle remapping temporary resources	2018-03-25 13:31:06 +03:00
kd-11	9fce5b0f7a	gl: Fix leaking occlusion queries - GL queries share the target binding (not asynchronous!) - Discard active queries by closing them, leave closed queries alone (nothing to be done for discard op)	2018-03-25 13:31:06 +03:00
kd-11	22af70d0d0	gl: Always use indexed blend caps to avoid conflict with the state cache. - glEnable/glDisable should not be used with GL_BLEND as the main renderer uses the indexed variant	2018-03-25 13:31:06 +03:00
kd-11	321c360dcb	rsx: Overhaul rendertarget sampling/shuffles - Reimplements render target views used for sampling - Optimizes access using an encoded control token - Adds proper encoding for 24-bit textures (DRGB8 -> ORGB/OBGR) - Adds proper encoding for ABGR textures (ABGR8 -> ARGB8) - Silence some compiler warnings as well - TODO: Real texture views for OGL current method is a hack	2018-03-25 13:31:06 +03:00
kd-11	9bb1ed78f9	gl: Implement video-out calibration for gamma and dynamic range - Seems to be of limited use but if it is determined to be useful, a vulkan implementation can be done	2018-03-25 13:31:06 +03:00
kd-11	9fc1740608	rsx/fp: Fragment program overhaul - Separate TXB from TXL: They are completely different! - Properly perform TMU emulation in the fragment shader. Implemens SRGB conversion and alphakill at the moment - Properly perform ROP emulation in the fragment shader. Implements FRAMEBUFFER_SRGB. While support on the chip looks to be incomplete (and wierd), it does work - Document some more bits in SHADER_CONTROL register	2018-03-25 13:31:06 +03:00
kd-11	9f416e5ce1	rsx/gl/vk: Obey channel remapping on framebuffer resources if requested	2018-03-25 13:31:06 +03:00
kd-11	5817f9fe3f	rsx: Texture format fixes - Implement SRGB (gamma corrected) textures (DXT1, DXT3, DXT5, RGBA8 only) - Fix channel map decode for XY data texture formats - Fix remap layout for X16 textures (verified with Mass Effect 3)	2018-03-25 13:31:06 +03:00
scribam	50446f7fef	Partial compilation fixes for osx	2018-03-24 11:14:40 +00:00
pauls-gh	fd8d2ecbf4	Remove Volume Texture Compression (VTC) tiling for Vulkan, DX12 and ATI (OpenGL).	2018-03-23 12:01:30 +03:00
Megamouse	9d961f620b	rsx/Qt: add option to disable the shader compilation hint	2018-03-22 16:33:37 +04:00
kd-11	92fb828d52	gl: Compat support for mesa drivers Needs CLIENT_STORAGE bit set for persistent buffers to make them useful	2018-03-20 00:11:41 +03:00
kd-11	d13584f858	rsx: fixups gl/vk: Bump shader cache version gl/vk: Disable anisotropic override when strict mode enabled as it is proven to alter some games negatively gl: Clamp buffer view range to not exceed the backing buffer size. Also add assert for the same condition	2018-03-19 12:13:34 +03:00
kd-11	ffe6c9ba5a	fix linux builds	2018-03-13 18:55:03 +03:00
kd-11	f00d9a7c7f	rssx" Halfplement alpha-to-coverage AA transparency	2018-03-13 18:55:03 +03:00
kd-11	2dce55d036	rsx: ZCULL synchronization fixes - Track asynchronous operations in RSX core - Add read barriers to force pending writes to finish. Fixes zcull delay flicker in all UE3 titles without forcing hard stall - Increase zcull latency as all writes should be synchronized now	2018-03-13 18:55:03 +03:00
kd-11	315798b1f4	rsx: ZCULL rewrite and other improvements - ZCULL unit emulation rewritten - ZCULL reports are now deferred avoiding pipeline stalls - Minor optimizations; replaced std::mutex with shared_mutex where contention is rare - Silence unnecessary error message - Small improvement to out of memory handling for vulkan and slightly bump vertex buffer heap	2018-03-13 18:55:03 +03:00
kd-11	a19ffba8e8	rsx: Simplify MRT blend setup; Enable separable MRT blend on vulkan and fix corner cases for GL	2018-03-13 18:55:03 +03:00
kd-11	e230867492	rsx: Properly implement raster window offsets	2018-03-13 18:55:03 +03:00
kd-11	84b8a08d26	rsx: Basic performance counters	2018-03-13 18:55:03 +03:00
kd-11	4804efc17d	rsx: Clear up confusion on depth writes. According to the NV_fragment_program spec, its not feasible to have 16-bit depth wries NOTE: NV_fragement_program precedes NV_fragment_program2 which is very close to what RSX consumes. It is hardware from that era afterall	2018-03-13 18:55:03 +03:00
kd-11	053ab585f4	gl/vk: Clean up some format casts - TODO: Byte ordering considerations on data casts	2018-03-13 18:55:03 +03:00
kd-11	20d4c09a1c	rsx/vk/gl: Enforce format matching for render target resources. Fall back to raw data copy if match fails - Forces Bitcast of texture data if input format cannot possibly be the same as the existing texture format - rsx: Other minor improvements to texture cache :- - remove obsolete blit engine incompatibility warning. The texture will be re-uploaded if it is indeed incompatible - Implement warn_once and err_once to avoid spamming the log with systemic errors - Track mispredicted flushes - Reswizzle bitcasted texture data to native layout TODO: Also needs reshuffle according to input remap vector	2018-03-13 18:55:03 +03:00
kd-11	87741141f1	rsx/vulkan: Add post-compilation key validation and dynamically determine attachment write maks based on decompiled shader - A new step is added between decompilation and pipeline object creation allowing for properties to be updated based on shader contents - Allos masking off attachment writes that are unmodified in the shader	2018-03-13 18:55:03 +03:00
kd-11	705820c430	rsx: Nvidia driver compatibility workarounds - Sanitize NaN values before they reach the driver. On nvidia (X * NaN = X)	2018-03-13 18:55:03 +03:00
kd-11	6b23e733d0	rsx/gl/vk: Improvements - gl: Do not call makeCurrent every flip - it is already called in set_current() - gl: Improve ring buffer behaviour; use sliding window to view buffers larger than maximum viewable hardware range NV hardware can only view 128M at a time - gl/vk: Bump transform constant heap size When lots of draw calls are issued, the heap is exhaused very fast (8k per draw) - gl: Remove CLIENT_STORAGE_BIT from ring buffers. Performance is marginally better without this flag (at least on windows)	2018-03-13 18:55:03 +03:00
kd-11	07cbf3da48	rsx/gl: Minor fixes - Identify depth textures reaching the gpu via shader_read upload path - Use correct timestamp counter for opengl - inline draw_state::test_property because msvc doesnt do it for us	2018-03-13 18:55:03 +03:00
kd-11	8ccaabb502	vulkan: Optimize vertex data upload - Reuse buffer views as much as possible, vkCreateBufferView is slow on NV Implemented as a large sliding window, reuseable until it is filled	2018-03-13 18:55:03 +03:00

... 4 5 6 7 8 ...

1125 commits