rpcsx

mirror of https://github.com/RPCSX/rpcsx.git synced 2026-03-18 03:04:50 +01:00

Author	SHA1	Message	Date
kd-11	ac4cadf538	vk: Fix word index counting for shuffle tasks	2020-01-15 15:42:36 +03:00
kd-11	175f78f5b3	vk: Lower default compute heap size to 64M - There is no need to guess and use a large memory footprint as the heap is now dynamic.	2020-01-15 15:42:36 +03:00
kd-11	3d96fe79cc	vk: Implement dynamic sized compute heap - Implements a dynamically sized compute heap to allow growing up the size if it is too small.	2020-01-15 15:42:36 +03:00
Eladash	1ccb3c4492	rsx: Verify local memory offset	2020-01-15 13:23:56 +03:00
kd-11	8bbda3dedb	vk: Restructure command queue flushing behavior to avoid deadlock - Queueing commands on the offloader is a good idea but unfortunately page faults can still happen causing a cyclic dependency and eventual deadlock. Characterized by a vk::wait_for_event timed out error accompanied by severe hitching. - Drain the fault-able commands before pushing a submit operation to the queue. If a fault is in progress, bypass the queue system and submit raw. Technically this is incorrect but there isn't much that can be done about it right now.	2020-01-14 14:32:40 +03:00
kd-11	db5d03c340	vk: Generate dynamic binding table based on the capability of the drivers - This alleviates constraints imposed on shaders to allow running on some not-so-great platforms.	2020-01-09 15:38:23 +03:00
kd-11	ef3b0db7d8	vk: Workaround for NVIDIA occlusion query failure - When using partial results on NVIDIA, a non-zero result is returned even when the draw is fully occluded. This, I believe, violates spec which says the partial result shall be between 0 and the final result.	2020-01-08 19:02:45 +03:00
kd-11	3f34a0196c	overlays/osk: Add linear fade-in/out effect to OSK	2020-01-07 21:31:19 +03:00
kd-11	ecf00be155	rsx: Add color interpolation animation - Adds color interpolation and modulation pass and refactors the code a bit. Elements with this pass applied have their color modulated by the animated color from the pass. Modulation transform is multiplicative.	2020-01-07 21:31:19 +03:00
Nick Renieris	5bace118a7	overlays: Redesign animation system (add easing functions, fix bugs) Instead of speed, direction and distance, the user now specifies start/end offsets and how much time the transition should take. Fixes: - Stuttering caused from framerate estimation. - An edge case where animations would go over their supposed limit. Adds: - The ability to specify arbitrary easing functions for the animations - Implemented quadratic ease in and ease out and cubic ease in/out. - Usage of cubic ease in/out in the trophy notification	2020-01-06 22:42:07 +03:00
Nick Renieris	28770c1580	overlays: Move vertex & vector utility classes to new file	2020-01-06 22:42:07 +03:00
Nick Renieris	192912131e	rsx: Update vblank count in LLE mode	2020-01-06 22:42:07 +03:00
Dravonic	94d2f97f27	Multithreaded shader compliation follow-up (#7190 ) * Multithreaded load pipeline entries shader compliation stage Co-authored-by: kd-11 <15904127+kd-11@users.noreply.github.com>	2020-01-06 21:59:59 +03:00
kd-11	7f09def94e	rsx/vp: Properly initialize output registers. - All registers tested on hw show contents to be 0, 0, 0, 1. Make default output registers match this pattern.	2020-01-05 18:06:08 +03:00
kd-11	bdb5115c7f	rsx/overlays: Improve space usage on trophy dialog - Slightly increases the size of the trophy dialog and the font size. The old dimensions did not work with some libre fonts causing alignment errors and other problems.	2020-01-04 16:36:49 +03:00
kd-11	3ada97d2d3	rsx/overlays: Implement trophy notification queue - Allows to display more than one trophy at a time. Trophy notifications will simply get queued up and displayed at appropriate time.	2020-01-04 16:36:49 +03:00
kd-11	31b07fece5	rsx/overlays: Add support for animations - Adds animation support. This commit adds the base framework and implements a translate animation used to slide elements around the screen. This is then used to implement the sliding animation for the trophy notification.	2020-01-03 20:33:32 +03:00
Megamouse	5e7d25ad35	overlays: refactor shader loading dialogs	2020-01-03 14:22:40 +01:00
Megamouse	d94d094a7e	overlays: fix non-interactive dialog loops	2020-01-03 14:22:40 +01:00
Megamouse	c9aee27d48	VK: remove unused init function declaration	2020-01-03 14:22:40 +01:00
kd-11	d12762414a	vk: Change default vertex output value - Prefer w!=0 to avoid a situation where xyz/w = nan. More of a theoretical problem, but some calculations break down in such a situation.	2020-01-03 10:35:53 +03:00
kd-11	7786681954	rsx: Improve MTRSX synchronization - Properly synchronize DMA transfers when handling RSX pipeline barriers. Texture read barrier is used to signify completion of DMA routines and is often used to signal that Cell can overwrite vertex data!	2020-01-03 10:35:53 +03:00
kd-11	c4e59b5115	vk: Clamp depth export in FS - PS3 matches OGL behavior where writing to the depth export register results in clamping.	2020-01-01 22:39:20 +03:00
Eladash	9690854e58	Some cleanup * Prefer default initializer over std::memset 0 when possible and more readable. * Use std::format in trophy files name obtaining. * Use vm::ptr<>::operator bool() instead of comparing vm::ptr to vm::null or using addr(). * Add a few std::memset calls in hle where it matters (or in some places just to document an actual firmware memcpy call).	2019-12-31 22:27:27 +03:00
kd-11	915cf0bae8	vk: Do not leak mapped memory	2019-12-31 13:56:14 +03:00
Megamouse	c4b4ce46b8	cellSaveData: don't pause apps during dialogs	2019-12-29 14:22:58 +01:00
kd-11	24cb48971e	vk: Fix cb chunk synchronization deadlock	2019-12-29 13:49:46 +03:00
kd-11	e1b734fd12	rsx: Fix linux build	2019-12-29 13:49:46 +03:00
kd-11	ed2bdb8e0c	rsx: Zcull synchronization tuning - Also fixes a bug where sync_hint would erroneously update the sync tag even for old lookups (e.g conditional render using older query)	2019-12-29 13:49:46 +03:00
kd-11	fdb638436f	rsx: Add toggle for zcull sync behaviour - Adds a relaxed sync mode where ZCULL reports are lazily nudged into flushing and the main core does not actually wait for the event to finish before proceeding - Can drastically improve performance in cases where the game actually does not utilize the report data	2019-12-29 13:49:46 +03:00
kd-11	9f94a6dc11	vk: Refactoring and optimizations to query handling - Caches query results when looking up report availability to avoid entering driver code twice. - Minor code restructuring	2019-12-29 13:49:46 +03:00
kd-11	55ad9244c0	vk: Switch occlusion pool to FIFO rather than LIFO to avoid hard stall	2019-12-29 13:49:46 +03:00
kd-11	cdd9c12132	vk: Emulate conditional rendering for AMD	2019-12-29 13:49:46 +03:00
kd-11	93895838c7	vk: Implement hw conditional rendering	2019-12-29 13:49:46 +03:00
kd-11	a51395370e	vk: Implement multithreaded command submission - A few nagging issues remain, specifically that partial command stream largely caused by poor synchronization structures for partial CS flush and also the fact that occlusion map entries wait on a command buffer and not an EID!	2019-12-29 13:49:46 +03:00
kd-11	5be7f08965	rsx: Restructure ZCULL report retirement - Prefer lazy retire model. Sync commands are sent out and the reports will be retired when they are available without forcing. - To make this work with conditional rendering, hardware support is required where the backend will automatically determine visibility by itself during rendering.	2019-12-29 13:49:46 +03:00
kd-11	8dfea032f2	rsx: Remove deprecated do_method path that has been superceded by c++ inheritance for many years	2019-12-29 13:49:46 +03:00
Megamouse	ef6f565dbd	silence some annoying warnings	2019-12-28 15:40:57 +01:00
Emmanuel Gil Peyrot	9b77febd10	RSX: Remove two empty cpp files	2019-12-23 00:02:57 +03:00
linkmauve	e9c5c6e6bf	Move input to its own directory (#7126 )	2019-12-22 17:39:42 +01:00
Eladash	db4041e079	Implement rounded_div Round-to-nearest integral based division, optimized for unsigned integral. Used in sceNpTrophyGetGameProgress. Do not allow signed values for aligned_div(), align().	2019-12-20 14:47:04 +03:00
Emmanuel Gil Peyrot	e30173a835	rsx: Make X11 optional on Linux This makes it possible to build rpcs3 on a pure Wayland system, without the Xlib installed.	2019-12-20 10:48:03 +00:00
Nekotekina	321f7e7197	Fix missing-braces warnings	2019-12-13 03:21:43 +03:00
kd-11	73236efe58	vk: Remove some outdated code (#7060 )	2019-12-12 16:29:55 +03:00
Eladash	6a926daee7	rsx: Delay FIFO recovery point creation if is in in_begin_end scope (#7080 )	2019-12-12 15:38:56 +03:00
Eladash	7260af032e	rsx: Ignore or recover from unknown primitives This also fixes a bug when recovering FIFO or creating such recovery point inside in_begin_end == true scope.	2019-12-11 00:11:12 +03:00
Nekotekina	835892aa51	C-style cast cleanup VII	2019-12-05 02:10:15 +03:00
Nekotekina	d2fd3c6bc4	Commit `377e7d2a73`	2019-12-04 21:32:08 +03:00
Nekotekina	377e7d2a73	C-style cast cleanup VI	2019-12-04 17:56:22 +03:00
scribam	2eaaf5b132	vk: Add sampleRateShading to the list of device enabled features	2019-12-04 12:59:38 +03:00
Nekotekina	185c067d5b	C-style cast cleanup V	2019-12-03 17:23:00 +03:00
Nekotekina	28eacc616a	C-style cast cleanup III	2019-12-01 00:32:44 +03:00
Nekotekina	5b9df53c13	C-style cast cleanup (partial) Replace C-style casts with C++ casts.	2019-11-29 00:35:23 +03:00
Megamouse	f2b530823b	overlays: add dynamic switch for perf overlay	2019-11-27 10:34:03 +01:00
kd-11	8ca53f9c84	rsx: Remember to min-max the anchor indices of a polygon or triangle fan	2019-11-24 19:01:57 +03:00
kd-11	429a76a140	rsx: Remove redundant check	2019-11-23 16:11:18 +03:00
kd-11	41e7d2aa0a	rsx: Select correct image aspect for blit engine targets.	2019-11-19 13:18:15 +03:00
kd-11	fd751e3e7b	rsx: Improve blit format mismatch detection	2019-11-19 13:18:15 +03:00
kd-11	41c3180276	rsx: Fix invalid format checks for DMA sections which are typeless	2019-11-19 13:18:15 +03:00
kd-11	9dab0575fa	rsx: Add missing format check for the RTV<->DSV transfer case - TODO: Rewrite resource handling routines	2019-11-18 13:17:00 +03:00
kd-11	4a0e1c79ed	rsx: Improve format validation for blit engine - Check all possible cases where format mismatch is possible. - Warn if a slow path is going to be taken. Should help with future optimizations.	2019-11-18 13:17:00 +03:00
kd-11	c415578e79	vk: Clamp buffer row length to never be less than declared width - Fixes some games with broken textures	2019-11-18 13:17:00 +03:00
kd-11	2408922806	rsx: Do not ignore clamping for some routines that do not have implied range	2019-11-18 13:17:00 +03:00
kd-11	c10aa360b1	rsx: Remove more deprecated methods	2019-11-18 13:17:00 +03:00
Megamouse	a17a5a76a0	overlays: avoid division by zero	2019-11-15 14:53:18 +01:00
Megamouse	fb96047d2f	overlays: add settings for overlay graphs	2019-11-15 14:53:18 +01:00
Megamouse	dd1707bd46	overlays: fix center options when graphs are shown	2019-11-15 14:53:18 +01:00
Megamouse	d6b0361a02	overlays: perf_metrics_overlay to seperate header this is done to prevent severe conflicts with upcoming changes	2019-11-15 14:53:18 +01:00
Anuskuss	7e31c30133	Intel iGPU needs workaround on Windows	2019-11-15 12:08:16 +03:00
Nick Renieris	cc59d319e1	overlay: Performance graphs	2019-11-12 20:43:09 +01:00
kd-11	8234bdb8f0	vk: Check for heap change events after a grow to avoid spec violations - Avoid referencing the old buffer in stale views. Status can be set globally if requested during heap creation.	2019-11-10 17:53:12 +03:00
kd-11	5968427a2f	vk: Initialize queries before use - The spec does not guarantee that queries are initialized. In fact, it now says all queries must be reset before they are used for the first time.	2019-11-10 17:53:12 +03:00
kd-11	8ea9bc9874	vk: Reduce memory allocation sizes of default heaps - The heaps will grow as desired, no need to overallocate to cater to the most resource-hungry games	2019-11-10 17:53:12 +03:00
kd-11	0a32d478df	vk: Enable auto-growing of the data heaps for the performance case	2019-11-10 17:53:12 +03:00
kd-11	357e0d2097	vk: Implement explicit runtime flags to manage events like heap sync	2019-11-10 17:53:12 +03:00
kd-11	f359342721	rsx: Implement mutable ring buffers with grow support	2019-11-10 17:53:12 +03:00
kd-11	5f39a594ac	rsx: Clean up some unused legacy methods unnecessary after d3d removal	2019-11-10 17:53:12 +03:00
Emmanuel Gil Peyrot	56f82d2701	rsx: Wrap gsl::span definition into Utilities/span.h	2019-11-09 20:00:50 +01:00
Emmanuel Gil Peyrot	f76720ceb0	Remove extraneous ::narrow<int>() calls GSL’s gsl::span didn’t use the correct type for its index_type, which is why they were needed.	2019-11-09 19:30:06 +01:00
Emmanuel Gil Peyrot	72cdf0b04c	Replace gsl::span’s implementation with tcbrindle’s This implementation optimises correctly on all relevant compilers, unlike GSL’s which gave extremely slow code on any compiler other than MSVC. Supersedes #6948.	2019-11-09 19:30:06 +01:00
Emmanuel Gil Peyrot	ef368c5171	rsx: Replace gsl::byte with C++17’s std::byte	2019-11-09 19:30:05 +01:00
kd-11	7072489a6e	rsx: Implement point sprite coordinate generation - When the point sprite flag is set, overrides the input similar to the 2D mask. The returned X and Y values are always the gl_PointCoord values for the fragment. - Stacks with the 2D mask to override the z and w coordinates.	2019-11-09 12:50:53 +03:00
kd-11	63673b1a9f	rsx: Implement full color remap for the D24S8->ARGB8 converter	2019-11-08 19:11:59 +03:00
kd-11	8d1505752f	rsx: Validate depth test setup to avoid address contention	2019-11-07 11:32:44 +03:00
kd-11	508ffcb775	vk: Compute kernel fixups - Adhere to workgroup count limits as exposed by the GPU vendor. They already execute properly even when going beyond the limits but this removes validation noise. - Fix invocation counts for deswizzle kernel. The count was incorrect if blocksize was not 4, causing a bunch of useless work to be done.	2019-11-05 22:07:22 +03:00
kd-11	99d71fdc2a	vk: Implement layer batching for the GPU swizzle decoder - Handles all LODs per layer meaning cubemaps are now fully handled in 6 passes instead of 6 * (log2(width)) passes. - Handles all LODs of a 3D texture in one pass as well. - The improvements do warrant dropping down the number of allowed compute invocations a bit	2019-11-05 22:07:22 +03:00
kd-11	7a0b94f343	vk: Minor compute optimizations - Remove use of uniform buffers for compute static data. Use push constants instead. - Minor touchups to the deswizzle code to avoid redundant data copies.	2019-11-05 22:07:22 +03:00
kd-11	1266b63135	vk: Enable gpu deswizzling	2019-11-05 22:07:22 +03:00
kd-11	9cd3530c98	rsx: Set up framework for hw deswizzle	2019-11-05 22:07:22 +03:00
kd-11	57d3c9e171	rsx: Take empty queries into account for engines that spam report reads. - Some games will spam the report queue with requests but have zpass statistics enabled.	2019-11-04 18:48:41 +03:00
kd-11	2a8f2c64d2	rsx: Implement report transfer deferring - Allow delaying report flushes triggered by image_in or buffer_notify - When the report is ready, all the delayed transfers will automatically be done. - TODO: Make this configurable?	2019-11-04 18:48:41 +03:00
kd-11	3e0f9dff4d	vk: Improve zcull synchronization - Use zcull sync hints more aggressively	2019-11-04 18:48:41 +03:00
kd-11	fe3c290d03	vk: Reimplement occlusion result reading - Implement partial result reads	2019-11-04 18:48:41 +03:00
kd-11	51e0eaaddc	rsx: Implement backend notification for upcoming zcull reads	2019-11-04 18:48:41 +03:00
kd-11	df63de8f16	rsx: Allow u32 restart index with full index width	2019-11-04 16:56:34 +03:00
kd-11	6b3af09fa5	vk: Improved crash message for missing MSAA features	2019-11-04 16:56:34 +03:00
kd-11	bbed791ee0	vk: Add explicit support for identity image views - Allows bypassing all remap shenanigans to make some operations that rely on the raw image to work correctly.	2019-11-01 19:35:46 +03:00
kd-11	63bbf11a76	vk: Add video out calibration pass - Adds gamma correction and RGB range filters to output to match PS3	2019-10-31 14:43:24 +03:00
kd-11	78aefe5b5e	rsx/overlays: Add support for other primitive types other than triangle_strips	2019-10-31 14:43:24 +03:00
Nekotekina	e3e7051ed3	Minor optimization in BufferUtils.cpp Don't use PSHUFB for horizontal operations. Utilize PHMINPOSUW to compute max as well: + sse41_hmin_epu16 + sse41_hmax_epu16	2019-10-30 18:52:34 +03:00
Nekotekina	b1968769b7	Minor cleanup in BufferUtils.cpp Replace inline asm with intrinsic using target attribute trick.	2019-10-30 17:53:51 +03:00
linkmauve	cfd5cf6bdb	Optimise primitive_restart::upload_untouched() (#6881 ) * rsx: Optimise primitive_restart::upload_untouched() with SSE4.1 This optimisation is only applied when skip_restart is false. I’ve only tested the u16 codepath, as it is the one used in NieR. In some very unscientific profiling, this function used to take 2.76% of the total frame time at the save point of the port town, it now takes about 0.40%. * rsx: Mark all SSE4.1 functions with attributes on gcc and clang This assures the compiler we will take care of only calling these functions after having checked that the CPU does support these instructions. * rsx: Add an AVX2 implementation of primitive restart ibo upload * rsx: Remove redefinition of SSE4.1 instructions Now that clang is aware that our functions are compiled with SSE4.1, it lets us generate this code using its intrinsics. * rsx: Optimise vector to scalar conversion This is done using minpos and srli intrinsics and generate less code than before. Thanks Nekotekina for the suggestion!	2019-10-30 16:42:44 +03:00
kd-11	35794dc3f2	vk: Add checks for alphaToOne support - This feature is very rarely used, as alphaToCoverage is commonly used as a replacement for blending, not in addition to it.	2019-10-30 01:06:28 +03:00
kd-11	eda09489b2	vk: Optionally ignore depth bounds testing on hardware that does not support it.	2019-10-29 20:03:54 +03:00
kd-11	7a5c20ef85	vk: Minor spec touchups - Simplify active instance management. While multicontext support will be required in future, this is better done with multiple logical devices rather than multiple instances. - Destroy the WSI surface on exit - Enable depthBoundsTest explicitly. TODO: Properly check for supported features.	2019-10-29 20:03:54 +03:00
kd-11	aa3eeaa417	rsx: Separate subresource_layout:dim_in_block and subresource_layout::dim_in_texel - These two are not always linked when working with compressed textures. The actual texels extend past the actual size of the image if the size is not aligned. e.g if height is 1, the real height is 4, but its not possible to determine this from the aligned size. It could be 1, 2, 3 or 4 for example. - Fixes image out-of-bounds writes when uploading from CPU	2019-10-29 20:03:54 +03:00
Eladash	42fc698186	rsx: Enable primitive restart index only when needed (#6889 ) * rsx: Enable primitive restart index only when needed * rsx: Use if with initializer in read_put()	2019-10-28 23:16:27 +03:00
kd-11	479d92d075	vk: Fix uninitialized (and wrong) variable access	2019-10-28 15:20:45 +03:00
kd-11	b0708367c2	vk: Round lod bias to the nearest 0.5 to lower number of permutations when nearest mipmap sampling is used - The lambda values will be rounded to the nearest integer anyway	2019-10-28 15:20:45 +03:00
kd-11	3e8dfede1c	vk: Modify sampler cache to uniquely identify all the input parameters - Avoids iteration when variable mipmap counts or lod bias parameters change	2019-10-28 15:20:45 +03:00
kd-11	ad2add9574	rsx:: Use fcmp correctly	2019-10-28 15:20:45 +03:00
kd-11	d04241ad25	rsx: Allow compressed textures to be unaligned in size - Align based on row length but let the texture itself be of arbitrary dimensions	2019-10-28 15:20:45 +03:00
Emmanuel Gil Peyrot	69e9ee26f6	rsx: Make input_is_swizzled a template parameter This lowers the relative cost of this function from ~2.25% to ~1.80% on gcc 9 which I found quite surprising, some of it probably gets inlined better in the callers, but I haven’t been able to isolate which parts.	2019-10-28 13:28:51 +03:00
kd-11	d53d7bb598	vk: Restore vega native use of FP16 in shaders - AMD proprietary drivers should work fine	2019-10-23 12:20:06 +03:00
Emmanuel Gil Peyrot	54d95373d0	Support fullscreen properly on Wayland The current behaviour when going fullscreen from windowed was to keep the previous size of the swapchain, with black borders on all sides, which looks quite ugly. The root of this issue is that rpcs3 only checks for frame resize if vkQueuePresent() returns VK_SUBOPTIMAL_KHR, which drivers can’t do on Wayland, see https://gitlab.freedesktop.org/mesa/mesa/issues/1979	2019-10-23 12:19:46 +03:00
kd-11	e04b6cd7c0	rsx: Copypasta fix - r1 is always float4 never half4. Its a full-width register unlike the other outputs which are optionally half-width.	2019-10-23 00:50:24 +03:00
kd-11	00bc3fe658	Drop d3d12 backend	2019-10-22 21:45:14 +03:00
Emmanuel Gil Peyrot	14c63ec014	Fix misleading indent.	2019-10-22 16:11:43 +03:00
Eladash	586fe11e22	Fix cellGcm HLE regression Also correct flags.	2019-10-22 13:45:09 +03:00
Eladash	945abcc6cd	rsx: Align down index array offset * Also use improved to_be_t<> template (recetly ignoring one byte long types) for vm gsl::byte referencing, remove redundent narrow<> cast (same type)	2019-10-22 13:45:09 +03:00
kd-11	3bb70e837a	vk: Silly copypasta	2019-10-22 13:44:49 +03:00
kd-11	0b2f9f0f17	rsx: Add support for delayed shader discard. - Noticed a glitch on AMD hw and windows drivers where discard seems to affect entire 4x4 cells. - Dead fragments (outside the primitive boundary) could have their discards trigger as they do not have proper access to variables. - This introduces dead fragments along triangle edges, causing a diagonal line pattern across the screen that is very annoying.	2019-10-22 13:44:49 +03:00
kd-11	901942f24a	rsx: Replace pointless f32[4] restriction on texture parameters. - Use a struct instead to improve readability and remove pointless OpBitCast	2019-10-22 13:44:49 +03:00
kd-11	f7842b765f	rsx: Implement packed format renormalization - Renormalizes arbitrary N-bit values as 8-bit normalized. - NV hardware performs integer normalization at 8 bits if the size is less than 8. - This can cause significant arithmetic drift because the error is multiplied by a huge number when sampling.	2019-10-22 13:44:49 +03:00
Eladash	29cddc30f0	rsx: Fix vblank signals flood after Emu.Resume()	2019-10-21 15:31:45 +03:00
Eladash	5de0005f5a	rsx: Report full method range on invalid methods Also report full command on fifo desync event for the first time	2019-10-21 15:31:45 +03:00
eladash	730e9cde84	sys_rsx: Improve allocations and error checks * allow sys_rsx_device_map to be called twice: in this case the DEVICE address retrived from the previous call returned * Add ENOMEM checks for sys_rsx_memory_allocate and sys_rsx_context_allocate * add EINVAL check for sys_rsx_context_allocate if memory handle is not found * Separate sys_rsx_device_map allocation from sys_rsx_context_allocate's * Implement sys_rsx_memory_free; used by cellGcmInit upon failure * Added context_id checks * Throw if sys_rsx_context_allocate was called twice.	2019-10-21 15:31:45 +03:00
kd-11	3c44065684	gl: Fix copypasta - MSAA is still unimplemented in OGL	2019-10-20 21:38:40 +03:00
kd-11	f40f2c6215	vk: Fix minification filter description for NEAREST_MIPMAP_NEAREST. Just a typo. - Also remove mipmap filter for CONVOLUTION	2019-10-20 21:38:40 +03:00
kd-11	09de3b7974	rsx: Tweak behaviour of the "Use GPU texture scaling" option - If either source data or dest is a render target, do image operations on the GPU same as before - If swizzle is desired, use CPU fallback - If no scaling and no format conversion is required, use CPU fallback - If scaling is desired and the transfer target is in local memory, use the GPU - When doing trivial copies, use the routine in rsx_methods instead of duplicating code. Also has the benefit of better range checking.	2019-10-20 21:38:40 +03:00
kd-11	868547aec8	rsx: Minor improvement to fbo region invalidation - When commiting a block as fbo, keep blit_dst data as well. - Avoids removing (and losing data from) blit targets that just happen to share a page with a framebuffer.	2019-10-20 21:38:40 +03:00
kd-11	996534c559	rsx: Fixup for aspect mismatch	2019-10-20 15:25:07 +03:00
Eladash	d4ba7f37b6	rsx util: Implement decode_fxp<>	2019-10-18 15:41:39 +03:00
kd-11	299b98b30a	vk: Disable mipmap sampling if sampling mode is does not have a mipmap filtering mode. - GL_LINEAR and GL_NEAREST always sample LOD0 so make vulkan behave the same way	2019-10-18 14:46:37 +03:00
kd-11	404073c74a	rsx: Force-align compressed formats to 4x4 texel blocks and disable 1D compressed textures. - The PS3 allows defining 1D compressed images but this obviously doesn't work well on desktop.	2019-10-18 14:46:37 +03:00
kd-11	eff4e95c99	rsx: Minor cache fixup for cyclic references. - Logic was broken by mipmaps PR. Do not issue a texture barrier if a temp copy is being done.	2019-10-18 14:46:37 +03:00
kd-11	bd1bcc6be7	vk: Remove a redundant memory barrier	2019-10-18 14:46:37 +03:00
kd-11	70642484cd	vk: Check for cyclic references if sampler is marked as do-not-cache. - Usually an indication of surface/texture cache interaction.	2019-10-18 14:46:37 +03:00
kd-11	eee2237e19	rsx: Track uncached cache resources - Uncacheable resources can be reused as soon as they're made visible to the draw call. - Since they're likely to be reused every draw call until the shader changes, it is important to reuse as much as possible	2019-10-18 14:46:37 +03:00
kd-11	decf9cfcf6	rsx: Notify the backend to release or delete temporary surfaces after we're done with them.	2019-10-18 14:46:37 +03:00
kd-11	97ed95d21b	vk: Add video memory manager to monitor VRAM usage	2019-10-18 14:46:37 +03:00
kd-11	1046184dd0	rsx: Fix some uninitialized variables flagged by valgrind	2019-10-18 00:32:38 +03:00
kd-11	5af8a9fbbc	rsx: Fix decoding of some fixed point texture parameters - Checked envydocs and found the correct format as fixed-point 4.8 with optional sign bit	2019-10-17 18:18:00 +03:00
kd-11	a936e43ff6	rsx: Fixup for slice gathering for structures with multiple mipmap levels - TODO: Proper multi-level assembly for non-2D structures	2019-10-17 18:18:00 +03:00
kd-11	e47b4ffb8f	rsx: Fix rsx capture crash. - Pixel coordinates are top-left not bottom-right - Solves out of bounds access	2019-10-17 18:18:00 +03:00
kd-11	e166dbccc8	rsx: Fix visibility of blit destination targets	2019-10-17 18:18:00 +03:00
kd-11	0c35595ce2	rsx: Remove the alpha-to-coverage hack that was added to hide the missing mipmaps in games - Moves to a purely stochastic function using dithering to simlulate coverage	2019-10-17 18:18:00 +03:00
kd-11	f0ed0285f3	rsx: Implement range-based subresource descriptor cache - The previous address-based approach was pretty awful when it comes to invalidating	2019-10-17 18:18:00 +03:00
kd-11	fbb9ed4e25	rsx: Add explicit range to cached subresource descriptors	2019-10-17 18:18:00 +03:00
kd-11	c9e3a321b2	rsx: Fixup for surface cache scanning - Fix regression when gathering cubemaps	2019-10-17 18:18:00 +03:00
kd-11	1ac976771c	rsx: Add some texture search options for the cache - Potentially optimizes texture cache searching using explicit options	2019-10-17 18:18:00 +03:00
kd-11	840b52fe80	rsx: Implement mipmap gathering from texture cache	2019-10-17 18:18:00 +03:00
kd-11	d6d8766f8d	rsx: Refactoring - Move some helper routines out of the cache core - Prep for multi-layered image search	2019-10-17 18:18:00 +03:00
kd-11	cb362b4085	rsx: Runtime check on RTT cast	2019-10-17 02:30:03 +03:00
kd-11	5c7bbb3354	vk: Fixup - Removes incorrect line writing stencil flags to a regular texture.	2019-10-17 02:30:03 +03:00
kd-11	d29b6cdb59	vk: Proper workaround for VEGA float16_t bugs	2019-10-16 22:40:50 +03:00
kd-11	a6e143254a	vk: Add workaround for broken format conversion in older GeForce cards	2019-10-16 22:40:50 +03:00
kd-11	4f088a102c	vk: Add kepler and maxwell tables	2019-10-16 22:40:50 +03:00
plappermaul	2171ffdab2	minor optimization for FIFO_control::read_put() (#6768 )	2019-10-14 21:26:31 +03:00
kd-11	42aa4c5000	gl: Vendor-specific tuning	2019-10-13 19:00:05 +03:00
kd-11	776fa54d22	gl: Fix missing case	2019-10-13 19:00:05 +03:00
kd-11	27f48fbc06	gl: Rewrite image transfer operations to support image subregions - Working exclusively with full sized images is very expensive	2019-10-13 19:00:05 +03:00
kd-11	d9a9766e41	gl: Refactoring and fallback support for compute acceleration	2019-10-13 19:00:05 +03:00
kd-11	b39bfa02a6	gl: Windows bringup	2019-10-13 19:00:05 +03:00
kd-11	105d4b51e6	gl: Use compute shaders for typeless texture decode	2019-10-13 19:00:05 +03:00
kd-11	7a6e2e716f	gl: Add a framework for compute shaders	2019-10-13 19:00:05 +03:00
Markus Stockhausen	4d99169d51	Patch v2 for vkCreateInstance() as requested	2019-10-11 21:16:36 +03:00
Markus Stockhausen	8adcb8046b	Patch for vkCreateInstance() patch as requested	2019-10-11 21:16:36 +03:00
Markus Stockhausen	f5817cb430	Error handling for vkCreateInstance() Cry in log if initialization failed.	2019-10-11 21:16:36 +03:00
Eladash	397007cf8b	rsx: Fix FIFO_DRAW_BARRIER substituation	2019-10-11 12:34:53 +03:00
Eladash	9242f16560	rsx: Improve FIFO recovery from flip	2019-10-10 19:34:23 +03:00
Eladash	06017cb14e	rsx: Recover from invalid writes to CELL_GCM_NV4097_SET_INDEX_ARRAY_DMA Also: Trigger a FIFO recovery when encountering an invalid method.	2019-10-10 19:34:23 +03:00
Eladash	2eaf5df60b	rsx: Register some more methods	2019-10-10 19:34:23 +03:00
kd-11	305a5bd717	typo fix	2019-10-05 12:01:46 +03:00
kd-11	4a19a2dd24	rsx: Explicity describe transfer regions for both source and destination blocks	2019-10-04 18:10:46 +03:00
kd-11	7aed9c3f13	gl: Add missing input declarations for 2-sided lighting	2019-09-30 21:52:43 +03:00
kd-11	88229f4716	gl: Remember to unbind attachments from active framebuffer after clear - If a stale reference is left lying around (e.g the texture bound to depth has been deleted and we attach a color image) no operations actually take place. glCheckFramebufferStatus also does not catch this problem.	2019-09-30 21:52:43 +03:00
Eladash	0b2fa6ffdc	rsx: Flush FIFO GET before smeaphore_acquire	2019-09-30 17:30:15 +03:00
Eladash	70b4ae6bd6	rsx: Optimize FIFO PUT masking	2019-09-30 17:30:15 +03:00
kd-11	bcf8799079	rsx: Fix missing point size export - Sometimes program-point-size is enabled, but the vs does not actually write to the point size register. In this case, pass the incoming point size along instead of the default register init.	2019-09-30 01:40:04 +03:00
Eladash	319fc8c55d	rsx: Mask FIFO PUT on rsx execution	2019-09-29 13:05:24 +03:00
Eladash	822287b418	rsx: Avoid unsigned/signed mismatch with fifo ret addr	2019-09-29 13:05:24 +03:00
kd-11	8cfd3b56d6	vk: Increase wait timeout in case of problematic GPU loads causing heavy stutter - When compiling LLVM objects, it is possible to starve the driver thread and cause the timeouts to trigger - Observed in RE6 when using SPU LLVM since the game generates a very large number of objects "infinitely"	2019-09-29 11:39:22 +03:00
kd-11	ef5b56bc48	rsx: Align width properly when normalizing to avoid fractional results being lowered to 0	2019-09-29 11:39:22 +03:00
kd-11	69c090b14a	vk: Check frame descriptors before rendering in case of a flip request between begin() and end() - There is no reason to delay async flip requests since most of the work can be handled during rendering anyway	2019-09-29 11:39:22 +03:00
kd-11	1464069476	rsx: Restructure deferred flip queue handling - Allows frameskipping to occur naturally if RSX thread is bombarded with flip requests but just jumping to the last one if possible - See request_emu_flip() for async frame submission and implicit skipping - Also allows display queue to fill faster than the flip thread can drain the queue	2019-09-28 21:13:56 +03:00
Nekotekina	bd1a24b894	Tidy endianness support (se_t) implementation Move se_t and se_storage to util/endian.hpp Use single template instead of two specializations. Add minor optimization for MSVC. Remove v128 dependency. Try to enable intrinsics for unaligned data. Fix minor bug in u16/u32/u64 specializations.	2019-09-28 15:39:50 +03:00
kd-11	2275259bf5	rsx: Properly scale overlay passes to match drawable area	2019-09-28 13:24:14 +03:00
kd-11	28534e8833	gl: Remove a debug print	2019-09-28 13:24:14 +03:00
kd-11	e53e98749f	rsx: Add missing initialization	2019-09-27 21:07:56 +03:00
Nekotekina	3c72069ae6	cellOskDialog: use g_fxo	2019-09-26 23:26:36 +03:00
Nekotekina	5f9c5e8765	Use g_fxo for rsx::thread	2019-09-26 23:26:36 +03:00
kd-11	ee0633f43a	vk: Add turing workaround - Turing crashes if using the depth->color transfer hack	2019-09-26 20:12:25 +03:00
kd-11	acc986be3f	vk: Add chip family detection	2019-09-26 20:12:25 +03:00
Jan Beich	5ec35c7daa	rsx: unbreak build with Clang 9 ld: error: rpcs3/CMakeFiles/rpcs3.dir/main_application.cpp.o: unable to find library from dependent library specifier: opengl32.lib ld: error: rpcs3/Emu/librpcs3_emu.a(GLGSRender.cpp.o): unable to find library from dependent library specifier: opengl32.lib ld: error: rpcs3/Emu/librpcs3_emu.a(GLRenderTargets.cpp.o): unable to find library from dependent library specifier: opengl32.lib ld: error: rpcs3/Emu/librpcs3_emu.a(GLVertexBuffers.cpp.o): unable to find library from dependent library specifier: opengl32.lib	2019-09-24 01:00:45 +03:00
Megamouse	7193d407b9	Input: Remove unused flush member	2019-09-20 22:12:40 +02:00
kd-11	1a892c6b1b	rsx: Avoid recursion in flip handler	2019-09-20 15:08:41 +03:00
Megamouse	aa7eb1536a	overlays: fix enter button assignment in osk	2019-09-20 10:53:09 +02:00
kd-11	e0005ec347	rsx: Refactoring and improvement - Separate displayed statistics from actual backend statistics. Allows asynchronous flipping to work correctly as it just uses display stats. The real stats are used by the frame scope marker to determine behavior like engaging the FIFO optimizer or skipping draw calls correctly.	2019-09-19 23:10:09 +03:00
kd-11	2c76f47eec	rsx: Restructure flip code and frame scoping - Add an explicit frame scope marker tied in with the queue_prepare command Since queue_prepare is emitted at the end of a frame, it can be used as end-of-frame in games that emit this - If this command is not emitted, fifo flatenner and frameskip will not work	2019-09-19 23:10:09 +03:00
Nekotekina	a4951ec407	Use g_fxo for global lv2_memory_container	2019-09-18 21:24:04 +03:00
kd-11	bd4d86f87a	vk: Properly test MSAA sample mask when switching between states inside a RSX renderpass. - Before, these changes would be lost if the same RTT config was used with varying mask setups	2019-09-18 15:42:59 +03:00
kd-11	c59cb1bdd3	rsx: Allow only sse4.1 capable CPUs to take the accelerated index path - Older sets lack the required min/max functionality	2019-09-13 12:28:52 +03:00
kd-11	52e8747b83	rsx: Workaround for exit deadlock - Avoids games locking up when the stop button is pressed	2019-09-12 23:32:21 +03:00
kd-11	cc313b052f	rsx: Improve hit testing when scanning for overlapping surfaces - Calculate exact sizes when doing hit tests to avoid false negatives - Defer page checking until actually require to do memory setup - Introduce align2 helper to do non-pow2 alignments	2019-09-12 23:32:21 +03:00
kd-11	9842823a8c	rsx: Check if memory actually exists when overallocating blit targets	2019-09-12 23:32:21 +03:00
kd-11	cd1345b6bb	rsx: Do not use nul section if resolution scaling is active on a surface	2019-09-12 23:32:21 +03:00
kd-11	858014b718	rsx: Experiments with nul sink	2019-09-12 23:32:21 +03:00
kd-11	212ac19c11	vk: Reimplement DMA synchronization	2019-09-12 23:32:21 +03:00
kd-11	f06559412e	vk: RDB fixup	2019-09-12 23:32:21 +03:00
kd-11	7fdb4976d8	rsx: Remove log spam for cond render	2019-09-12 14:08:21 +03:00
kd-11	60845daf45	rsx: Improve use of CPU vector extensions - Allow use of intrinsics when SSSE3 and SSSE4.1 are not available in the build target environment - Properly separate SSE4.1 code from SSSE3 code for some older proceessors without SSE4.1	2019-09-12 14:08:21 +03:00
kd-11	27af75fe71	rsx: Fixup for blit engine when moving inverted regions - Properly calculate overlap range when sections are inverted - Simplify transfer logic for inverted regions	2019-09-11 23:30:55 +03:00
kd-11	412c620b9d	rsx: Allow sampling from shader_read resources for blit engine - With harmonization between all texture types implemented, there is no difference between blit_engine_src and shader_read for supported formats - Adds extra format filtering to ensure no conflicts when copying data	2019-09-10 16:54:02 +03:00
kd-11	75fcfac00e	rsx: Modify find_cached_texture to respect gcm_format. Can pass 0 for "dont care"	2019-09-10 16:54:02 +03:00
kd-11	d1603fbb0b	vk: Crop malformed image descriptors - Some image descriptors (lle vdec?) are malformed with pitch being smaller than width - Crop these for now pending hardware tests	2019-09-08 18:22:27 +03:00
kd-11	f53361b966	rsx: Fix fast texture copy when src_pitch != width * block_size - Happens on mipmapped linear images	2019-09-08 18:22:27 +03:00
kd-11	0af9685381	rsx: Deprecate surface_transform::argb_to_bgra which is no longer required. - vulkan now uses native swizzle mapping for both surface and texture	2019-09-08 13:56:41 +03:00
kd-11	312bf6840e	vk: Fix surface_transform::argb_to_bgra transfers when no scaling is requested	2019-09-08 13:56:41 +03:00
kd-11	cbce309199	vk: Fix depth_stencil scaling	2019-09-08 13:56:41 +03:00
kd-11	48a5cd545f	gl: Do not byteswap uint24_8 as it needs a custom 8_24 decoder	2019-09-08 13:56:41 +03:00
kd-11	440d58f2ff	vk: Batch compute jobs when doing texture upload - Reduces overall number of invocations	2019-09-07 16:23:20 +03:00
kd-11	6aa0b49dbc	vk: Prefer using native alignment when uploading. - Allows using fast copy paths and reduces memory and compute footprint	2019-09-07 16:23:20 +03:00
kd-11	a3a0cb8c17	rsx: Minor texture optimizations	2019-09-07 16:23:20 +03:00
kd-11	efa501dac6	rsx/vp: Set default inputs to (0, 0, 0, 1) - From some hw tests, it seems this is the default.	2019-09-06 17:08:28 +03:00
kd-11	f8dbe281a5	glsl: Explicitly declare const inputs as such - Avoids copying the values to temp variables before invoking function calls - Generates shorter, cleaner AST and SPV bytecode	2019-09-06 17:08:28 +03:00
kd-11	14aa3b3360	vk: Remember to allocate enough vertex layout storage objects! - vertex_layout_storage descriptors were added but the descriptor count was not updated	2019-09-05 19:43:39 +03:00
kd-11	360c0e9af6	vk: Restructure commandbuffer scoping to allow faults in vertex upload - Defer renderpass open to allow recovery after fault in the middle of vertex upload	2019-09-05 19:43:39 +03:00
kd-11	9dc06cef7f	rsx: Do not include ro data when attempting to do section merge - Avoids crazy situations like trying to merge from a 3d or cubemap in memory	2019-09-02 16:49:04 +03:00
kd-11	e99e8460fe	rsx/texture_cache_utils: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	27fabd7607	rsx/ring_buffer: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	33609717f8	rsx/cache: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	f8617500b5	rsx/methods: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	7f7b499303	rsx/util: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	0158a88c88	rsx/textures: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	401bd9112a	rsx/prog: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	652f18ebaa	rsx/buffers: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	6504daa713	overlays: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	94656ac1e3	rsx/vp: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	0ee9d7b46d	rsx/fp: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	756fdedbf6	vk: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	a7b9ff33d8	gl: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	7f99de36c1	rsx: Fixup for surface_target_a flag being broken - While the mask for surface_a is at index 0, the surface cache expects the order to be maintained correctly! Set the correct mask since surface store now checks each RTT individually	2019-08-30 21:46:19 +03:00
kd-11	61af2b7dfc	vk: Workgroup tuning for different vendors	2019-08-30 21:46:19 +03:00
kd-11	99fb6d6a5d	rsx: Allow GPU-accelerated stream manipulation when doing texture uploads	2019-08-30 21:46:19 +03:00
kd-11	e0a7912d7c	rsx: Check for stencil writes when determining zeta_write flag	2019-08-30 21:45:41 +03:00
kd-11	04c808b8ab	rsx: Fixup for MRT color write lookup and surface_target_a	2019-08-28 16:12:10 +03:00
kd-11	e55d216619	rsx: Workarounds for some buggy games - Replace assert with log message until hardware testing confirms findings	2019-08-28 14:54:51 +03:00
kd-11	e334a43169	rsx: Fix surface cache hit tests - Avoid silly broken tests due to queue_tag being called before pitch is initialized. - Return actual memory range covered and exclude trailing padding. - Coordinates in src are to be calculated with src_pitch, not required_pitch.	2019-08-28 14:54:51 +03:00
JohnHolmesII	cca6a19cdd	Fix improper string concatenation in rsx_decode.	2019-08-28 01:26:14 +03:00
kd-11	2962e05f26	rsx: Implement per-RTT color masks - Also refactors and simplifies some common code in surface store and rsx core	2019-08-27 21:59:02 +03:00
kd-11	27aeaf66bc	gl: Restructure buffer objects to give more control over usage - This allows creating buffers with no MAP bits set which should ensure they are created for VRAM usage only - TODO: Implement compute kernels to avoid software fallback mode for pack/unpack operations	2019-08-27 21:59:02 +03:00
Nekotekina	dd79a5efb7	Remove fxm::make_always	2019-08-27 03:50:15 +03:00
Nekotekina	d2eba2387b	Use g_fxo for display_manager	2019-08-27 03:50:15 +03:00
Nekotekina	928719b658	Use g_fxo for rsx::avconf	2019-08-27 03:50:15 +03:00
Nekotekina	38a06c4b14	Use g_fxo for SysRsxConfig Rename to lv2_rsx_config	2019-08-27 03:50:15 +03:00
kd-11	3317e13b64	rsx: Hotfix for semaphore timeout bug - Add pending flip requests as a reason to invoke the RSX local task handler and release the vblank semaphore	2019-08-26 22:33:29 +03:00
Megamouse	32b5b11a83	cellSaveData/overlays: prevent possible array out of bounds in list view	2019-08-26 21:09:20 +02:00
kd-11	eed32cf3a4	rsx: Decompiler fixups and improvements - Fix 2D coordinate sampling of W coordinate. W is actually HPOS.w and not 1. Z is however always 0. - Optimize register usage a bit Disassembling compiled SPV shows that global declaration results in less ops than using inout modifiers. Modifiers generate extra mov instructions.	2019-08-26 20:03:31 +03:00
kd-11	3e28e4b1e0	rsx/decompiler: Restructure program register behavior - Fix reading of varying registers in FP Different registers have different behavior - Always write to varying registers. If a register is not written to, it is initialized to (0, 0, 0, 1) - Reimplements two-sided lighting correctly without hacks - Also bumps shader cache version	2019-08-26 20:03:31 +03:00
kd-11	fe6ff8622a	rsx: Decompiler fixups for conditional execution - Cond actually obeys vector mask	2019-08-26 20:03:31 +03:00
kd-11	f9aea076ae	rsx: Implement depth_buffer_float support. - Since this is transparent to the application at all time, it only becomes a problem when doing memory transfer or DEPTH->RGBA conversion in shaders.	2019-08-26 20:03:31 +03:00
kd-11	9d981de96d	rsx: Fix offloader deadlock - Do not allow offloader to handle its own faults. Serialize them on RSX instead. This approach introduces a GPU race condition that should be avoided with improved synchronization. - TODO: Use proper GPU-side synchronization to avoid this situation	2019-08-25 22:09:20 +03:00
Megamouse	896cfd2ade	cellSaveData/cellMsgDialog: implement cellSaveDataDelete	2019-08-22 08:05:12 +02:00
Megamouse	2d29a33ea8	cellSaveData/overlays: initialize with focused entry	2019-08-22 08:05:12 +02:00
Megamouse	b3c1759853	cellSaveData/overlays/Qt: fix some warnings and a possible nullptr deref	2019-08-22 08:05:12 +02:00
kd-11	7c5bde4aeb	rsx: Update tag timestamp to match newest inherited data - Avoids memory appearing older when used for depth test without depth write The write_barrier before the call will inherit new data but the tag will not update as no new information is added.	2019-08-21 21:17:15 +03:00
kd-11	c67c97844e	rsx: Fixup for blit engine range calculations	2019-08-21 21:17:15 +03:00
kd-11	5d1b7eb945	rsx: Fix reference leaks in texture_cache<->surface_cache communication - Properly commit orphaned blocks not invalidating existing cache structures - Do not ignore overwritten objects when commiting as unprotected fbo. Avoids stale references to invalidated surface objects.	2019-08-21 21:17:15 +03:00
kd-11	ca8b0da141	gl: Invalidate range before reading to prevent deadlock	2019-08-21 21:17:15 +03:00
kd-11	141072023b	rsx: Fix handling of ARGB8 memory - Load into memory as straightforward BGRA - Fixes a bug in vulkan caused by byte shuffling in blit engine vs shader access - Removes the need for memory shuffling when transferring into a rendertarget	2019-08-21 21:17:15 +03:00
kd-11	9cd5325962	rsx: Free memory 'held hostage' by storage sections in the surface cache - Once the memory has been captured by another surface, release the allocation	2019-08-21 21:17:15 +03:00
kd-11	be98554b40	rsx: Fix surface split logic - Calculations are supposed to be done based on the properties of the outgoing surface	2019-08-21 21:17:15 +03:00
kd-11	67dac94704	rsx/fp: Zero-initialize FragDepth register to match hw	2019-08-21 21:17:15 +03:00
kd-11	35e61c77e0	gl: Fixup for D24S8 readback	2019-08-21 21:17:15 +03:00
kd-11	dca29def5e	rsx: Temporary workaround for race condition in blit engine	2019-08-18 20:45:48 +03:00
kd-11	5e299111cc	rsx/vk: Restructure surface access barriers and implement RCB/RDB - Implements render target data load (aka Read Color Buffer/Read Depth Buffer) - Refactors vulkan surface barrier to be much cleaner. - Removes redundant surface barrier invocations after doing a merged load from surface cache. - Adds explicit access modes when gathering surfaces from cache.	2019-08-18 20:45:48 +03:00
kd-11	dfe709d464	rsx: Surface cache restructuring - Further improve aliased data preservation by unconditionally scanning. Its is possible for cache aliasing to occur when doing memory split. - Also sets up for RCB/RDB implementation	2019-08-18 20:45:48 +03:00
Eladash	500a4fa2fb	rsx: Fix potential out of range methods execution (can result in segfaults)	2019-08-17 17:26:04 +01:00
Pierre-Loup A. Griffais	56011cbddd	vk: don't die on VK_SUBOPTIMAL_KHR in AcquireNextImage, and recreate swapchain vkAcquireNextImageKHR can also return VK_SUBOPTIMAL_KHR and is non-fatal. However, it's a good idea to still recreate the swap chain later to maintain optimal presentation paths after temporary occlusion.	2019-08-16 20:09:37 +03:00
kd-11	a0f0c418d7	gl:Implement proper support for packed 16-bit rendertargets - Also some minor refactoring	2019-08-15 14:00:17 +03:00
kd-11	7f85b18b46	gl: Add support for 4444 typeless texture	2019-08-15 14:00:17 +03:00
Megamouse	8debdfcd09	handle empty callback returns	2019-08-14 23:54:09 +02:00
RipleyTom	87bf0386c4	Screenshot function	2019-08-14 19:24:42 +02:00
Eladash	7fda07eb5b	rsx: UB fix (signed vs unsigned mismatch)	2019-08-13 20:48:50 +01:00
Eladash	519fe9309e	rsx: Fix nv0039::buffer_notify	2019-08-13 20:48:50 +01:00
Eladash	527b1bb071	rsx: Fix overlapping transfer of nv3089::image_in when out_pitch != in_pitch or out_pitch != out_bpp * out_w	2019-08-13 20:48:50 +01:00
kd-11	8866a3d6a9	rsx: Cleanup for blit engine fixes	2019-08-10 16:45:02 +01:00
kd-11	033836d88c	rsx: Minor fixup for nv3089::image_in - Typo scale_x->scale_y - Remove convoluted temp buffer creation and just use vector instead	2019-08-08 15:48:22 +03:00
kd-11	f0bd0b5a7c	rsx: Conditional render sync optimization - ZCULL queue was updated to one-per-cb but the conditional render sync hint was not updated. - Do not unconditionally flush the queue unless the upcoming ref is contained in the active CB. - This avoids spamming queue flush, which frees up resources and improves performance	2019-07-30 21:13:42 +03:00
Malcolm Jestadt	d689a6e47b	vk: Don't warn RADV users on LLVM 8.0.1 - The 'back screen' issue on RADV was resolved with LLVM 8.0.1	2019-07-30 19:56:05 +03:00
Nekotekina	f63e89f9b4	Implement waitable atomics Moved Atomic.h to util/atomic.hpp List source files in CMakeLists.txt	2019-07-29 03:04:55 +03:00
Nekotekina	ec2db8edbc	Correct get_int_t to get_uint_t. Add get_sint_t.	2019-07-29 00:12:07 +03:00
kd-11	1de90bdb1f	rsx: Improve aliased data preservation - Carve out inherited region if any - Perform pitch compatibility test before assigning old_surface	2019-07-27 16:09:21 +03:00
Eladash	230c3d55b6	Fixup	2019-07-27 04:03:29 +01:00
Eladash	fcc75c8b0f	rsx: Write atomically semaphore updates and fix zcull timestamp	2019-07-26 21:27:55 +03:00
Eladash	c53f0dd7b5	rsx: Fix gcm unmap events	2019-07-26 21:27:55 +03:00
Megamouse	71c56b719c	Emu/overlays: fix background picture path	2019-07-25 08:53:07 +02:00
Eladash	85b1152e29	Timers scaling and fixes	2019-07-23 00:09:01 +01:00
kd-11	9a7c2784f0	rsx: Do not clip scissor to viewport when doing buffer clear	2019-07-20 16:39:32 +03:00
kd-11	e2574ff100	rsx: Support CSAA transparency without multiple rasterization samples enabled	2019-07-19 15:49:08 +03:00
kd-11	b5a2f0df68	rsx: Implement separate viewport raster clipping - Merge viewport raster window and scissor into one clipping region - Viewport raster clip is different from viewport geometry clipping in hardware as the latter is configurable separately	2019-07-19 14:21:19 +03:00
scribam	a268415121	vk: Use macros from Vulkan SDK	2019-07-17 17:56:29 +03:00
kd-11	ea2f4d57fa	rsx: Fixups	2019-07-17 13:29:42 +03:00
kd-11	113a49e00c	rsx: Handle cyclic references when doing memory inheritance	2019-07-17 13:29:42 +03:00
kd-11	34b06453f9	rsx: Handle lost data due to unused data sections - After splitting, the sections may not be referenced at all for anything other than just pixel storage - In such cases, either merge down or sample from the upstream source instead	2019-07-17 13:29:42 +03:00
kd-11	998717659f	rsx: Fix reference leak when cloning surfaces	2019-07-17 13:29:42 +03:00
kd-11	009e01a347	rsx: Set up for multi-section inheritance	2019-07-17 13:29:42 +03:00
Malcolm Jestadt	94af3b3f03	vk: Fix Linux Vega float16_t workaround - It was disabling float16_t for non Vega cards	2019-07-12 12:25:46 +03:00
Eladash	17c8ac9ab8	rsx: Debugger output text fix	2019-07-12 00:19:56 +03:00
Eladash	c4d8ef4340	rsx: Allow to configure vblank rate Removed "HLE protection" hack from sys_rsx_context_attribute	2019-07-12 00:19:56 +03:00
Silent	f3551cedb7	rsx: Swap R and B channels in SET_BLEND_COLOR since this color is BGRA, not RGBA	2019-07-11 22:51:01 +03:00
kd-11	2898309f68	vk: Silence some debug prints - This message confuses some users	2019-07-11 13:22:13 +03:00
kd-11	fc09572648	rsx: Implement texel border decode - Texel borders are no longer actually supported in modern APIs - Removes the border texels and uses border color instead which is incorrect but should work fine	2019-07-11 13:22:13 +03:00
kd-11	d8f753f1e8	rsx: Do not allow framebuffer surfaces that exceed their allocated pitch dimensions - Truncate surfaces to forcefully fit inside the declared region	2019-07-11 13:22:13 +03:00
Eladash	78e447e28c	rsx: Typo fix	2019-07-09 22:47:55 +03:00
kd-11	2548057ea0	vk: Improve AMD driver support - Workaround broken fp16 in AMDVLK/RADV - Do not disable primitive restart as the issue seems to have been fixed	2019-07-09 16:27:59 +03:00
kd-11	956270d9be	gl: Add readback/writeback config for format GL_R16	2019-07-09 16:27:59 +03:00
kd-11	c072c511a1	rsx: Add support for slice padding rows when gathering slices for cubemap/3d	2019-07-09 16:27:59 +03:00
kd-11	9ca6546dec	vk: When reusing resources, make sure to reinitialize the component layout	2019-07-09 16:27:59 +03:00
kd-11	0cc672dcb3	vk: "Improve" initialization hack - Change default alpha to 1 from 0 - TODO: Implement memory tagging for synchronizing this	2019-07-09 16:27:59 +03:00
kd-11	219a5382f7	rsx: If no array streams are enabled, mark inline array as disabled (null render)	2019-07-09 16:27:59 +03:00
kd-11	7840cd914e	rsx: Fixup nv3089::image_in - Correct pitch when sourcing from temp block - Remove obsolete? double transfer that also introduced a stale pointer reference to freed memory	2019-07-09 16:27:59 +03:00
kd-11	c47f4fd59e	vk: Fix frame skipping	2019-07-09 16:27:59 +03:00
Nekotekina	b9130dd663	Remove redundant const on return value in rsx_methods.h	2019-07-09 12:09:21 +03:00
Eladash	d57b4dc8f3	rsx: Refactor rsx_decode.h and bugfixes	2019-07-09 11:52:34 +03:00
kd-11	50736263d2	gl: Fix native pitch computation	2019-07-08 18:04:56 +03:00
Eladash	6d65d3424f	rsx: Clamp fragment shaders address	2019-07-06 20:58:18 +03:00
kd-11	ad10eb391e	vk: Reuse discarded memory whenever possible instead of recreating new objects - Memory allocations are surprisingly expensive when spammed	2019-07-03 15:52:16 +03:00
kd-11	71e809a78b	rsx: Implement dma abort in case of a reset after misprediction	2019-07-03 15:52:16 +03:00
kd-11	0f11939faf	vk: Refactor gc	2019-07-03 15:52:16 +03:00
kd-11	ae93b417ec	vk: Handle emergency commandbuffer close with dangling queries - TODO: Refactoring	2019-07-03 15:52:16 +03:00
kd-11	d69e8288ad	vk: Restructure commandbuffer submission into tagged event IDs - Tagged eventIDs can be used to safely delete resources that are no longer used - TODO: Expand gc to collect images as well - TODO: Fix the texture cache to avoid over-allocating image resources	2019-07-03 15:52:16 +03:00
kd-11	ce04a797c3	vk: Fix event signal race when speculation fails to avoid a cache miss - TODO: Proper GC for stale events	2019-07-03 15:52:16 +03:00
Malcolm Jestadt	b5d5113803	gl: Workaround slow PBO usage with Mesa -Mesa is currently fastest with GL_STREAM_COPY -See `a338dc0186` -Also see https://bugs.freedesktop.org/show_bug.cgi?id=111043	2019-07-03 11:28:29 +03:00
msuih	690cdff0d3	Minor fixes - Fix a typo in OpenAL - Fix typo in cellHttp.h - Unused variables in catch - Use 64-bit shifts - Use use_count with shared pointers, unique is depracated and getting removed - Explicitly cast boolean to int - Signed/unsigned issues with loop variables - Fix missing return statement (the code path is unreachable, but compiler wants a return) - */ ouside of comment - Fix duplicate layout name	2019-07-01 04:33:23 +03:00
msuih	d57124d075	Explicitly cast size_t to integer types	2019-07-01 04:33:23 +03:00
msuih	146e43b6ec	Do not use negative unsigned literals	2019-07-01 04:33:23 +03:00
Eladash	2bce367488	Fixup for fixup (#6153 ) * Fixup for fixup * Fix memory ordering for MTRSX volatile doesnt block reordering. * ugh	2019-06-30 12:47:42 +03:00
Eladash	43f919c04b	Fixup after #6143 (#6146 ) vm::spu max address was overflowing resulting in issues, so cast to u64 where needed. Fixes #6145. Use vm::get_addr instead of manually substructing vm::base(0) from pointer in texture cache code. Prefer std::atomic_thread_fence over _mm_?fence(), adjust usage to be more correct. Used sequantially consistent ordering in semaphore_release for TSX path as well. Improved memory ordering for sys_rsx_context_iounmap/map. Fixed sync bugs in HLE gcm because of not using atomic instructions. Use release memory barrier in lwsync for PPU LLVM, according to this xbox360 programming guide lwsync is a hw release memory barrier. Also use release barrier where lwsync was originally used in liblv2 sys_lwmutex and cellSync. Use acquire barrier for isync instruction, see https://devblogs.microsoft.com/oldnewthing/20180814-00/?p=99485	2019-06-29 18:48:42 +03:00
Eladash	1ee7b91646	Refactoring (#6143 ) Prefer vm::ptr<>::ptr over vm::get_addr. Prefer vm::_ptr/base over vm::g_base_addr with offset. Added methods atomic_t<>::bts and atomic_t<>::btr . Removed obsolute rsx:🧵:Read/WriteIO32 methods. Removed wrong check in semaphore_release. Added handling for PUTRx commands for RawSPU MFC proxy. Prefer overloaded methods of v128 instead of _mm_... in VPKSHUS ppu interpreter precise. Fixed more potential overflows that may result in wrong behaviour. Added io/size alignment check for sys_rsx_context_iounmap. Added rsx::constants::local_mem_base which represents RSX local memory base address. Removed obsolute rsx:🧵:main_mem_addr/ioSize/ioAddress members.	2019-06-29 01:27:49 +03:00
JohnHolmesII	232a35b6fc	Various small warning fixes -Indentation warnings -prevent shift overflow -This was declared extern in all contexts. Remove this for initialization -Fix main return types. OH CANADA! -Silence extraneos 'unused expression' warning -Force use return value (warning) -Remove tautological compare copy-pasta (char always < 256)	2019-06-28 01:45:29 +03:00
JohnHolmesII	948c1df969	Remove unecessary vulkan loader check var, per kd	2019-06-28 01:45:29 +03:00
JohnHolmesII	a124ec4a26	Remove braces around shader source strings (warnings)	2019-06-28 01:45:29 +03:00
JohnHolmesII	ebb1ae6408	Properly ignore SIMD macros to avoid warning	2019-06-28 01:40:52 +03:00
JohnHolmesII	23094b48bb	Fix warnings related to -Wswitch Add default cases. Move default breaks to newline Add proper handling in some instances. Add missing enums to switches	2019-06-28 01:40:52 +03:00
JohnHolmesII	be521ff0ab	Fix warnings related to parentheses	2019-06-25 20:36:32 -07:00
kd-11	9ce7b8a401	vk: Add LLVM8 warning for RADV drivers	2019-06-25 20:50:54 +03:00
kd-11	009c55fcba	vk: Fix broken layout stream on first draw call	2019-06-25 20:50:54 +03:00
kd-11	4ff77a8555	rsx: Improve balancing of the offloader thread - Use two counters to avoid atomic operations - Yield instead of sleeping because some games are very sensitive to timing	2019-06-25 20:50:54 +03:00
kd-11	8249d51aa8	vk: Optimize occlusion pool management - Do not consume a slot every draw call, instead batch as many draws as possible - Since renderpasses are dispatched per-draw-clause, keeping occlusion queries outside the renderpasses works fine - If renderpasses are reorganized, occlusion tasks will have to be reorganized again	2019-06-25 20:50:54 +03:00
kd-11	1ee675e1f4	facepalm of the year - Typo fix - This check leads to forever relocating memory if size never exceeds capacity!	2019-06-25 20:50:54 +03:00
kd-11	2b9c315374	rsx: Use rpcs3 thread construct for the offloader thread	2019-06-25 20:50:54 +03:00
kd-11	d26b25816d	rsx: Improve profiling setup - Avoid spamming QPC when not needed - Free performance when debug overlay is not enabled	2019-06-25 20:50:54 +03:00
kd-11	b893a75002	rsx: Rework RSX offloading - Use a lockless queue - Do not enqueue small transfers	2019-06-25 20:50:54 +03:00
kd-11	c32c1b0a62	gl: Minor API tweaks - Avoid spamming the driver with samplerParameter calls unless the parameters have actually changed	2019-06-25 20:50:54 +03:00
kd-11	6a32f716db	rsx: Reimplement vertex layout streaming - Remove string comparisons from the hot-path! - Use attribute streaming and push constants to avoid forcing a descriptor block copy every other draw call/pass. While this isn't so bad on nvidia cards, it makes AMD cards a slideshow.	2019-06-25 20:50:54 +03:00
kd-11	59ee74a275	rsx: Disable vertex cache if multithreaded memory access is enabled - When multithreaded RSX is enabled, the vertex cache just lowers performance - The small cost of upload is paid by the asynchronous thread, allowing RSX to work optimally	2019-06-25 20:50:54 +03:00
kd-11	0fa3bcc336	rsx: Asynchronous data transfer	2019-06-25 20:50:54 +03:00
kd-11	358169507c	rsx: Use SSE to accelerate index buffer uploads	2019-06-25 20:50:54 +03:00
kd-11	b645ebdb04	vk: Refactor device management and improve driver detection	2019-06-25 20:50:54 +03:00
kd-11	25bba9bf56	vk: API update - use KHRONOS_validation instead of LUNARG_standard_validation which is deprecated	2019-06-25 20:50:54 +03:00
kd-11	f113cfe5c0	vk: Avoid some useless memory barriers - Do format conversions only when necessary	2019-06-25 20:50:54 +03:00
kd-11	c9501b60ab	rsx: Use explicit fma for MAD emulation	2019-06-25 20:50:54 +03:00
kd-11	6be7c58fa4	glsl: Refactoring, cleanup and optimizations - Avoid generating unused code - Reduce GPR usage in emitted code	2019-06-25 20:50:54 +03:00
Lassi Hämäläinen	c963c51a60	Remove unnecessary header includes - Manually removed lot of unneeded #includes to clean code and reduce compilation time - Reordered some of the #includes to be in more logical order	2019-06-25 17:11:10 +03:00
Lassi Hämäläinen	a070a414a6	Move rsx::constants and rsx::limits to rsx_utils.h	2019-06-25 17:11:10 +03:00
Lassi Hämäläinen	e9e87b8bd9	Add missing #includes to header files - Multiple header files where missing #includes to other headers that where used in the header. Correct header was included in correct order in source files which caused everything to compile. - Added missing #includes so header files correctly include all their dependencies and fixes problems with IDEs being unable to parse headers correctly due to missing symbols	2019-06-25 17:11:10 +03:00
Lassi Hämäläinen	499035512b	Split Emu/Memory into more logical headers - Add vm_locking.h and vm_reservation.h and move relevant functions and types to these headers. - Change include order and make vm_ptr.h, vm_var.h and vm_ref.h headers usable invidually and them including vm.h instead of other way around - Because usage of vm::ptr now requires including vm_ptr.h instead of vm.h updated multiple #includes - Added additional #includes to vm_reservation.h and vm_locking to where vm::reservation_* and locking related functions are used	2019-06-25 17:11:10 +03:00
Eladash	cd0ef99df5	Fix BE endianess arch support in semaphore_406e (#6116 ) Add raw() methods for endianness support types and make use of it.	2019-06-21 19:29:49 +03:00
scribam	185fd3d257	rsx: Minor cleanup after #6055	2019-06-17 00:31:38 +03:00
Megamouse	3f00b485a0	cellMsgDialogAbort: don't call on_close and properly re-enable pads	2019-06-15 00:24:10 +02:00
kd-11	e515d9b83a	vk: Fixup for missing resource reference - Missing ref increment when using framebuffer could lead to use-after-free. How master was not crashing is surprising	2019-06-14 16:19:52 +03:00
kd-11	c90186cf35	vk: Do not use pixel_center_origin as its use is explicitly restricted by spec	2019-06-14 16:19:52 +03:00
kd-11	98156d2a2c	vk: Avoid submitting wrong sample count in overlay passes	2019-06-14 16:19:52 +03:00
kd-11	4104d7a6a1	vk: Simplify WCB heuristics and fix out-of-bounds access	2019-06-14 16:19:52 +03:00
kd-11	86119f58d6	rsx: Typo fix	2019-06-14 16:19:52 +03:00
kd-11	9d166c5bed	rsx: Force invalidate of children by issuing a resolve notification whenever the parent is written to - Fixes successive reads of an antialiased surface that is still bound between reads	2019-06-14 16:19:52 +03:00
kd-11	296e0105c4	vk: Fix WCB for antialiased memory	2019-06-14 16:19:52 +03:00
kd-11	9d0f5aedf3	vk: Add some missing barriers	2019-06-14 16:19:52 +03:00
kd-11	e4671c29a6	rsx: Fix typo - Arguments to the transform function are xxyy not xyxy	2019-06-14 16:19:52 +03:00
kd-11	8a1cf2c913	rsx: Attempt to reduce stencil load overhead for nvidia cards	2019-06-14 16:19:52 +03:00
kd-11	ca82dd7200	vk: Improve overlay passes for resolve/unreolve - Refactor overlays and resolve passes to support use of push constants instead of relying buffer map/unmap - Add support for nvidia resolve (NV is the only vendor not supporting shader_stencil_export)	2019-06-14 16:19:52 +03:00
kd-11	c655036920	rsx/fp: Ease pressure on fragment shaders when emulating clamp16 - TODO: Option to completely skip clamping in some architectures as it is not needed in most games - Mostly affects older GPUs that do not have access to native fp16	2019-06-14 16:19:52 +03:00
kd-11	5f34c0c59a	vk: Clean up WCB readbacks when resource is multisampled - Resolve image first before performing any transfer operations	2019-06-14 16:19:52 +03:00
kd-11	9d314ca4ca	rsx: Correctly count number of valid entries if there are broken entries in the cache	2019-06-14 16:19:52 +03:00
kd-11	bca5f94b3f	rsx: Add option to toggle MSAA	2019-06-14 16:19:52 +03:00
kd-11	ea8409dcfd	rsx: Re-enable optional sample-to-pixel transformation	2019-06-14 16:19:52 +03:00
kd-11	acb14320da	rsx: Fixup for resolution scaling support	2019-06-14 16:19:52 +03:00
kd-11	4a5bbba277	rsx: Enable MSAA - vk: Enable depth buffer resolve+unresolve - vk: Add AMD stenciling extension support - rsx: Temporarily disables MSAA-compatible hacks such as transparency AA - TODO: Add paths to optionally disable MSAA	2019-06-14 16:19:52 +03:00
kd-11	f6f3b40ecc	rsx: Fix AA coordinate transforms - Requires native_pitch value to take samples into account	2019-06-14 16:19:52 +03:00
kd-11	655eff29e8	rsx: Refactoring and cleanup after d3d12 separation - Remove deprecated functionality - Refactor to share code between common routines	2019-06-14 16:19:52 +03:00
kd-11	db5d56a22d	d3d12: Remove all shared code with other backends	2019-06-14 16:19:52 +03:00
kd-11	0d906d6974	rsx: Remove surface aa_mode hacks	2019-06-14 16:19:52 +03:00
scribam	13671d9684	rsx: Apply Clang-Tidy fix "modernize-loop-convert" + const when relevant	2019-06-12 15:11:52 +03:00
scribam	1e327ad31b	rsx: Apply Clang-Tidy fix "readability-avoid-const-params-in-decls"	2019-06-12 15:11:52 +03:00
scribam	370dcd9d6e	rsx: Apply Clang-Tidy fix "readability-simplify-subscript-expr"	2019-06-12 15:11:52 +03:00
scribam	0b97d12a7b	rsx: Apply Clang-Tidy fix "modernize-use-using"	2019-06-12 15:11:52 +03:00
scribam	f1e939936a	rsx: Apply Clang-Tidy fix "modernize-use-override"	2019-06-12 15:11:52 +03:00
scribam	44265aa27d	rsx: Apply Clang-Tidy fix "modernize-use-equals-default"	2019-06-12 15:11:52 +03:00
scribam	635695ac78	rsx: Apply Clang-Tidy fix "modernize-use-emplace"	2019-06-12 15:11:52 +03:00
scribam	a555504142	rsx: Apply Clang-Tidy fix "modernize-deprecated-headers"	2019-06-12 15:11:52 +03:00
scribam	cba828384d	rsx: Apply Clang-Tidy fix "modernize-pass-by-value"	2019-06-12 15:11:52 +03:00
scribam	a02a8642b0	rsx: Apply Clang-Tidy fix "modernize-make-unique"	2019-06-12 15:11:52 +03:00
scribam	b91bcdbbca	rsx: Apply Clang-Tidy fix "modernize-use-bool-literals"	2019-06-12 15:11:52 +03:00
scribam	349e7c8708	rsx: Apply Clang-Tidy fix "readability-non-const-parameter"	2019-06-12 15:11:52 +03:00
scribam	35dc98be06	rsx: Apply Clang-Tidy fix "readability-string-compare"	2019-06-12 15:11:52 +03:00
scribam	ac7e89660f	rsx: Apply Clang-Tidy fix "readability-redundant-smartptr-get"	2019-06-12 15:11:52 +03:00
scribam	801fa0113f	rsx: Apply Clang-Tidy fix "readability-inconsistent-declaration-parameter-name"	2019-06-12 15:11:52 +03:00
scribam	c9b0a4afd0	rsx: Apply Clang-Tidy fix "performance-type-promotion-in-math-fn"	2019-06-12 15:11:52 +03:00
scribam	8f2647555a	rsx: Apply Clang-Tidy fix "readability-redundant-string-init"	2019-06-12 15:11:52 +03:00
scribam	331fe01762	rsx: Apply Clang-Tidy fix "performance-for-range-copy"	2019-06-12 15:11:52 +03:00
scribam	db926ee671	rsx: Apply Clang-Tidy fix "performance-unnecessary-value-param"	2019-06-12 15:11:52 +03:00
scribam	81a3b49c2f	rsx: Apply Clang-Tidy fix "readability-container-size-empty"	2019-06-12 15:11:52 +03:00
scribam	c4667133c4	gl/vk: Add constexpr to varying_registers and sync functions between the two backends	2019-06-12 10:59:31 +01:00
scribam	65581acbf9	rsx: Use constexpr for flattening_helper::m_register_properties	2019-06-12 10:59:31 +01:00
kd-11	d361eedbec	rsx: Clean up window management code - Removes a lot of wm_event code that was used to perform window management and is no longer needed. - Significantly simplifies the vulkan code. - Implements resource management when vulkan window is minimized to allow resources to be freed.	2019-06-10 14:57:03 +03:00
kd-11	57196f0504	vk: Move frame present synchronization to the driver - Just use a semaphore and let the driver handle it instead of manual framepacing. We lose framepace control but drivers have matured in the past few years so it should work fine.	2019-06-10 14:57:03 +03:00
scribam	39fa1d7031	ci/vk: Bump Vulkan version (1.1.73.0/1.1.97.0 => 1.1.106.0) VULKAN_SDK_MIRROR removed as the server is down	2019-06-09 23:43:57 +01:00
scribam	f9ad635856	rsx: TextGlyphs optimizations	2019-06-09 23:09:11 +01:00
Nekotekina	dfd50d0185	Implement std::bit_cast<> Partial implementation of std::bit_cast from C++20. Also fix most strict-aliasing rule break warnings (gcc).	2019-06-02 23:22:16 +03:00
scribam	99c1f87289	vk: Fix memory value in comments to match with the code below	2019-06-01 22:59:23 +03:00
scribam	09c9996f31	Use empty() instead of comparing size() with 0 Recommendation from Clang-Tidy: https://clang.llvm.org/extra/clang-tidy/checks/readability-container-size-empty.html	2019-06-01 22:59:23 +03:00
scribam	bf557ea6e6	Use the more efficient character literal overload for find_first_of/find_last_of Recommendation from Clang-Tidy: https://clang.llvm.org/extra/clang-tidy/checks/performance-faster-string-find.html	2019-06-01 22:59:23 +03:00
scribam	78c7ef3039	rsx: Use clear() instead of resize(0) The result is the same but clear [1] has slightly less code than resize [2] and signals better the intent IMHO. [1] `fb7fb646fa/libstdc%2B%2B-v3/include/bits/stl_vector.h (L1495)` [2] `fb7fb646fa/libstdc%2B%2B-v3/include/bits/stl_vector.h (L934)`	2019-06-01 22:59:23 +03:00
scribam	d9d7634f8b	vk: remove duplicate condition in pipeline_props struct equal operator	2019-06-01 00:01:32 +03:00
msuih	ef587d4cdc	Limit shaderlog writing behind log_programs setting	2019-05-31 19:49:32 +03:00
kd-11	f2cac26154	rsx: Refactor out GLSLTypes from GLSLCommon to avoid warning spam due to unused functions when included in settings dialog code	2019-05-31 13:27:43 +03:00
kd-11	6e92516070	vk: Do not reset descriptors from the aux buffer when things are running slow - The aux buffer borrows its descriptors from the lagging frame, so they are still in use until the frame completes.	2019-05-31 13:27:43 +03:00
kd-11	e118c9e5da	update glslang	2019-05-30 11:48:38 +03:00
Megamouse	34964e0e4f	handle some warnings	2019-05-28 21:47:49 +02:00
kd-11	d9ab2c7104	vk: Bump shaders cache version - Pipeline properties changed with the renderpass update	2019-05-28 15:28:30 +03:00
kd-11	57eb892153	vk: Refactor framebuffers - Refactor out framebuffers from the renderer core - Use a proper cache with sorted queues for faster searching	2019-05-28 15:28:30 +03:00
kd-11	507ec8252b	vk: Refactor renderpass management - Ensures the current renderpass matches the image properties even when a cyclic reference is detected - Solves SDK debug output error spam due to mismatching layouts and renderpasses	2019-05-25 14:07:29 +03:00
Malcolm Jestadt	c348fec84b	Warn AMD linux users about potential performance loss if not using RADV	2019-05-24 17:16:29 +03:00
Malcolm Jestadt	6ab3011eef	vk: Check_window_status fixups Intel ANV has been tested and verified to work without workaround AMDVLK and the proprietary AMD driver have been confirmed to require workaround for window resizing	2019-05-24 17:16:29 +03:00
kd-11	370b9e196d	vk: Improve descriptor pool management - Add double-buffered descriptor pools to avoid use-after-free situations - Make descriptor pools more configurable - Also adds in a hack to allow renderdoc to capture properly	2019-05-22 01:18:46 +03:00
kd-11	46ba53f122	vk: Propagate more information to the driver - Pass "correct" layout to descriptors - TODO: Fix renderpass attachment descriptors which are inadvertently doing silent transitions	2019-05-22 01:18:46 +03:00
kd-11	c3b234f972	gl: Fix staging buffer size calculation	2019-05-22 01:18:46 +03:00
Malcolm	9a26c0abda	Overlays: Fix timing	2019-05-21 13:01:38 +03:00
Nekotekina	9abb303569	vm: expand reservation lock bit area to 7 bit This is minor change.	2019-05-19 17:46:55 +03:00
kd-11	8009e53642	rsx: Fix upload block range optimization - The 'max' index should take the first assigned ID; fixes problems with divisors	2019-05-19 17:33:21 +03:00
kd-11	a245d9fb24	vk: DOuble general-purpose heap allocation to 128M and add a better diagnostic message for OOM	2019-05-19 17:33:21 +03:00
kd-11	0ef7b2aaff	rsx: Use a saner model for swap queue handling - Use a simple queue to avoid redundant checks over all the contexts - Poll queue if RSX pipe is idle - Only check the queue when the frame context is dirty (after a queue operation) - Reset descriptors at the start of the frame context to avoid having to synchronize mid-frame - Fully synchronize if a descriptor reset is required mid-frame (spec compliance; also fixes flickering verts on some hardware)	2019-05-19 17:33:21 +03:00
kd-11	dc749d3975	vk: Bump max number of allocated draw calls from 4k to 16k	2019-05-19 17:33:21 +03:00
kd-11	e3f68c66d8	rsx: Use a shared sampler pool instead of relying on the drivers	2019-05-17 22:51:40 +03:00
Megamouse	edb1a32bb1	overlays: use L1 and R1 to step by 10 in the save data list	2019-05-17 20:21:23 +02:00
Megamouse	32bdd8ef7b	overlays: move some code to cpp files	2019-05-17 20:21:23 +02:00
kd-11	4037225e98	vk: Workaround for cyclic feedback loops - Transition attachments to LAYOUT_GENERAL in case of a feedback loop - Fixes appearance of garbage along polygon edges in some post-processing passes. - Also reverse this transition when rendering goes back to normal	2019-05-17 16:41:17 +03:00
kd-11	cb78522620	rsx: Fixup for uninitialized surface antialiasing mode	2019-05-16 19:25:26 +03:00
kd-11	45a13d0319	rsx: Fixup for lost aliased surfaces - Intersection routines were changed and require explicit identification of the "old surface"	2019-05-16 19:25:26 +03:00
kd-11	05eb1e9193	rsx: Fix zombie image references from inside the texture cache - Do not add locked orphans to the flush_always cache! They will not remove their cache entries as they are not bound	2019-05-16 19:25:26 +03:00
kd-11	214bb3ec87	rsx: Always initialize memory unless it is guaranteed to be wiped	2019-05-16 19:25:26 +03:00
kd-11	88290d9fab	rsx: Hack around using data regions as transfer targets	2019-05-16 19:25:26 +03:00
kd-11	4182f9984d	rsx: Propagate split section information back to the texture cache	2019-05-16 19:25:26 +03:00
kd-11	3c7d8a1099	rsx: Minor texture/surface scanning optimization - Also re-enable optimization in blit engine accidentally disabled during debugging	2019-05-16 19:25:26 +03:00
kd-11	9f0090772a	rsx: Fix write tagging when comments are transferred in by blit engine	2019-05-16 19:25:26 +03:00
kd-11	4b443be881	rsx: Fix self-intersection with previous occupant of the address being replaced	2019-05-16 19:25:26 +03:00
kd-11	b840f6da28	[WIP] rsx: Use a sane reference counting model	2019-05-16 19:25:26 +03:00
kd-11	e3cf3ab6b8	rsx: Minor fixes - Fix transfer scaling (inverted) - Fix under-estimated typeless acquisition when doing depth format scaling	2019-05-16 19:25:26 +03:00
kd-11	e02e27b2b3	rsx: Prevent out-of-bounds writes when resolving shader input textures - The target area can also have padding!	2019-05-16 19:25:26 +03:00
kd-11	1c439f6198	vk: Fix some spec violations	2019-05-16 19:25:26 +03:00
kd-11	88c20afd3a	rsx: Implement unaligned surface inheritance with hierachial contribution - Allows render targets to behave like stacked 3D views same as shader inputs are resolved - Basically implements most of 'Read Color/Depth Buffers" option for 'free'. - Allows splitting RTV/DSV resources if they are superceded by a partial surface - Also allows intersecting new resources through the surface cache for proper inheritance from other scattered data - TODO: Refactor bind_surface_as_rtt and bind_surface_as_ds to reduce asinine code duplication	2019-05-16 19:25:26 +03:00
scribam	22f61caf9f	GLTexture: add missing #pragma once directive	2019-05-12 18:32:11 +03:00
scribam	6c5ea068c9	Remove redundant semicolons Fix "-Wextra-semi" warnings	2019-05-12 18:32:11 +03:00
scribam	3623f4343f	gl/vk: clear scissor_setup_invalid bit along with scissor_config_state_dirty bit	2019-05-11 13:13:49 +03:00
eladash	7ead021aa7	rsx: Fix 3d swizzled texture to linear conversation	2019-05-08 23:48:39 +03:00
Megamouse	5141590729	overlays: add separate timestamp for the start of the d-pad interval	2019-05-06 22:00:40 +02:00
Malcolm Jestadt	fd2bc95a7b	overlays: Double dpad repeat rate	2019-05-06 22:00:40 +02:00
kd-11	9c346c92f3	gl: undo an accidental deletion	2019-05-05 13:37:55 +03:00
kd-11	2bec304cca	vk: Allow some drivers to bypass window polling if not needed	2019-05-05 13:37:55 +03:00
kd-11	6b7cd458e3	rsx: Silence some diagnostics unless compiled with debugging options	2019-05-01 15:36:21 +03:00
kd-11	1d5c52f476	rsx: Ignore stencil clear flag if the stencil write mask is disabled	2019-05-01 15:36:21 +03:00
kd-11	48cb265c2c	rsx: Bounds check on local resource for atlas merge. - Local resources can also have padded pitch dimensions and false-positives on range overlap tests	2019-05-01 15:36:21 +03:00
kd-11	63f9b8e0c6	gl/vk: Minor cleanup	2019-05-01 15:36:21 +03:00
kd-11	ec9aa74008	rsx: Fix section base offset calculation for blit_dst targets which affects confirmed memory range - Fixes flushes only writing partially to target memory	2019-05-01 15:36:21 +03:00
kd-11	4e3ec162e2	rsx: Fix broken texture cache search when flipping	2019-05-01 15:36:21 +03:00
kd-11	6feffe6ff6	rsx: Ignore transfer offsets when wrapping behaviour is expected	2019-05-01 15:36:21 +03:00
kd-11	f56a6548b0	gl: Remove workaround for AMD driver bug fixed in driver 19.4.3	2019-05-01 15:36:21 +03:00
kd-11	243df38360	rsx: Fix VP writes to CC with a MOV instruction - When moving to CC, the operation has VEC flag disabled and also temp regs disabled. Looks to be the catch-all ELSE in the selection logic.	2019-04-25 16:23:05 +03:00
kd-11	3cbccdd760	rsx: Fragment shader decompiler cleanup TODO: Investigate the _s input modifier behaviour further, in case it can avoid generating zeroes from a MAD instruction. x = MAD(+ve, -ve, -ve) with _s input modifier in BFBC expects result to be Non-zero	2019-04-25 16:23:05 +03:00
kd-11	4cd1c25729	"rsx: Ignore argument sign for SQRT operations"	2019-04-25 16:23:05 +03:00
kd-11	32396ba366	rsx: Simplify use of some mixed input functions using OPFLAGS to avoid implicit conversions	2019-04-25 16:23:05 +03:00
kd-11	f12bd8068c	rsx: Fragment decompiler fixups - Properly test for NaN and Inf when clamping down to fp16 - Optimize divsq a bit; mix(vec, vec, bvec) emits OpSelect which is what we want here, instead of component-wise selection which is much slower.	2019-04-25 16:23:05 +03:00
kd-11	abe7188acf	rsx: Proper workaround for broken DIVSQ instruction on realhw - While mul(0, nan) = nan and 0 / 0 = nan, 0 / sqrt(0) = 0 because of hw gremlins. normalize(0) is also nan so this behaviour does not work around that particular case either which makes it even more baffling.	2019-04-25 16:23:05 +03:00
kd-11	60f3059d22	rsx: Compensate for nvidia's low precision attribute interpolation - The hw generates inaccurate values when doing perspective-correct interpolation of vertex output attributes and makes the comparison (a == b) fail even when they are a fixed constant value. - Increase equality tolerance when doing comparisons in fragment shaders for NV cards only to work around this issue. - Teepo fix	2019-04-25 16:23:05 +03:00
kd-11	463b1b220d	rsx: Improve accuracy of shadow compare Ops when non-integer depth formats are used - The fixed-point D24S8 format does special Z clamping during compare which matches PS3 behaviour - D32S8 is a floating point format and comparison with Dref > 1 always fails causing black edges/borders	2019-04-25 16:23:05 +03:00
kd-11	7ad1646c2c	vk: Skip feature check if extension is not supported	2019-04-25 16:23:05 +03:00
kd-11	06a85f00d1	rsx: Shader decompiler cleanup and improvements - Improve support for float16_t by minimizing mixed inputs to functions (ambiguous overloads) - Minimize amount of downcasts in code by using opcode flags - Re-enable float16_t support for vulkan	2019-04-25 16:23:05 +03:00
kd-11	a668560c68	rsx: Use native half float types if available - Emulating f16 with f32 is not ideal and requires a lot of value clamping - Using native data type can significantly improve performance and accuracy - With openGL, check for the compatible extensions NV_gpu_shader5 and AMD_gpu_shader_half_float - With Vulkan, enable this functionality in the deviceFeatures if applicable. (VK_KHR_shader_float16_int8 extension) - Temporarily disable hw fp16 for vulkan	2019-04-25 16:23:05 +03:00
kd-11	ee319f7c13	rsx: Implement strict clamp16 operation needed for NVIDIA cards	2019-04-25 16:23:05 +03:00
eladash	6f76e34104	rsx: Fix race on clearing native_ui vs emu_requested flag	2019-04-20 01:04:41 +03:00
eladash	888cb9d673	Remove reader_lock executed in every instruction by RSX Use optimistic double check instead, use one load instruction for the check to be atomic + Read emu status once every FIFO iteration	2019-04-20 01:04:41 +03:00
eladash	f25587d24c	rsx: Write vblank semahpre, minor semaphore acquire optimization	2019-04-20 01:04:41 +03:00
Megamouse	b929c13c45	implement get_firmware_version add firmware version to the first line in the log	2019-04-16 22:13:28 +02:00
kd-11	df3b46a611	rsx: Improve texture sourcing and clipping when reverse scanning is enabled - When reverse scanning, offsets are inverted and offset value of 0 is logically equivalent to an offset of -1 - Add an explicit message if clipping happens to avoid silent errors/bugs	2019-04-12 15:36:21 +03:00
kd-11	12dc3c1872	vk: Dynamic heap management to potentially fix ring buffer overflows - Allows checking one heap type at a time, on demand - Should avoid OOM situations unless inside an uninterruptible block	2019-04-09 13:40:54 +03:00
kd-11	a4495c35b7	rsx: Fixups for swizzled texture scanning - Revert to using block metrics, but with optional per-channel decode stage for the final transfer. Much cleaner than hacking in the width to be in channels instead of blocks.	2019-04-09 13:40:54 +03:00
kd-11	a5ed30a8c0	rsx: Fixups for data cast operations via typeless transfer	2019-04-09 13:40:54 +03:00
kd-11	f04a0a2bb6	rsx: Remove some old restrictions affecting memory persistence	2019-04-09 13:40:54 +03:00
kd-11	0a604e39f1	rsx: Implement RGB655 decode	2019-04-09 13:40:54 +03:00
kd-11	cc3809fbfe	gl: Register a few more missing formats for conversion	2019-04-09 13:40:54 +03:00
kd-11	e4e86455f2	rsx: Fix temporary subresource caching behaviour - Do not cache if a gathered subresource contains a bound RTT - Change op to dynamic copy if parent is still bound	2019-04-09 13:40:54 +03:00
kd-11	3249000511	rsx: Improvements to texture scanning - Removes CPU-only transforms that broke GPU-side code. -- Channels in GPU compute are laid out in cell-order, but CPU was uploading in favorable order and compensating with swizzles. -- This leads to 2 different layouts depending on the location of the data (CPU vs GPU) - Implement R8G8_R8B8 interleaved format decode - General improvements	2019-04-09 13:40:54 +03:00
kd-11	0f7af391d7	vk: Implement copy-to-buffer and copy-from-buffer for depth_stencil formats - Allows D24S8 and D32S8 transport via typeless channels - Allows uploading and downloading D24S8 data easily - TODO: Implement optional byteswapping to fix flushed readbacks with the same method	2019-04-09 13:40:54 +03:00
kd-11	366e4c2422	rsx: Preliminary support for format conversions using typeless resolve	2019-04-09 13:40:54 +03:00
kd-11	b7470cfc1a	rsx: Tighten format checks in cache hit tests	2019-04-09 13:40:54 +03:00
kd-11	443fde760f	rsx: Blit engine clipping fixes - Do not round up sub-pixel offsets, round down instead - Do not allow incomplete sources for hw blit transfer - Reimplement src clipping (slice_h) - Check 'area' of incoming texels and correct for them before RTT lookup/transfer - Filter out incomplete targets when performing RTT lookup (1 texel or less contribution)	2019-04-09 13:40:54 +03:00
eladash	8185ef7610	rsx: Improve vblank accuracy	2019-03-31 14:57:21 +03:00
eladash	801e6114b6	rsx: Use relaxed store on fifo ctrl registers	2019-03-31 14:57:21 +03:00
kd-11	41b87cf577	rsx: Blit engine fixes - If a transfer writes to a RTT and depth mismatch happens, create a local target and the upload function will likely resolve between the two - If a surface is rejected, reset the target region!	2019-03-22 21:27:15 +03:00
kd-11	86ad204636	rsx: Rebase output region when using upload-fallback path	2019-03-22 21:27:15 +03:00
kd-11	dbc8e70ddd	rsx: Silence some compiler noise	2019-03-22 21:27:15 +03:00
kd-11	3a4e3fa53a	rsx: Fix use-after-modify condition when inserting a draw command out of order - Fixes barrier->range rebase after the insert	2019-03-22 21:27:15 +03:00
kd-11	d731c07ade	vk: Fix typeless resource management - Fixes bugs that appear with high resolution scaling	2019-03-22 21:27:15 +03:00
kd-11	adc59f9810	rsx: Fix blit transfers when texel sizes mismatch - Also refactors some bpp handling code - Simplify texture intersection test to use a normalized/uniform coordinate space - Fix broken bounds checking as well	2019-03-22 21:27:15 +03:00
kd-11	b879b32271	rsx: Fix bpp calculation taking resolution scaling into account - Do not rely on image->width(), use surface_width() instead for unscaled values - Refactor/clean GL rendertarget class a bit	2019-03-20 10:05:54 +03:00
kd-11	03fca73cf4	rsx: Fix blit intersection falling outside the available texture - Just becaue we have a hit inside the tile of interest does not guarantee that it sits inside the texture!	2019-03-20 10:05:54 +03:00
kd-11	3ef16bee47	rsx: Fix texture lookups and avoid out-of-bounds copies/transfers	2019-03-17 21:50:11 +03:00
kd-11	bb65e45614	rsx: Implement GPU acceleration for rotated images	2019-03-17 21:50:11 +03:00
kd-11	5260f4b47d	rsx: Improvements to memory flush mechanism - Batch dma transfers whenever possible and do them in one go - vk: Always ensure that queued dma transfers are visible to the GPU before they are needed by the host Requires a little refactoring to allow proper communication of the commandbuffer state - vk: Code cleanup, the simplified mechanism makes it so that its not necessary to pass tons of args to methods - vk: Fixup - do not forcefully do dma transfers on sections in an invalidation zone! They may have been speculated correctly already	2019-03-17 21:50:11 +03:00
kd-11	385485204b	vk/gl: Omit unlocked data when grabbing flip sources from texture cache	2019-03-17 21:50:11 +03:00
kd-11	74eeacd091	vk/gl: Improve memory tag sync and test - Properly pass parameters such as rsx-pitch to the surface store - Do not crash if a surface fails verification in flip, use fall-back instead	2019-03-17 21:50:11 +03:00
kd-11	1a44446250	rsx: Fix dst upload block region - The section needed starts at image origin, not transfer origin!	2019-03-17 21:50:11 +03:00
kd-11	a49a0f2a86	vk/gl: Synchronization improvements - Properly wait for the buffer transfer operation to finish before map/readback! - Change vkFence to vkEvent which works more like a GL fence which is what is needed. - Implement supporting methods and functions - Do not destroy fence by immediately waiting after copying to dma buffer	2019-03-17 21:50:11 +03:00
kd-11	85cb703633	rsx/cache: Debugging bugs introduced by the atlas coverage check - Figured out why it breaks things, ofc can't actually check for coverage when there is no proper fbo data persistence	2019-03-17 21:50:11 +03:00
kd-11	3a4083263e	rsx: Fix texture transfer when pitch does not match exactly	2019-03-17 21:50:11 +03:00
kd-11	612160a8ff	rsx: Fix zero-pitch textures - Assumption here is that only texel (0, 0) is accessible. Inline with other pitch 0 operations. - TODO: Verify pitch 0 does not advance in Y either	2019-03-17 21:50:11 +03:00
kd-11	17c49d21a5	rsx/blit: Remove workarounds/hacks added for master. Start implementation/stubs for blit engine rotations in GPU	2019-03-17 21:50:11 +03:00
kd-11	745f8f9627	rsx: Remove pointless assert	2019-03-17 21:50:11 +03:00
kd-11	1875dc3f18	gl: Fix buffer size calculations	2019-03-10 16:09:05 +03:00
kd-11	358558aaa7	cleanup and fixups	2019-03-10 16:09:05 +03:00
kd-11	04dda44225	rsx: Properly generate render target data with all parameters provided - Build-up to variable-sized framebuffers and AA implementation - Also allows accurate range calculation for our hit testing	2019-03-10 16:09:05 +03:00
kd-11	21bc6c7a87	rsx: Properly resolve data for upload when needed. - Avoids blindly reusing blit dst sections as they may contain garbage. If a section was unlocked for a flush, just discard it as its reuse introduces potential data corruption. Since the data needs to be reuploaded anyway (for now), its better to start afresh - In case of format mismatch, reset the calculated dst block - Add a bounds check to determine if data contained in an atlas is good enough for sampling the cache. If not enough data is provided, fall back to full upload	2019-03-10 16:09:05 +03:00
kd-11	9d4d3d9443	rsx: Reimplement render target intersection tests when using hw accelerated blit engine - Properly collapse memory tree when scanning in case of overlaps!	2019-03-10 16:09:05 +03:00
kd-11	f4ebcb0029	rsx: Properly decode packed renders from the type flag - Seems to occupy bits [8-9]	2019-03-10 16:09:05 +03:00
kd-11	7c379432dd	rsx: Implement proper pitch compatibility lookup - When a single row is required or is all that is available, pitch has no meaning as the coordinate space changed to 1D	2019-03-10 16:09:05 +03:00
kd-11	dccb4a4888	rsx/texture_cache: fixes to commit_framebuffer_memory	2019-03-10 16:09:05 +03:00
kd-11	b9e7b085fe	rsx/texture_cache: Fixups for local resource hit and fast-path added	2019-03-10 16:09:05 +03:00
kd-11	a80f1a6ed4	gl: Fix memory tag sampling - Also fixes a bad arg passed to glClearBuffer	2019-03-10 16:09:05 +03:00
kd-11	0395fb9955	rsx/tecture_cache: Addendum - fix data cast with scaling conversion (AA emulation) - Blit operations do format conversion automatically which is NOT what we want! - Scale onto temp buffer with similar format before performing data cast.	2019-03-10 16:09:05 +03:00
kd-11	10dc3dadee	rsx/texture_cache: Improve framebuffer memory locking when WCB/WDB is not enabled - Adds a new mode that removes non-framebuffer stuff inside framebuffer range	2019-03-10 16:09:05 +03:00
kd-11	563e205a72	rsx/texture_cache: Fix 'AA' scaling hack and restore collection template selection	2019-03-10 16:09:05 +03:00
kd-11	fa628f0ac4	rsx/surface_store: More aggressive tag sampling - Use a 5-point tap with an X pattern across the target's memory space to reduce chances of false positives - TODO: Potential false positives identified, requires some minor restructuring of surface_store	2019-03-10 16:09:05 +03:00
kd-11	3a071a9c07	rsx: Texture search rewrite - Perform a full search across all resource types as needed without taking too many shortcuts/hacks	2019-03-10 16:09:05 +03:00
kd-11	6ef9dcd62e	rsx: Handle mismatched/invalidated framebuffer sections when WCB is enabled	2019-03-10 16:09:05 +03:00
kd-11	ef071ebb6b	rsx: Synchronize surface cache and texture cache data - TODO: The whole upload_texture thing is a big hack, fix it properly	2019-03-10 16:09:05 +03:00
elad	bd259c8ae4	vulkan zcull: Fix deadlock in zcull flush waiting Block adding additional flush requests until the first ones are treated (by adding missing lock)	2019-03-08 23:44:46 +03:00
elad	fc253165e2	Correctness fix for RSXIOMem - Make RSXIOMem volatile. - Hint the compiler to check only once the address returned.	2019-03-08 23:44:46 +03:00
elad	ce8c92262d	Treat X8R8G8B8 format as A8R8G8B8 in image_in, Fixes #5510	2019-03-08 23:44:46 +03:00
eladash	d82362fa1d	Use sys_memory_allocate on rsx replayer to fix it	2019-03-05 21:23:24 +03:00
German	4c72f7c1de	Fix clear string container in CgBinaryFragmentProgram.cpp	2019-02-18 16:34:16 +03:00
Megamouse	17a5e0bc98	cellGame: add error_code	2019-02-12 21:06:10 +03:00
kd-11	19ff95da70	vk: Fix usage of VK_IMAGE_LAYOUT_GENERAL - Properly synchronize when transitioning to/from GENERAL layout. - General layout requires full pipeline dependency since its used in a 'general' sense. As such, its use is to be largely avoided.	2019-02-07 11:40:17 +03:00
kd-11	38887bc03e	gl/vk: Improvements to overlay rendering - gl: Properly initialize and manage sampler states - gl/vk: Snap overlay elements to pixel grid by aligning to pixel centers - overlays: Disable grid snapping in stb since its now handled in the backend	2019-02-05 12:15:12 +03:00
kd-11	4c593959fd	overlays/save_dialog: Layout improvements - Make detail a separate text entity as it often contains a lot of noise - Properly pad the entry if needed to avoid text sitting too close to the edge	2019-02-03 22:26:46 +03:00
kd-11	67cdec577f	overlays/util: Add support for glyph set lowering when mapping utf8 to ascii8 - Lower fullwidth glyphs to halfwidth counterparts - Lower CJK punctuation glyphs - Lower general punctuation glyphs	2019-02-03 22:26:46 +03:00
kd-11	a36d3af3b4	vk: Minor frame management improvements	2019-02-02 11:54:01 +03:00
kd-11	27af05da1a	osk: Fixup attempt for hang in close callback where a sysutil_callback fails to fire.	2019-02-02 11:54:01 +03:00
kd-11	b36cb66129	overlays: Allow use of extended ascii8 - Use custom string conversion to ensure overlay deals with extended ascii whenever possible - Improves language compatibility greatly and avoids empty spaces for unknown glyphs	2019-02-02 11:54:01 +03:00
kd-11	12990f3ca3	overlays/util: Strip extended codes from utf-16 encoded strings	2019-02-02 11:54:01 +03:00
kd-11	9e39e2d2c4	gl/vk: Fix clip region scaling for overlay elements	2019-02-02 11:54:01 +03:00
kd-11	3653c2eb0d	overlays/osk: Add support for edit text control and disabled cells - Allows to disable cells from being selectable. - Edit text control adds proper support for multiline and a functioning caret	2019-02-02 11:54:01 +03:00
kd-11	faf5221b0d	overlays: Implement edit_text control	2019-02-02 11:54:01 +03:00
kd-11	c434e0ce27	overlays/osk: Add more buttons to native dialog and other improvements - Adds all the major buttons to native dialog input options - Adds more button options for the native osk - Brighten osk cell backgrounds a bit to improve visibility	2019-02-02 11:54:01 +03:00
kd-11	9ed9d7e947	overlays/osk: Implement native osk interface	2019-02-02 11:54:01 +03:00
kd-11	9d4b19b97a	vk: Increase number of draw calls per frame for overlays to 1024 - Allows for more complex interface design	2019-02-02 11:54:01 +03:00
kd-11	f47d3a761b	vk: Hotfix for fullscreen not working on non-windows platforms	2019-02-01 00:22:11 +03:00
kd-11	09a8f7ae53	vk: Use FIFO mode for vsync - Avoids tearing and also hides some driver bugs causing fullscreen bugs with mailbox mode	2019-01-31 21:53:02 +03:00
kd-11	3bfa564ef8	vk/windows: Try to keep msq thread from ever stopping - NVIDIA drivers hook into the msq before our nativeEvent handler. This means NV is aware of events before rpcs3 is aware of them and sometimes stops until a new event is triggered. If rpcs3 is inside a driver call at this time, the system will deadlock since the driver waits for msq which waits for the renderer which waits for the driver. - Use explicit hook management to control window events - Add fence timeout to attempt detection of surface loss events	2019-01-31 21:53:02 +03:00
eladash	6f770c8e35	Fix potential crash in begin_occlusion_query() while closing the Emu	2019-01-30 18:44:29 +03:00
kd-11	660bfeabae	gl: Fixup - inline arrays	2019-01-25 14:34:22 +03:00
kd-11	fa9b448686	vk: Spec fixups - Disable DEPTH<->RGBA typeless transfers for now as they require a lot more work to work for all vendors - Do not allow switching layouts to UNDEFINED/PREINITIALIZED formats	2019-01-25 14:34:22 +03:00
kd-11	2163a59649	rsx: Typo fix	2019-01-25 14:34:22 +03:00
kd-11	521969bcc3	gl: Remove GL_R 'format'. There is no GL_R format, it part of the S-T-Q-R enums for texture coordinate space	2019-01-25 14:34:22 +03:00
kd-11	5a4bea8c4f	gl: Blit fixup - Typo fix. I meant to disable scissor test, not stencil test - Also clean up and simplify/optimize the core logic	2019-01-25 14:34:22 +03:00
kd-11	7e33cdcb08	rsx: simple_array<T> improvements - Implement move and copy ctors	2019-01-25 14:34:22 +03:00
kd-11	fb778e4821	rsx: Reimplement attrib divisor	2019-01-25 14:34:22 +03:00
kd-11	736415fcd9	rsx/fp: Detect broken/NOP shaders automatically - Do not compile body if the shader is of no consequence, leave as a passthrough shader	2019-01-25 14:34:22 +03:00
kd-11	6fdc0fd7f0	rsx: Reimplement MSAA transparency - Apply dither to edges that almost fail the straight-up alpha test - Significantly improves alpha tested geometry far from the camera - Also removes blend factor overrides/hacks as they give incorrect results due to background bleeding	2019-01-25 14:34:22 +03:00
kd-11	10a17feda2	rsx: Avoid potential deadlock in FIFO_ctrl	2019-01-25 14:34:22 +03:00
kd-11	7eec702c6d	gl: Fix silly regression with blit dst resource readback	2019-01-25 14:34:22 +03:00
kd-11	8093c9b573	rsx: Disable rtt side-effects when async compilation is ongoing. Only real renders should promote buffer state from underined to drawn, otherwise keep previous contents intact.	2019-01-25 14:34:22 +03:00
kd-11	417a2e6731	rsx: Refactor index buffers - Index offset is ignored anyway and only used to calculate vertex attribute divisor index - Specialized optimization for untouched xfer without primitive restart	2019-01-25 14:34:22 +03:00
Megamouse	8d5d44141e	rsx/Qt: fix some undefined behavior in progress_dialog CallAfters	2019-01-22 12:04:01 +03:00
eladash	688d5a9919	rsx: Fix unknown vertex base types Clamp vertex type field into 3-bits instead of 4-bit value Case 0 is UB256	2019-01-21 22:28:20 +03:00
Nekotekina	59e0296281	cellMsgDialog: fix error spam on CELL_OK	2019-01-18 16:49:17 +03:00
Nekotekina	a419e98acb	Move PPU and shader cache New hash-based location (already used for SPU) Bump PPU cache version, improve naming and decrease size Remove fs::get_data_dir Disable boot.elf cache	2019-01-14 01:24:05 +03:00
Nekotekina	bd9131ae1c	Implement fs::get_cache_dir Win32: equal to config dir for now Linux: respect XDG_CACHE_HOME if specified OSX: possibly incomplete	2019-01-13 14:45:36 +03:00
eladash	bc27f5f75c	Implement invalid NV4097_NOTIFY context handling	2019-01-13 12:59:00 +03:00
Megamouse	d9d5f45e9e	rsx/input: fix rsx replay	2019-01-10 13:05:48 +01:00
kd-11	52ac0a901a	rsx: improve memory coherency - Avoid tagging and rely on read/write barriers and the dirty flag mechanism. Testing is done with a weak 8-byte memory test - Introducing new data when tagging breaks applications with race conditions where tags can overwrite flushed data	2019-01-06 10:44:40 +03:00
kd-11	89c9c54743	rsx: Minor hot-fix - Pitch 0 makes sense if width == 1 and height == 1	2019-01-06 10:44:40 +03:00
kd-11	95245bdd83	rsx: Improve ARGB8->D24S8 casting - Set up partial transfers - Force clear of target before starting the transfer	2019-01-06 10:44:40 +03:00
kd-11	475cc99117	rsx: Fix dirty flag reset after a partial attachment initialization - D24S8 targets have 2 aspects that are dealt with separately; Forcefully initialize the remaining data if a partial init is done. Its 'free' anyway - It seems that the stencil mask matters when clearing unlike the depth mask and color mask	2019-01-06 10:44:40 +03:00
kd-11	c80c7f06bb	rsx: Typo fix - This silly typo broke the flip improvements in the GT fixes PR	2019-01-06 10:44:40 +03:00
kd-11	2a62fa892b	rsx: Texture cache refactor - gl: Include an execution state wrapper to ensure state changes are consistent. Also removes a lot of required 'cleanup' for helper methods - texture_cache: Make execition context a mandatory field as it is required for all operations. Also removes a lot of situations where duplicate argument is added in for both fixed and vararg fields - Explicit read/write barrier for framebuffer resources depending on usage. Allows for operations like optional memory initialization before reading	2019-01-06 10:44:40 +03:00
kd-11	0f64583c7a	rsx: Reimplement pitch lookup - Remove the required_xxx_pitch constraint as it makes no sense. The pitch controls what can be written per line. - It is possible to have a huge surface width but only render to a small region at the beginning and have a smaller pitch than can fit the surface (NFS carbon)	2019-01-06 10:44:40 +03:00
kd-11	1ffadbe086	rsx: Reorganize write barrier implementation (either clear or memory barrier)	2019-01-06 10:44:40 +03:00
kd-11	9c45ce6d37	vk: Reimplement typeless memory allocation to handle resolution upscaling	2019-01-06 10:44:40 +03:00
kd-11	d8e45c86e6	vk: Remove old useless hack that interferes with memory inheritance	2019-01-06 10:44:40 +03:00
kd-11	3be4b474d9	rsx: Handle rsx-self-tripping in draw call and triggering invalid invalidation - If draw call resources consume memory that intersects with NA parts of the texture cache, we get a framebuffer test mismatch. This mismatch is false and happens because the thread has not yet reached the point of relocking the pages	2019-01-06 10:44:40 +03:00
kd-11	a95a44cf66	rsx: Strictness cleanups - Also account for variable pitch textures (swizzled scan)	2019-01-06 10:44:40 +03:00
kd-11	474d0f61a2	minor typo fix	2019-01-06 10:44:40 +03:00
kd-11	362eea09a1	whitespace fix only	2019-01-06 10:44:40 +03:00
kd-11	15d5507154	rsx: Rewrite memory inheritance transfers - Implicitly invoke a memory barrier if actively reading from an unsynchronized texture - Simplify memory transfer operations - Should allow more games to work without strict mode	2019-01-06 10:44:40 +03:00
kd-11	97704d1396	rsx: Fix texture size calculations	2019-01-06 10:44:40 +03:00
kd-11	50c07833e4	rsx: Do not force upload for missing data - TODO: Finish implementing GPU RCB for mem-sync - TODO: Refactor mem-sync	2019-01-06 10:44:40 +03:00
kd-11	6d932b042b	vk: bump max number of compute jobs from 120 to 1024 - It is possible without bugs to have a very high number of compute invocations.	2019-01-06 10:44:40 +03:00
kd-11	64a8829614	rsx: Minor cleanup	2019-01-06 10:44:40 +03:00
kd-11	15488eb247	rsx: Avoid unnecessarily touching framebuffer memory - Do not bind companion framebuffer when clearing single aspect; let the contest mechanism sort it out instead - Do not prematurely tag framebuffers, instead only do so at write-confirmation time. Should avoid false tagging if setup does not allow a render to occur.	2019-01-06 10:44:40 +03:00
kd-11	a13986ec5c	vk: Spec fixups - Forgot to update descriptor pool init sizes over time - Also clamp swapchain resources to allowable surface extents	2019-01-05 21:31:12 +03:00
Megamouse	bb464b0b64	fix some warnings	2019-01-05 04:03:18 +01:00
Megamouse	6f7b25de90	implement CELL_PAD_INFO_INTERCEPTED	2019-01-02 15:45:51 +01:00
Megamouse	8ad14c4ada	Overlays: fix input loop when no controllers are detected	2019-01-02 15:45:51 +01:00
eladash	db784556aa	rsx: Evaluate cond render test at set_render_enabled	2018-12-30 15:04:59 +01:00
eladash	568206d11a	Fix rsx capture (again)	2018-12-30 15:04:59 +01:00
Jan Beich	8d308bb4c0	overlays: don't use /proc on BSDs as it may not be mounted	2018-12-29 18:07:45 +03:00
kd-11	9c46386dd4	rsx: Check av configuration when selecting display buffers! - Some applications have mismatch between video output configuration and display buffer sizes	2018-12-24 09:05:19 +03:00
kd-11	7555be232f	rsx/vp: Fix double dst commands - Test the vec_result mask before assigning to actual output Sometimes, VEC op is used to write to Rx, and SCA op is used to write to o[x]!	2018-12-24 09:05:19 +03:00
kd-11	10d96a60f1	rsx/capture: Force flip if no flip event was recorded	2018-12-24 09:05:19 +03:00
kd-11	f48abde14b	rsx: Fixups for immediate rendering mode - Immediate mode is isolated from the rest of the vertex configuration - TODO: Verify register behaviour when immediate mode is used Check if per-primitive const register values are supported (likely are)	2018-12-24 09:05:19 +03:00
kd-11	4b79ef1ad9	rsx: Implement stencil mirror views - Implements a mirror view of D24S8 data that accesses the stencil components. Finishes the implementation of TEX2D_DEPTH_RGBA as the stencil component was previously missing from the reconstructed data - Add a few missing destructors Image classes are inherited a lot and I forgot to make the dtors virtual	2018-12-24 09:05:19 +03:00
kd-11	696b91cb9b	rsx: Reimplement conditional execution in shaders - Per-channel conditional execution introduces RAW hazards all over the place - Its cheaper to process both branches and select between the two - Also improves ShaderVariable functionality to allow functionality such as match_size and taking complex variables as inputs	2018-12-24 09:05:19 +03:00
kd-11	c75749f8ce	rsx: fix flip logic when grabbing output from the surface cache	2018-12-24 09:05:19 +03:00
Megamouse	bc3ab7a9d9	Input: Enable In-Game Pad Config Reset	2018-12-17 19:41:18 +01:00
vit9696	5a40c1802b	Support macOS bundling for binary distribution	2018-12-16 18:17:21 +03:00
eladash	c50d459b1e	cleanup rsx fifo debugger command display	2018-12-15 19:40:18 +03:00
eladash	098d634328	rsx fifo: Fix call cmd offset mask highest 3 bits are masked according to tests, also filter certainly invalid jumps with offset higher than max	2018-12-15 19:40:18 +03:00
eladash	c2aa10cccd	reduce register_pair container	2018-12-15 19:40:18 +03:00
eladash	45ed58cdaf	Fix rsx capture replay Allow to capture non-increment cmd flag that was missing in command.reg	2018-12-15 19:40:18 +03:00
eladash	87988e9da8	rsx fifo: Stability improvements * Restore stack in fifo error handling * Update get register after the cmd execution * Fix put pause in the middle of command * Add restore points when branching to self * Precise nopcmd detection * Test all invalid cmds for early treatment of queue corruption	2018-12-15 19:40:18 +03:00
eladash	835a552d8d	rsx: Implement cellGcmSetNotify	2018-12-15 19:40:18 +03:00
eladash	415b995a54	log rsx get ctrl	2018-12-15 19:40:18 +03:00
eladash	8cbaa8627c	Do not rely on cellPadInit in native ui	2018-12-15 13:51:16 +01:00
Rui Pinheiro	54bfe2e102	Add log warning on slow flush path	2018-12-11 22:37:10 +03:00
Rui Pinheiro	18b9ee4541	Reimplement overlapping fbo "hack" To avoid the need (and performance hit) of Read Color/Depth Buffers, we may not invalidate overlapping fbos inside lock_memory_region unless they are guaranteed to be superseded by the new one. This avoids e.g. issues with overblooming, among others.	2018-12-11 22:37:10 +03:00
Rui Pinheiro	5ab7296665	Fix xcode build	2018-12-11 22:37:10 +03:00
Rui Pinheiro	bcdf91edbb	Misc. Texture Cache fixes	2018-12-11 22:37:10 +03:00
Rui Pinheiro	9d1cdccb1a	Implement dedicated texture cache predictor	2018-12-11 22:37:10 +03:00
Rui Pinheiro	af360b78f2	Texture cache section management fixups Fixes VRAM leaks and incorrect destruction of resources, which could lead to drivers crashes. Additionally, lock_memory_region is now able to flush superseded sections. However, due to the potential performance impact of this for little gain, a new debug setting ("Strict Flushing") has been added to config.yaml	2018-12-11 22:37:10 +03:00
Nekotekina	476090a747	Detach VBlank and RSX Decompiler threads Should fix exception handling in RSX Thread	2018-12-04 23:41:54 +03:00
eladash	45942c4962	Fix segfault when scaled image dimension is less than clip's	2018-12-04 13:01:29 +03:00
eladash	fa5652fceb	rsx image_in: Implement negative scaling	2018-12-04 13:01:29 +03:00
eladash	ce500c75c4	throw exceptions in case of invalid/unknown operations in image_in	2018-12-04 13:01:29 +03:00
eladash	6ecf2fb3d0	rsx: default lv2 semaphore context + dma_4097 extracted from vsh	2018-12-04 13:01:29 +03:00
eladash	28e4a9e0d0	rsx image_in: Fix in_pitch 0 The hw doesnt fix pitch, when specifying src pitch 0 it copies the same pixels line to dst. keep in mind out_pitch = 0 is not allowed in image_in. Same goes for buffer_notify, though it allows out_pitch to be 0.	2018-12-04 13:01:29 +03:00
eladash	d1d3ac984e	rsx image_in: Fix src size calculation when in_pitch != line_lengh	2018-12-04 13:01:29 +03:00
eladash	0a1da14a15	rsx image_in: remove clip h and w hack If clip region is empty, dont execute	2018-12-04 13:01:29 +03:00
eladash	4ddafc481e	remove unreachable code	2018-12-04 13:01:29 +03:00
eladash	b48a4b6459	rsx-capture: reduce capture size * Dont bother capturing 'destination' blocks with no data. instead premap all main memory to ensure allocated * Capture zcull and tile state as their compressed gcm forms * Fix index array capturing, ignore empty sets * hle gcm: Fix byteswaping in cellGcmSetZcull	2018-12-04 13:01:29 +03:00
NicknineTheEagle	32059bfaa2	Properly get PARAM.SFO and icons for C00 games (#5370 ) * Added a helper function for fetching game's PARAM.SFO path This should properly get SFO path for unlocked C00 games * Normalized line endings * Refresh game list after installing a RAP file	2018-12-04 01:46:01 +03:00
kd-11	a56ba737b5	vk: Silence log spam	2018-12-03 20:01:23 +03:00
kd-11	504ab5a6d4	rsx: Minor cleanup to silence stupid compiler warnings	2018-12-03 20:01:23 +03:00
kd-11	f4c28eceef	rsx: Fix null renderer	2018-12-03 20:01:23 +03:00
kd-11	9d0042f509	rsx: Fixup for the flattener - Reset the flattener before use - Better detection of FIFO misalignment	2018-12-03 20:01:23 +03:00
kd-11	ec768afbd9	rsx: Flip workarounds for applications that flip via syscall - Do not assume flip marks end-of-frame if executed via syscall - Also disables skip_frame for these applications as there is no frame boundary - NOTE: QUEUE_HEAD cannot be relied on as it is seemingly possible to flip the same head and not need to queue it	2018-11-30 23:51:25 +03:00
kd-11	2168159d03	gl: Fix flip regression - Restore graphics state after flip (including active fbo) because flip can be made through a syscall	2018-11-30 23:51:25 +03:00
kd-11	b96ed5cd4e	gl: Do not rely on driver statistics for s3TC textures; they are inconsistent.	2018-11-30 23:51:25 +03:00
kd-11	f1c3b46d60	rsx: Fixup - undo vertex cache 'improvements'	2018-11-30 23:51:25 +03:00
kd-11	5b6e1420f3	rsx: Pipeline barriers fixed up - Ensure barriers are invoked even if no draw occurs! -- Ensures that deferred commands are executed eventually	2018-11-30 23:51:25 +03:00
kd-11	8a186bb97e	rsx: Fix insertion of execution barriers - Ignore barriers inserted after BEGIN but before any draw commands are emitted - Properly process tail barriers inserted before END but after draw commands are submitted - Ignore execution barriers with no effect (same register value written)	2018-11-30 23:51:25 +03:00
kd-11	1d19f71a46	rsx: Re-enable fifo error reset	2018-11-30 23:51:25 +03:00
kd-11	718a04c84f	fixup: Clear disabled attrib entries	2018-11-30 23:51:25 +03:00
kd-11	833c25894f	[WIP] rsx: Rebase cleanup	2018-11-30 23:51:25 +03:00
kd-11	5193c99973	rsx: Enable dynamic FIFO preprocessing - Tries to detect when FIFO preprocessing is beneficial and only enables optimizations if the benefit outweighs the cost - Current threshold is at least 500 draw calls saved at over 2000 draw calls to justify the overhead - TODO: More tuning for other CPUs	2018-11-30 23:51:25 +03:00
kd-11	7b065d7781	rsx: Fixup; input attributes blob decoding - Use an unstructured blob and index into the vec4 structures to extract the real data	2018-11-30 23:51:25 +03:00
kd-11	846daadd5d	rsx: Fixups - Improve vertex attribute layout format. Allows for full 16-bit attribute divisor - Use actual pitch when declaring framebuffer rsx pitch instead of register value in case of swizzle? rendering	2018-11-30 23:51:25 +03:00
kd-11	2e32777375	rsx: Scrap the prebuffered queue approach - Basically starting over - The cost of making command copies into the queue has a measurable impact	2018-11-30 23:51:25 +03:00
kd-11	9deecd506a	fixup: It is possible for NOP commands to contain other garbage	2018-11-30 23:51:25 +03:00
kd-11	26a56ef1f1	vk: Spec compliance. - TODO: Implement push_constants path instead of copy + bind descriptor sets	2018-11-30 23:51:25 +03:00
kd-11	d6b4440ef9	gl: Separate vertex env from program env	2018-11-30 23:51:25 +03:00
kd-11	435afcb865	rsx: Fix fifo draw barriers	2018-11-30 23:51:25 +03:00
kd-11	2d88e41583	rsx: Fix some checks when using inlined array rendering	2018-11-30 23:51:25 +03:00
kd-11	54ec363e88	rsx: Critical pipeline fixes - Fix scissor and viewport binding behavior - Fixes recovery if empty scissor is specified and then 'fixed' later - Optimizes state binding a bit	2018-11-30 23:51:25 +03:00
kd-11	1ad76ad331	rsx: Restructure programs - Also re-enable pipeline optimizations	2018-11-30 23:51:25 +03:00
kd-11	b0a6b72ce8	rsx: Optimizations - Replace a few more vectors with simple_array<T> - Avoid unnecessary string comparisons in backends. We already know referenced textures from the program analysers!	2018-11-30 23:51:25 +03:00
kd-11	677b16f5c6	rsx: Fixups - Also fix visual corruption when using disjoint indexed draws - Refactor draw call emit again (vk) - Improve execution barrier resolve - Allow vertex/index rebase inside begin/end pair - Add ALPHA_TEST to list of excluded methods [TODO: defer raster state] - gl bringup - Simplify - using the simple_array gets back a few more fps :)	2018-11-30 23:51:25 +03:00
kd-11	e01d2f08c9	rsx: Refactor FIFO - Removes fifo structures from common RSXThread - Sets up a dedicated FIFO controller - Allows for configurable queue optimizations	2018-11-30 23:51:25 +03:00
eladash	37b6afaf2c	rsx: inlined array stride fix	2018-11-11 23:17:07 +03:00
eladash	75221a6078	rsx: Fix inlined vertex array validation	2018-11-04 22:57:18 +03:00
eladash	fb30c8a937	rsx enums: fix typos	2018-10-30 22:33:59 +03:00
eladash	4069470585	rsx-debugger: ignore invalid cmds basically ignore all non method cmds when scrolling to the next command, not only branches.	2018-10-30 22:33:59 +03:00
Megamouse	d56c85fe01	RSX/Capture: fix filePath and remove strict mode check (#5283 ) - Fixes regression introduced by kd-11 when merging in jarves' flip rework.	2018-10-27 13:06:50 +03:00
eladash	5ee351234c	rsx-capture: unbreak	2018-10-23 18:02:03 +03:00
elad	6829fa0286	rsx: Improve inlined arrays (#5248 ) * rsx: Implement register reads in inlined arrays * rsx: Check for disabled streams in inlined arrays	2018-10-20 16:00:53 +03:00
Nekotekina	1b37e775be	Migration to named_thread<> Add atomic_t<>::try_dec instead of fetch_dec_sat Add atomic_t<>::try_inc GDBDebugServer is broken (needs rewrite) Removed old_thread class (former named_thread) Removed storing/rethrowing exceptions from thread Emu.Stop doesn't inject an exception anymore task_stack helper class removed thread_base simplified (no shared_from_this) thread_ctrl::spawn simplified (creates detached thread) Implemented overrideable thread detaching logic Disabled cellAdec, cellDmux, cellFsAio SPUThread renamed to spu_thread RawSPUThread removed, spu_thread used instead Disabled deriving from ppu_thread Partial support for thread renaming lv2_timer... simplified, screw it idm/fxm: butchered support for on_stop/on_init vm: improved allocation structure (added size)	2018-10-19 22:22:35 +03:00
elad	623f1b35f6	rsx_capture/gcm: Fix tile binding (#5246 ) * gcm: Fix tile offset setting highest bit signifyies location, so ignore that while reading the offset. * rsx-capture: Fix tile binding fixes division by zero when dividing by pitch when the tile is not bound. * rsx-capture: Fix zcull binding	2018-10-12 19:05:08 +03:00
eladash	83b6c98563	rsx: Fix u16 index arrays overflow Force u32 index array destinations to avoid overflows when adding vertex base index.	2018-10-08 16:39:47 +03:00
eladash	e361e0daa6	rsx: Fix restart index check for u16 index arrays Dont ignore upper bits of the restart index with u16 types	2018-10-08 16:39:47 +03:00
Megamouse	49e5212a8f	RSX/Overlays: Add option for japanese button layout	2018-10-03 23:08:33 +02:00
Megamouse	76da3fa907	RSX/Overlays: don't press buttons on every iteration	2018-10-03 21:37:05 +02:00
Megamouse	9693d1c3a3	RSX/Overlays: formatted comments	2018-10-03 21:37:05 +02:00
eladash	348db050ae	rsx: Fix texture height read	2018-10-03 20:57:46 +03:00
eladash	62f97f2e5f	rsx: Fix default texture dimensions haha	2018-10-03 20:57:46 +03:00
eladash	fa723f6dc4	rsx: Fix texture depth read	2018-10-03 20:57:46 +03:00
eladash	a92ae827c1	rsx: Remove texture mipmap hack	2018-10-03 20:57:46 +03:00
eladash	6586090307	rsx: Remove texture size hack	2018-10-03 20:57:46 +03:00
eladash	eacd1b8f13	rsx: Remove texture address hack	2018-10-03 20:57:46 +03:00
Nekotekina	da6ce80f4f	Make vm::get_super_ptr return contiguous memory Cleanup RSX code complexity	2018-09-27 23:37:13 +03:00
eladash	72ba062b1a	rsx: Fix pfifo ret opcode	2018-09-27 17:47:32 +03:00
eladash	a47ebad24c	rsx: Fix pfifo nop cmd	2018-09-27 17:47:32 +03:00
Nekotekina	306f95a9ae	New named_thread template (preview) Old class named_thread renamed to old_thread It's too hard to move in a single commit	2018-09-27 14:04:16 +03:00
kd-11	bdf85ed900	rsx: Fixup for vertex attrib parsing - POS does not have to be fetched from ATTR[0] - Confirmed with UC1 that uses WEIGHT for positions - At least one POS stream has to exist to feed the position attribute which cannot repeat for a single triangle	2018-09-26 19:41:50 +03:00
kd-11	6a9f234dc7	rsx: Fixup flip behaviour - handle_emu_flip is very heavy, only fire	2018-09-26 19:41:50 +03:00
eladash	f056ef7a80	rsx-capture: Save initial method registers state	2018-09-26 17:55:08 +03:00
eladash	5b0ed1e2eb	rsx-capture: Map dynamically rsx data injection addresses	2018-09-26 17:55:08 +03:00
eladash	8e268aedc7	rsx-capture: Rewrite FIFO commands allocation	2018-09-26 00:58:24 +03:00
kd-11	f72157bcec	rsx: Fix vertex attrib parsing	2018-09-25 22:03:35 +03:00
kd-11	dab30c0051	rsx: Disable predictions if 50% of predictions are wrong - This happens often in loading screens where the memory usage pattern is often randomized by loading in of assets	2018-09-24 21:19:38 +03:00
kd-11	a3d44b5e1f	rsx: Cleanup changes for the flip patch	2018-09-24 16:44:02 +03:00
Jake	699eadc84f	rsx: Move render flip from rsx queue command to flip command	2018-09-24 16:44:02 +03:00
Rui Pinheiro	35139ebf5d	Texture cache cleanup, refactoring and fixes	2018-09-24 15:26:40 +03:00
Rui Pinheiro	f3029b2b42	Change Cell->RSX map/unmap notifications This allows for further flexibility on the RSX side, allowing us to fix some bugs and crashes in later commits.	2018-09-24 15:26:40 +03:00
eladash	06572c6011	rsx: Fix vertex count if all the streams are disabled	2018-09-24 13:25:05 +03:00
eladash	e0a676a3fe	rsx: Fix vertex arrays fetch with inlined draws	2018-09-24 13:25:05 +03:00
eladash	e8474145a5	rsx: Remove shader address verification this came from a misunderstanding of the register's use	2018-09-24 13:25:05 +03:00
kd-11	dafc914bcc	rsx: temporary hack - Removes all use of valid_count as a metric until the new refactor is merged	2018-09-21 16:32:23 +03:00
kd-11	2b6e6a9ae9	gl: Fix problems with framebuffer reuse - Matching attachments with resource id fails because drivers are reusing handles! - Properly sets up stale fbo ref counting and removal - Properly sets up resource reference test with subsequent removal to avoid using a broken fbo entry	2018-09-21 16:32:23 +03:00
kd-11	fc486a1bac	rsx: Preserve memory order when doing flush - Orders flushing to preserve memory at all cost - Avoids false positive where flushing overlapping sections can falsely invalidate another with head/tail test	2018-09-21 16:32:23 +03:00
kd-11	23dc9d54e3	rsx: Fix flip source selector	2018-09-21 16:32:23 +03:00
kd-11	a21bdb9f45	rsx; blit engine fixes - Forcefully downloads and reuploads data from the CPU in case of unexpected overlaps - Properly detect correct size of newly created blit targets - Remember to clear any existing views when changing the default component map!	2018-09-21 16:32:23 +03:00
eladash	1a6c819176	cellgcm: Fix SET_REFERENCE initial value	2018-09-20 01:05:40 +03:00
eladash	e6b68b260a	rsx: Improve FIFO mem faults handling increase the delay between faults, reduce log spam by allowing the messages to stack up	2018-09-20 01:05:40 +03:00
eladash	a8ea576b22	rsx/cellgcm: Implemet initialization registers reset	2018-09-20 01:05:40 +03:00
kd-11	d6dc1493cb	rsx/overlays: Implement blur, darkening and ability to disable custom background	2018-09-18 16:24:13 +03:00
kd-11	9f61fb5a78	overlays: Allow custom background for message dialog	2018-09-18 16:24:13 +03:00
Lassi Hämäläinen	7aef811ff7	CMake: Refactor CMake build (#5032 ) * CMake: Refactor build to multiple libraries - Refactor CMake build system by creating separate libraries for different components - Create interface libraries for most dependencies and add 3rdparty::* ALIAS targets for ease of use and use them to try specifying correct dependencies for each target - Prefer 3rdparty:: ALIAS when linking dependencies - Exclude xxHash subdirectory from ALL build target - Add USE_SYSTEM_ZLIB option to select between using included ZLib and the ZLib in CMake search path * Add cstring include to Log.cpp * CMake: Add 3rdparty::glew interface target * Add Visual Studio CMakeSettings.json to gitignore * CMake: Move building and finding LLVM to 3rdparty/llvm.cmake script - LLVM is now built under 3rdparty/ directory in the binary directory * CMake: Move finding Qt5 to 3rdparty/qt5.cmake script - Script has to be included in rpcs3/CMakeLists.txt because it defines Qt5::moc target which isn't available in that folder if it is included in 3rdparty directory - Set AUTOMOC and AUTOUIC properties for targets requiring them (rpcs3 and rpcs3_ui) instead of setting CMAKE_AUTOMOC and CMAKE_AUTOUIC so those properties are not defined for all targets under rpcs3 dir * CMake: Remove redundant code from rpcs3/CMakeLists.txt * CMake: Add BUILD_LLVM_SUBMODULE option instead of hardcoded check - Add BUILD_LLVM_SUBMODULE option (defaults to ON) to allow controlling usage of the LLVM submodule. - Move option definitions to root CMakeLists * CMake: Remove separate Emu subtargets - Based on discussion in pull request #5032, I decided to combine subtargets under Emu folder back to a single rpcs3_emu target * CMake: Remove utilities, loader and crypto targets: merge them to Emu - Removed separate targets and merged them into rpcs3_emu target as recommended in pull request (#5032) conversations. Separating targets probably later in a separate pull request * Fix relative includes in pad_thread.cpp * Fix Travis-CI cloning all submodules needlessly	2018-09-18 13:07:33 +03:00
elad	0d4f0f95cc	rsx: Clamp texture offsets (#5137 ) * rsx: Clamp texture offsets * rsx: Remove texture location check/hack	2018-09-17 10:55:17 +03:00
kd-11	16dcbe8c74	rsx/vp: Fix ARL opcode properly - NOTE: The address swizzle index is only for use as src. The address registers are only used one channel at a time. - When the destination of ARL, the encoding is the same as the other temp registers	2018-09-15 11:57:06 +03:00
elad	d24f9194f7	typo fix shader's location is decremented by one to match cellGcm's constants.	2018-09-13 16:49:58 +03:00
eladash	b9ad578b00	rsx: Add a default shader address state	2018-09-13 16:49:58 +03:00
Dzmitry Malyshau	d28c0d16f8	[vk] don't ask for VK_FORMAT_UNDEFINED support	2018-09-13 16:04:50 +03:00
scribam	4cb98014a2	rsx: tiny zcull optimizations	2018-09-13 12:43:40 +03:00
eladash	efbd77deb4	rsx: dont silently ignore null shader address	2018-09-12 00:40:20 +03:00
Dzmitry Malyshau	7c4693e271	[vk] Throw an exception on using a repder pass that the driver doesn't support	2018-09-11 23:44:56 +03:00
Dzmitry Malyshau	89ddeda24a	[vk] check for attachment format properties before pre-creating render passes	2018-09-11 23:44:56 +03:00
scribam	c8c8ce5e9e	rsx: add virtual destructor for default_vertex_cache	2018-09-10 19:59:37 +03:00
kd-11	f413996362	rsx: Minor texture cache fixes - Retag resources reprotected under flush_always rules - Properly check for blit resource fitting taking into account format mismatch, pitch mismatch and typeless transfers	2018-09-10 15:43:28 +03:00
Nekotekina	ed9fb8405b	Move rotate/cntlz/cnttz helpers to Utilities/asm.h	2018-09-08 00:32:04 +03:00
Dzmitry Malyshau	4e09573937	[vk] Check for BGRA8 linear image to support blitting	2018-09-07 22:02:02 +03:00
Dzmitry Malyshau	b454dde871	[vk] fix cubic view construction	2018-09-07 19:11:34 +01:00
Dzmitry Malyshau	27474316fd	Add missing virtual desctructors (#5094 )	2018-09-07 14:35:40 +03:00
kd-11	66610a28af	rsx/common: Clean up shared glsl header to minimize string concat operations	2018-09-06 21:11:11 +03:00
kd-11	346b97f871	rsx: Preserve fog coordinate across shader stages - The x value contains the VP output value interpolated across primitive surface - The y coordinate contains the fog fraction according to the selected fog formula	2018-09-06 21:11:11 +03:00
scribam	343656f66d	cleanup: remove unnecessary return and namespace declaration	2018-09-06 13:15:59 +03:00
scribam	2834c88de7	cleanup: remove intermediate const char* variables	2018-09-06 13:15:59 +03:00
scribam	f83d381e1e	clang-tidy: use nullptr	2018-09-06 13:15:59 +03:00
scribam	c4cff9b543	clang-tidy: remove redundant "apply_swizzle_remap" declaration	2018-09-06 13:15:59 +03:00
scribam	d7bb59cd99	c++17: use std::size	2018-09-06 13:15:59 +03:00
Nekotekina	ca5158a03e	Cleanup semaphore<> (sema.h) and mutex.h (shared_mutex) Remove semaphore_lock and writer_lock classes, replace with std::lock_guard Change semaphore<> interface to Lockable (+ exotic try_unlock method)	2018-09-03 23:00:36 +03:00
Nekotekina	ce4c4696dd	Try to get rid of SIZE_32 macro	2018-09-03 21:40:36 +03:00
kd-11	dea5193fd7	rsx: Fix FP temp register count	2018-09-03 21:39:18 +03:00
kd-11	2e0ecb556c	rsx: Possible fix for UB data type consistency	2018-09-03 18:24:20 +03:00
kd-11	5a08b690d5	gl: always clean up the heap when using legacy buffers	2018-09-03 18:24:20 +03:00
kd-11	815f4aa63b	vk: Improve logging of device names and driver versions	2018-09-03 18:24:20 +03:00
kd-11	e23435a190	vk: whitespace fix only	2018-09-03 18:24:20 +03:00
kd-11	85d38b5751	d3d12: restore working graphics	2018-09-03 18:24:20 +03:00
kd-11	6399833182	rsx: Fix endianness order when immediate mode register is updated, but used as register lookup - Simplify the code by unifying all the register-backed memory	2018-09-03 18:24:20 +03:00
scribam	bf89b709cb	Remove useless #include	2018-08-31 20:13:54 +04:00
scribam	7d0e94ab0a	Compilation fixes for optional on osx	2018-08-31 20:13:54 +04:00
Lassi Hämäläinen	79cf2832ae	Remove Utilities/variant.hpp and use C++17 variant - Remove also Utilities/variant_visitor.hpp - Fix variant and variant_visitor usages and #includes	2018-08-31 17:49:59 +04:00
elad	685eaedbf9	rsx: Fix typos (#5054 )	2018-08-30 00:47:48 +03:00
eladash	37ee0a2f55	Rsx/cellgcm: complete rsx_state::reset()	2018-08-29 13:37:50 +03:00
eladash	fc50e6abcb	Rsx: remove method registers reset cellGcm manually resets registers each flip, tested with cellGcmSetFlip	2018-08-29 13:37:50 +03:00
Dzmitry Malyshau	c452b43ebc	Basic support for Vulkan Portability on OSX	2018-08-28 21:20:11 +03:00
eladash	acf1286b49	Rsx: fix unknown cull faces	2018-08-28 10:47:24 +03:00
eladash	38a72cc6ee	Rsx: fix flip method registers reset driver flip does not reset registers	2018-08-28 10:47:24 +03:00
eladash	e279bdb304	Rsx: add missing default vertex shader attributes registers states	2018-08-28 10:47:24 +03:00
Nekotekina	a93a40e8d9	Disable texture_cache::emit_once (MSVC crash)	2018-08-25 01:58:28 +03:00
Nekotekina	1c6c24f8ac	Update GSL and yaml-cpp submodules	2018-08-25 01:15:47 +03:00
Nekotekina	923314aef5	Rewrite texture_cache::emit_once Also trying to workaround MSVC bug	2018-08-25 01:15:47 +03:00
Nekotekina	5afd12e8a4	[CMake] Use C++17	2018-08-24 16:44:19 +03:00
kd-11	21e1911112	vk: Improve fence wait logic	2018-08-24 14:52:22 +03:00
kd-11	76f102e865	vk: Check for allowed texbuffer range instead of assuming 64M	2018-08-24 14:52:22 +03:00
kd-11	f0a91484a0	vk: Implement double-buffered heaps for platforms without universal support for host visibility (APPLE)	2018-08-23 22:54:56 +03:00
kd-11	c6e35706a3	vk: Support sw component swizzle decode because metal sucks	2018-08-23 22:54:56 +03:00
kd-11	f3d3a1a4a5	rsx: Rework section reuse logic	2018-08-22 17:22:54 +03:00
eladash	874d18f761	rsx-debugger: bugfixes and improvements always translate given address add a few missing methods names *fix branches	2018-08-22 13:19:07 +03:00
eladash	56d553f10d	Rsx: fix cmd jump over put register	2018-08-22 12:20:31 +03:00
eladash	158019b50f	Rsx: fix translation when address is negative move address shift to where it should be, extend io table to catch all possible values.	2018-08-22 12:20:31 +03:00
kd-11	25ec3789fe	rsx: Fix present modes ordering - Newer nvidia drivers are not exposing IMMEDIATE present mode unless you change options in nvidia control panel This can cause severe performance degradation unless the vsync option is set to "off" in control panel	2018-08-19 15:56:07 +03:00
kd-11	7915dcb23c	rsx: Do not overflow the program buffer! - Some games overflow the program buffer e.g Resistance games The observed overflow is one instruction longer, likely an engine bug with counting instructions	2018-08-18 16:14:30 +03:00
kd-11	dd21e43ed5	rsx: Force disable draw reordering when capturing a frame	2018-08-18 16:14:30 +03:00
kd-11	937f1e8cd0	fix gcc build	2018-08-18 16:14:30 +03:00
kd-11	4b2b662c3a	rsx: Followup to the memory inheritance hierachy patch - Tags framebuffer resources on first use (when on_write is called to verify memory) - Texture cache now selects the best match and even sorts atlas writes with memory write order to avoid older data showing over newer one	2018-08-18 16:14:30 +03:00
kd-11	ec31157bc7	gl: Avoid unnecessary scissor state change every draw call.	2018-08-18 16:14:30 +03:00
kd-11	8c93db342f	gl: Reuse framebuffer resources - WIP optimizations for GL backend	2018-08-18 16:14:30 +03:00
kd-11	cca488d0cf	rsx: Enable swizzled decode for all formats unless proven otherwise - Some formats are proven to ignore swizzle flag - DXT compressed textures - COMPRESSED_BG_GB class textures - Some applications are using swizzled wide integer formats so those are confirmed to swizzle	2018-08-18 16:14:30 +03:00
kd-11	f8a9b1fa30	[WIP] rsx: Improve memory inheritance hierachy - Cascade memory writes by invalidating 'downstream' subsurfaces - Fixup; always resolve for overlapping surfaces before sampling (force atlas gather test)	2018-08-18 16:14:30 +03:00
kd-11	ba5b59dc59	gl: Do not create secondary context if async is disabled - Some third party programs fall apart when multiple contexts are created	2018-08-18 16:14:30 +03:00
kd-11	0f36e87010	zcull: Improve the delay algorithm to be more consistent - Use proper time checking; depending on what is being done one 'tick' can be almost a millisecond long or several nanoseconds - Avoid spamming the system timer unless necessary	2018-08-18 16:14:30 +03:00
kd-11	38191c3013	rsx: Avoid acquiring the vm lock; deadlock evasion - A possible deadlock is still present if rsx is trying to get a super_ptr whilst the vm lock holder is in an access violation This patch makes this scenario very unlikely since each block need only be touched once	2018-08-18 16:14:30 +03:00
kd-11	741ee9ac41	rsx: Allow linear filtering when reading back GPU-resident memory	2018-08-18 16:14:30 +03:00
kd-11	cc7848b3ef	vulkan: Fix blit engine transfer to ARGB8 render target memory	2018-08-18 16:14:30 +03:00
kd-11	e9d6096356	vk: Fixups for type b surfaces	2018-08-18 16:14:30 +03:00
kd-11	373e02e91c	rsx: Timestamp accuracy workaround	2018-08-18 16:14:30 +03:00
kd-11	1200ca8172	rsx: Optimize hash_struct; vk cleanup	2018-08-18 16:14:30 +03:00
kd-11	d0165290b6	rsx: Refactor and fix framebuffer layout checks - Refactors shared code back into rsx core - Adds extra check to avoid contest confusion	2018-08-18 16:14:30 +03:00
kd-11	0267221586	Minor optimizations and fixes - FIFO: avoid multiline spam - VK: Fix program setup counter - FS: Precalculate fragment constants buffer size during analysis step	2018-08-18 16:14:30 +03:00
kd-11	9f0fada17a	ZCULL: lower notice severity to avoid spam	2018-08-18 16:14:30 +03:00
kd-11	8800c10476	zcull synchronization tweaks - Implement forced reading when calling update method to sync partial lists - Defer conditional render evaluation and use a read barrier to avoid extra work - Fix HLE gcm library when binding tiles & zcull RAM	2018-08-18 16:14:30 +03:00
kd-11	3b47e43380	rsx: Synchronization rewritten - Do not do a full sync on a texture read barrier - Avoid calling zcull sync in FIFO spin wait - Do not flush memory to cache from the renderer side; this method is now obsolete	2018-08-18 16:14:30 +03:00
Rui Pinheiro	23b52e1b1c	Mark unsync textures dirty when deferred flushing invalidate_range_impl_base does not mark all textures that will only be unprotected as dirty when doing a deferred flush, since that is done by flush_all. However, if there are no sections to flush, the deferred flush will use the same code path as non-deferred flushes for unprotecting textures and forget to mark them as dirty. This commit fixes this bug.	2018-08-16 15:38:36 +03:00
Rui Pinheiro	fa6a5761b3	Refactor get_intersecting_set The existing implementation restarts the loop immediately after finding a range_data instance that updates the trampled_range. This commit refactors this method to continue the loop with the updated trampled_range, and then repeat only those range_data instances that were iterated through before the trampled_range was last updated. As a result, the number of total iterations required is reduced.	2018-08-16 15:38:36 +03:00
Rui Pinheiro	b534d49e48	Fix off-by-one error in get_intersecting_set When the trampled range changes, get_intersecting_set restarts the outer loop. However, due to an off-by-one error, it skips the first cache entry when doing so. This can cause a texture not to be correctly unlocked, which could lead to issues or even deadlocks. This commit fixes this off-by-one error.	2018-08-16 15:38:36 +03:00
Rui Pinheiro	f82ce06fdc	Fix typo in buffered_section::overlaps_page If "address" is not page-aligned, the calculated end address for the corresponding page is incorrect which can lead to false positives, causing some textures to be re-uploaded unnecessarily. This commit fixes this typo.	2018-08-16 15:38:36 +03:00
eladash	061c787e56	rsx-capture: unbreak	2018-08-16 03:27:11 +04:00
Nekotekina	aa4040bb7b	Implement vm::find_map; improve memory allocation Add vm::user64k and vm::user1m constants Remove vm::user_space, unreserve it	2018-08-14 15:14:06 +03:00
eladash	f349695a75	Rsx: rewrite address translation	2018-08-13 16:16:34 +03:00
eladash	9e380a4a4a	Rsx: avoid invalid cmds execution	2018-08-12 23:37:24 +03:00
eladash	c80eb1ba02	Rsx: fix CALL and RET cmd	2018-08-12 23:37:24 +03:00
Megamouse	8cb749110f	RSX Overlay: fix set text	2018-07-30 20:58:41 +02:00
Megamouse	d057c79733	RSX: use localtime_s instead of localtime	2018-07-28 23:10:45 +02:00
Megamouse	f82739afb0	RSX: Add performance overlay colors to configs use a string hex color code. Allowed string formats are: #FFFFFF or #FFFFFFFF (or both without #)	2018-07-28 23:10:45 +02:00
Megamouse	2211e95e41	RSX/Qt: add the possibility to center the performance overlay	2018-07-28 23:10:45 +02:00
Megamouse	67aff85e8e	RSX/Qt: Move rrc captures to captures dir	2018-07-28 23:10:45 +02:00
Megamouse	bb8fa28521	RSX: add config settings for shader compilation hint positions	2018-07-28 23:10:45 +02:00
Megamouse	eecb984689	RSX/Qt: add more performance overlay options to the gui	2018-07-28 23:10:45 +02:00
Megamouse	f8d396ac9e	change rsx_capture filename	2018-07-28 23:10:45 +02:00
kd-11	19d808d378	rsx/gl: Minor cleanup and optimization - Track register change status - Remove unused gl classes	2018-07-22 17:19:59 +03:00
kd-11	8695f95267	rsx: Reimplement cached textures and their views	2018-07-22 17:19:59 +03:00
scribam	65d270e5d8	clang-tidy: performance-faster-string-find https://clang.llvm.org/extra/clang-tidy/checks/performance-faster-string-find.html	2018-07-15 12:51:09 +04:00
kd-11	c35d0d16e7	vk: cleanup/optimization, remove unnecessary initialization code	2018-07-14 15:19:56 +03:00
kd-11	e7f30640ef	rsx: Async shader compilation - Defer compilation process to worker threads - vulkan: Fixup for graphics_pipeline_state. Never use struct assignment operator on vk** structs due to padding after sType member (4 bytes)	2018-07-14 15:19:56 +03:00
kd-11	fa55a8072c	rsx: Improve vertex textures support - Adds proper support for vertex textures, including dimensions other than 2D textures - Minor analyser fixup, removes spurious 'analyser failed' errors - Minor optimizations for program state tracking	2018-07-12 18:02:28 +03:00
kd-11	d266a25618	silence some annoying warnings	2018-07-09 13:06:00 +03:00
kd-11	1ddcad4fa4	facepalm - Fix openGL regression	2018-07-09 13:06:00 +03:00
kd-11	f0a9799b4b	rsx/Qt: Fix Qt shader loading dialog crash	2018-07-07 16:20:33 +03:00
kd-11	4d40ed9dbd	rsx: Silence harmless warning	2018-07-07 16:20:33 +03:00
kd-11	d78957d1cf	rsx/vp: CodeGen improvements - Fix double destination writes on conditional write masking - Fix codegen to simplify simple scalar comparisons vs vector functions	2018-07-07 16:20:33 +03:00
kd-11	2c34195954	rsx/vp: Discard broken vertex programs with no writes to POS register	2018-07-07 16:20:33 +03:00
kd-11	2ca935a26b	vp: Improve vertex program analyser - Adds dead code elimination - Fix absolute branch target addresses to take base address into account - Patch branch targets relative to base address to improve hash matching - Bumps shader cache version - Enables shader logging option to write out vertex program binary, helpful when debugging problems.	2018-07-07 16:20:33 +03:00
kd-11	bd915bfebd	rsx: vp decompiler fixes - Fix program abort logic to never abort before resolving later label addresses Fixes jumping over broken code and jumping over END markers - TEXTURE_CONTROL2 has indexing range of [0..15] without stride skipping! This register does not have interleaving with other texture registers - Track shader address poke as it seems to invalidate programs as well	2018-07-07 16:20:33 +03:00
kd-11	66854b78fa	rsx: Fix nv308a::color	2018-07-07 16:20:33 +03:00
eladash	345f92ab0a	rsx: more efficient command reading	2018-06-27 21:59:34 +03:00
kd-11	42851a93d4	vk: Fixup	2018-06-26 20:07:20 +03:00
kd-11	24f4c92759	rsx: Improve texture cache read speculation	2018-06-26 20:07:20 +03:00
kd-11	c3d75e208f	glslang: Bump to version 2776	2018-06-26 20:07:20 +03:00
kd-11	df2137781d	vk: Strip 'stencil' MSB when writing d24x8 data - Seems to contains garbage in MSB when DEPTH aspect is read back - TODO: Implement custom depth and stencil readback routine	2018-06-26 20:07:20 +03:00
kd-11	1e375e5210	gl: Fixup	2018-06-26 20:07:20 +03:00
kd-11	1730708f47	rsx: Rework memory protection management for framebuffer access - Avoid re-locking memory if there is no reason to do so (no draws issued) - Actively bound regions should always get written to the backing cache - Forcefully read memory during download if writes to the target have occured since last sync event	2018-06-26 20:07:20 +03:00
kd-11	bda65f93a6	vk: Tuning [WIP] - Unroll main compute queue loop - Do NOT run GPU cores on mappable memory! This has a dreadful impact on performance for obvious reasons - Enable dynamic SSBO indexing (affects AMD) - Make loop unrolling and loop length variable depending on hardware and find optimum	2018-06-26 20:07:20 +03:00
kd-11	d484253136	vk: Addendum - Synchronize graphics and compute pipelines with barriers	2018-06-26 20:07:20 +03:00
kd-11	5fb4009a07	vk; Add more compute routines to handle texture format conversions - Implement le D24x8 to le D32 upload routine - Implement endianness swapping and depth format conversions routines (readback)	2018-06-26 20:07:20 +03:00
kd-11	f45dcfe18a	rsx: Fix texture readback - gl: Fix up the calculation for internal image pitch - vk: Implement GPU-side resizing for read back textures (fixes WCB zoom)	2018-06-26 20:07:20 +03:00
kd-11	278cb52f19	facepalm	2018-06-26 20:07:20 +03:00
eladash	b9fd6ad37d	rsx-capture: unbreak	2018-06-24 15:14:37 +03:00
Jake	00c9b323c2	rsx: fix image_in to use in_pitch when swizzling	2018-06-24 14:29:41 +04:00
eladash	b456955688	rsx: fix hardcoded rsx allocation address	2018-06-24 10:57:30 +03:00
Nekotekina	d62b0c88b0	Restore /dev_flash/ config	2018-06-23 09:26:11 +03:00
Nekotekina	81e5f3b7f2	Remove /dev_flash/ configuration Simplify code by using root config location for /dev_flash/ Hide Emu.GetEmuDir() function due to the risk of misuse	2018-06-22 14:14:29 +03:00
eladash	3e433ef05c	create the shaderlog dir in Emu.Init()	2018-06-21 22:54:08 +04:00
kd-11	8f1c36d79f	rsx: Fix region pitch inaccuracy - Region pitch of 64 (disabled) can be used to indicate packed contents - do not assume it is the actual pitch! - Also fixes interaction of AA factors with lockable_region size	2018-06-21 13:08:50 +03:00
Megamouse	0ffaf6c1ce	Qt/RSX: reset gs_frame taskbar progr. when closing native progr. dialogs I forgot that the gs_frame's taskbar progress will stay unchanged due to its persistence, unlike the msgdialog.	2018-06-19 21:58:09 +03:00
VelocityRa	dd0684b58a	overlays/perf_overlay: Make pos, font, opacity, margin configurable - Also some perf overlay refactoring	2018-06-18 22:34:26 +03:00
VelocityRa	58f4b26134	overlays: Consider padding when auto-resizing label elements - Also make auto_resize return whether it changed the size	2018-06-18 22:34:26 +03:00
VelocityRa	84d1cf5065	overlays: Change how padding and margin work - Also add 3 helper functions to compiled_resource	2018-06-18 22:34:26 +03:00
VelocityRa	07731d0228	overlays: Fix text rendering position - It should be offset by one line, not half - Also tweak every overlay element affected by this to look as it used to	2018-06-18 22:34:26 +03:00
VelocityRa	44449dd9e9	overlays: Refactoring - Use names for overlay command config and vertex data instead of std::pair. - Make a couple of compiled_resource constructors explicitly named functions.	2018-06-18 22:34:26 +03:00
kd-11	c60f7b89ba	vk: Implement safe typeless transfer - Used to transfer D32S8 data where it makes sense to use this variant - On nvidia cards, it is very slow to move aspects from D24S8 probably due to the format being faked. For this reason, the unsafe variant is used for both D16 and D24S8 to avoid the heavy performance loss	2018-06-18 17:32:22 +03:00
kd-11	0d61bae1b9	vk: Performance fix for RADV - RADV does not keep a mapping ptr around for subsequent remap and falls back to heavy amdgpu methods every time Explicitly manage pointer in the ring buffer structure to fix this	2018-06-18 17:32:22 +03:00
kd-11	d77e62c94e	rsx: Improve GPU resource read prediction	2018-06-18 17:32:22 +03:00
kd-11	2afcf369ec	vk: Add synchronous compute pipelines - Compute is now used to assist in some parts of blit operations, since there are no format conversions with vulkan like OGL does - TODO: Integrate this into all types of GPU memory conversion operations instead of downloading to CPU then converting	2018-06-18 17:32:22 +03:00
kd-11	0d5c071eee	vk: Implement typeless image transport	2018-06-18 17:32:22 +03:00
kd-11	dd4c13b625	rsx: Avoid race conditions in unsynchronized unprotect	2018-06-18 17:32:22 +03:00
scribam	6f3a63df94	fix compiler warning "-Winconsistent-missing-override"	2018-06-13 20:22:53 +02:00
Megamouse	a8f19fbfae	RSX: fix shader cache progress bar exit state shenanigans	2018-06-11 22:41:38 +03:00
Megamouse	4003aacc6a	RSX: add taskbar progress to native ui progress dialogs	2018-06-08 23:41:56 +03:00
Megamouse	b9b6bd85a6	Qt/RSX: add taskbar progress in msg_dialog_frame for shader compilation	2018-06-08 23:41:56 +03:00
Megamouse	edc9e9b4ec	Qt/RSX: add window title to the shader compilation msg_dialog	2018-06-08 23:41:56 +03:00
kd-11	da9affb348	vk: Fix overlays for variable sampler count	2018-06-08 22:17:50 +03:00
kd-11	00eaf39c01	vk: RADV support for depth scaling and transfer	2018-06-08 22:17:50 +03:00
kd-11	63f803b68a	Refactoring - Make the memory allocator a unique child of the render device. Fixes object lifetime issues with swapchain management due to cyclic dependencies	2018-06-08 22:17:50 +03:00
kd-11	c9e367befd	rsx/debug: Fix rendering when FIFO reordering is disabled	2018-06-08 22:17:50 +03:00
kd-11	1c5667f0ce	vk: Avoid use-after-free fence object	2018-06-08 22:17:50 +03:00
kd-11	e9c3ab7ae6	fix some linux issues - Fix build - Fix VMA incompatibility with swapchain_NATIVE	2018-06-08 22:17:50 +03:00
kd-11	1b9c9267f0	rsx: Update memory flags after memory transfer	2018-06-08 22:17:50 +03:00
kd-11	fc18e17ba6	vk: Implement depth scaling using hardware blit/copy engines - Removes the old depth scaling using an overlay. It was never going to work properly due to per-pixel stencil writes being unavailable - TODO: Preserve stencil buffer during ARGB8->D32S8 shader conversion pass	2018-06-08 22:17:50 +03:00
kd-11	3150619320	rsx: Preserve read AA state separate from write AA state - Some applications (e.g Backbreaker) use an evil hack to resolve MSAA. The application respecifies a formerly AA region as a region with no AA then performs a framebuffer feedback lookup. The old memory keeps AA during read, but writes back to itself with AA resolved. This is evil on several levels but it just happens to work on PS3	2018-06-08 22:17:50 +03:00
kd-11	0f24379c0e	rsx: Obey MSAA resolve during memory persistence transfer - Ugh. This is a bandaid on a festering wound, AA badly needs a rewrite Also silence some warnings	2018-06-08 22:17:50 +03:00
Dravonic	400079a006	Parallel shader cache loading (#4677 ) * Parallel shader cache loading	2018-06-01 19:49:29 +03:00
kd-11	87b510d5bf	vulkan improvements - Remove deprecated device layers - Reimplement overlays resource management using real heap instead of using first_vertex hack	2018-05-31 11:22:40 +03:00
kd-11	9f9e1b5fe0	overlay; Fix leak	2018-05-31 11:22:40 +03:00
kd-11	cf2cb7978b	facepalm	2018-05-30 13:30:23 +03:00
kd-11	f543fb0243	vk/gl: Fix flush synchronization to be kinder to weaker CPUs but not harm higher end CPUs	2018-05-30 13:30:23 +03:00
kd-11	6362942928	rsx: Avoid semaphore acquire deadlock	2018-05-30 13:30:23 +03:00
VelocityRa	c8d8a81ccd	overlays: Performance Overlay	2018-05-30 12:35:41 +03:00
VelocityRa	c2e17d04e1	overlays: Font improvements * Support for using arbitrary firmware fonts * Support for specifying font extension in `font` constructor (useful for most firmware fonts that use .TTF)	2018-05-30 12:35:41 +03:00
VelocityRa	33b01d9306	overlays: Allow for non-interactable UI components * Also fix a few warnings in overlay_controls	2018-05-30 12:35:41 +03:00
eladash	23b380eb41	allow deallocations to unmap rsx mapped memory	2018-05-29 19:57:28 +03:00
kd-11	83f9be2524	rsx: Promote FIFO optimizations outside of strict mode - The benefits of FIFO optimizations are huge in some cases. The optimizations also do not break any tested applications so no need to disable with strict mode - A debug option is provided to disable this behaviour for testing	2018-05-29 13:54:30 +03:00
kd-11	2adb2ebb00	overlays: Avoid race condition on remove-on-update views - Improves cleanup code to consist of 2 parts, remove then dispose. Remove does not deallocate the item until dispose is called on it, allowing the backends to first deallocate external references. - Caller is responsible for managing list locking and tracking disposable list of items when external references have been cleaned up before using dispose method.	2018-05-29 13:54:30 +03:00
kd-11	0fc67aa2f6	gl: fix wcb regression - Partial framebuffers and blit targets are possible!	2018-05-24 10:36:04 +03:00
kd-11	493d4e8613	fixup - Improve invalidated region checks for performance	2018-05-24 10:36:04 +03:00
kd-11	b030d1900c	rsx: Fixup - fix broken memory protection fail caused by region respec - Some applications will alternate memory between framebuffer and texture data	2018-05-24 10:36:04 +03:00
kd-11	f38f61d110	vk: Clean up memory allocation and fix GPUOpen VMA for Radeon	2018-05-23 19:07:08 +03:00
kd-11	92b5a705d8	fixup - locking	2018-05-23 19:07:08 +03:00
kd-11	b957eac6e8	rsx: Avoid calling any blocking callbacks from threads that are not rsx::thread - Defers on_notity_memory_unmapped to only run from within rsx context - Avoids passive_lock + writer_lock deadlock	2018-05-23 19:07:08 +03:00
kd-11	d2bf04796f	Optimized cached write-through - Allows grabbing an unsynchronized memory block if overwriting contents anyway - Allows flushing only specified range of memory	2018-05-23 19:07:08 +03:00
kd-11	f8d999b384	fixup - range check	2018-05-23 19:07:08 +03:00
kd-11	fbf6581249	rsx: Fix segmented memory access for rsx::super_ptr	2018-05-23 19:07:08 +03:00
kd-11	d283200e13	vk: Do not do extension test if in a fast context (enum only)	2018-05-23 19:07:08 +03:00
kd-11	3f14bc6961	rsx: Silence some meaningless error	2018-05-23 19:07:08 +03:00
kd-11	f2a3167193	rsx: Lower format compatibility severity since it confuses some people	2018-05-23 19:07:08 +03:00
kd-11	8fcd5c1e5a	rsx: Texture cache fixes 1. rsx: Rework section synchronization using the new memory mirrors 2. rsx: Tweaks - Simplify peeking into the current rsx::thread instance. Use a simple rsx::get_current_renderer instead of asking fxm for the same - Fix global rsx super memory shm block management 3. rsx: Improve memory validation. test_framebuffer() and tag_framebuffer() are simplified due to mirror support 4. rsx: Only write back confirmed memory range to avoid overapproximation errors in blit engine 5. rsx: Explicitly mark clobbered flushable sections as dirty to have them removed 6. rsx: Cumulative fixes - Reimplement rsx::buffered_section management routines - blit engine subsections are not hit-tested against confirmed/committed memory range Not all applications are 'honest' about region bounds, making the real cpu range useless for blit ops	2018-05-23 19:07:08 +03:00
pauls-gh	f8a0be8c3e	Performance enhancement - Vulkan memory allocator (#4635 ) * Incorporates the vulkan memory allocator from the AMD GPUOpen project	2018-05-23 17:02:35 +03:00
scribam	2270b8d15c	vulkan: link with vulkan-1.lib instead of VKstatic.1.lib	2018-05-23 13:54:27 +03:00
Nekotekina	72574b11ff	SPU: use reservation spinlocks on writes (non-TSX) This should decrease contention by avoiding global lock	2018-05-21 21:56:14 +03:00
kd-11	c9669818eb	Facepalm - overlays: Do not free self handle!!!!	2018-05-21 15:55:25 +03:00
kd-11	f6f45b8699	Native UI refactored (#4623 ) Refactor and improve native overlays	2018-05-20 23:05:00 +03:00
Nekotekina	367f039523	Build transactions at runtime Drop _xbegin family intrinsics due to bad codegen Implemented `notifier` class, replacing vm::notify Minor optimization: detach transactions from global mutex on TSX path Minor optimization: don't acquire vm::passive_lock on PPU on TSX path	2018-05-16 17:31:58 +03:00
scribam	04ad49de4d	typos	2018-05-14 21:14:39 +04:00
kd-11	4836a03a7d	rsx: Fix build	2018-05-13 14:44:14 +03:00
kd-11	9d1f4a2538	vk: Include RADV POLARIS and RADV VEGA in the primitive restart blacklist	2018-05-13 14:44:14 +03:00
kd-11	bff6060bd6	rsx: Improve puller state management - Properly identify puller spin primitives - Add a small wake delay after exiting a spin delay. Fixes desynchronization It seems real hw has a small delay between cell edits to commandbuffer memory at the GET address and the changes becoming visible to the DMA puller Simulated with a short busy_wait, large values will improve sync but degrade performance	2018-05-13 14:44:14 +03:00
kd-11	1aa44ede31	gl: Improve AMD multidraw workaround - Reimplements the AMD workaround using an identity buffer to avoid the performance hit of doing multiple glDrawArrays for every single compiled set - Reimplements first/count allocation using a scratch buffer to reduce allocation overhead when large number of draw calls is used	2018-05-13 14:44:14 +03:00
kd-11	eccb57d4b8	vk: AMD primitive restart bug workaround - Emulate primitive restart with degenerate triangles	2018-05-13 14:44:14 +03:00
kd-11	b7979d3f57	rsx/vk: Improvements and minor optimizations - Improve dirty state tracking affecting program state - vk: Refactor out transform constants upload into a separate channel to avoid if possible transform data uploads are quite expensive	2018-05-13 14:44:14 +03:00
kd-11	440a31ef18	rsx: Optimizations for program management	2018-05-13 14:44:14 +03:00
kd-11	a52ea7f870	rsx: Improve fragment and vertex program usage - Introduces a gpu program analyser step to examine shader contents before attempting compilation or cache search - Avoids detecting shader as being different because of unused textures having state changes - Adds better program size detection for vertex programs - Improved vertex program decompiler - Properly support CAL type instructions - Support jumping over instructions marked with a termination marker with BRA/CAL class opcodes - Fix SRC checks and abort - Fix CC register initialization - NOTE: Even unused SRC registers have to be valid (usually referencing in.POS)	2018-05-13 14:44:14 +03:00
Jake	75b40931fc	rsx: initial capture/replay functionality (#4510 ) * rsx: initial capture/replay functionality	2018-05-13 12:18:05 +03:00
Nekotekina	5d15d64ec8	Memory mirror support Implemented utils::memory_release (not used) Implemented utils::shm class (handler for shared memory) Improved sys_mmapper syscalls Rewritten ppu_patch function Implemented vm::get_super_ptr (ignores memory protection) Minimal allocation alignment increased to 0x10000	2018-05-09 23:35:34 +03:00
kd-11	98b715d8c8	gl: Workaround for AMD driver bug	2018-04-25 19:14:36 +03:00
kd-11	ffa62918aa	gl: Improve pixel transfer code and notify on AMD driver bug - Readback does not work at all with float textures on AMD openGL Driver throws a bogus OUT_OF_MEMORY error regardless of amount of VRAM and system RAM available	2018-04-25 19:14:36 +03:00
kd-11	f3210a9a33	rsx: Workaround for lost memory sections - TODO: surface_cache and texture_cache need a better method of persisting partial framebuffer resources	2018-04-25 19:14:36 +03:00
kd-11	58035697d5	rsx: Restore component mapping override for depth textures	2018-04-25 19:14:36 +03:00
kd-11	7e32e7343a	vk: Reorganize handling of formats support - Formats support is linked to the physical device and by extension the logical device derived from it It therefore makes no sense to track this as a separate object. Simplifies parameter passing and template specialization. Also avoids corner cases with AMD hardware (where D24S8 is not supported)	2018-04-25 19:14:36 +03:00
kd-11	291a828217	fixups	2018-04-25 19:14:36 +03:00
kd-11	40ae5e605d	vk: Fix border color selection	2018-04-25 19:14:36 +03:00
kd-11	c5d1f30e82	rsx: Fix performance counters - Detect jump-to-self type idling	2018-04-25 19:14:36 +03:00
kd-11	91a6091d26	rsx: Minor fixes - vk: Clear dirty textures before copying 'old contents' in case the old data does not fill the new region - rsx: Properly decode border color - seems to be in BGRA format - vk: better approximation of border color to better choose between the presets - vk: Individually clear color images outside render pass and without scissor - vk: Fix renderpass selection for clear overlay pass - vk: Include scissor region when emulating clear mask NOTES: - vk: Completely avoid using vkClearXXXXimage - its 'broken' on nvidia drivers Spec is vague about the function so its not an actual bug ClearAttachment is clearly defined as bypassing bound state which works correctly - TODO: Implement memory sampling to simulate loading precleared memory if cell used memset to preinitialize the framebuffer Autoclear depth to 1\|255 and color to 0 is hacky!	2018-04-25 19:14:36 +03:00
kd-11	da99f3cb9a	rsx: Critical fixes - texture cache: Avoid leaking memory sections - Avoid double ref increment on flush-always reprotection - Detect invalidated_resources entries in surface cache when protecting fbo memory - vk: Copypasta bugfix, properly initialize aspect mask	2018-04-25 19:14:36 +03:00
kd-11	a42b00488d	rsx: Texture fixes - gl/vk: Fix subresource copy/blit - gl/vk: Fix default_component_map reading - vk: Reimplement cell readback path and improve software channel decoder - Properly name the subresource layout field - its in blocks not bytes! - Implement d24s8 upload from memory correctly - Do not ignore DEPTH_FLOAT textures - they are depth textures and abide by the depth compare rules - NOTE: Redirection of 16-bit textures is not implemented yet	2018-04-25 19:14:36 +03:00
kd-11	63d9cb37ec	rsx: Framebuffer fixes Primary: - Fix SET_SURFACE_CLEAR channel mask - it has been wrong for all these years! Layout is RGBA not ARGB/BGRA like other registers Other Fixes: - vk: Implement subchannel clears using overla pass - vk: Simplify and clean up state management - gl: Fix nullptr deref in case of failed subresource copy - vk/gl: Ignore float buffer clears as hardware seems to do	2018-04-25 19:14:36 +03:00
kd-11	9abbbb79ae	rsx: Blit engine fixes - Ignore unlocked blit sections [TODO] - Do not attempt blit on hw if bytesize is unsupported - gl: Implement typeless memory transfers Uses pbo to handle type-agnostic memory transfer	2018-04-25 19:14:36 +03:00
kd-11	bb5622401c	overlays/gl: minor fixes - fix ogl color map for overlay resources - fix label background for save dialog	2018-04-25 19:14:36 +03:00
kd-11	6d46ac1ad6	gl: Reimplement textures - Separate texture data from texture views	2018-04-25 19:14:36 +03:00
kd-11	cf1b700ebc	rsx: Improve format mismatch detection hack	2018-04-25 19:14:36 +03:00
kd-11	c5cd758700	rsx: Workaround for G8B8 render targets - Mainly affected are colormasks and read swizzles NOTES: - Writes to G write to the second and fourth component (YW) - Writes to B write to first and third component (XZ) - This means the actual format layout is BGBG (RGBA) making RG mapping actually GR - Clear does not seem to have any intended effect on this format (TLOU)	2018-04-25 19:14:36 +03:00
JohnHolmesII	7303f04bc5	Minor bugfix	2018-04-10 15:06:56 +03:00
Talkashie	64992f758d	Fix typos (#4410 ) * MASSIVE TYPO FIX part 1 * ANOTHER HUUUUGE TYPO FIX part 2 * thank you :hcorion: for all of your help. I could not have done this without you	2018-04-08 01:01:39 +01:00
kd-11	568118634e	vk: Squash some spec violations that went unnoticed	2018-04-05 01:06:50 +03:00
kd-11	cfd0b8a975	rsx: Fix alphakill	2018-04-05 01:06:50 +03:00
kd-11	93b2776604	rsx: Fix vertex input detection - Properly detect inline array registers vs constant value registers - Silence needless spam, 306E is 2D surface engiine, the assumption that y is multiplied by 306E pitch is not crazy	2018-04-05 01:06:50 +03:00
kd-11	53f2533a08	rsx: Implement proper Z-order curve in 3 dimensions - Should fix garbage palette textures getting uploaded (LSD graphics)	2018-04-05 01:06:50 +03:00
kd-11	e291494282	rsx: Texture cache updates - Properly implement section gather for 3d and cubemaps Implements render-to-3d and fixes some corner cases for render-to-cubemap	2018-04-05 01:06:50 +03:00
Nekotekina	da9baac842	Improve save data dialog Bug fix: don't display new data entry when not asked for Use icon/title provided by the game for the new data entry Display new data entry at the beginning of list when necessary Minor cellSaveData cleanup	2018-04-02 16:27:28 +03:00
Jake	6d6d6fa827	dx12/vk/gl: implement use of vertex_data_base_index when calculating index	2018-03-30 13:30:04 +03:00
kd-11	ee0fe28ddc	rsx: Fix copypasta	2018-03-29 13:52:11 +03:00
kd-11	7627ad04f1	rsx: Disable gamma control on WZYX textures - Gamma is seemingly used for (D/X/A)RGB only. Data textures are unaffected	2018-03-29 13:52:11 +03:00
kd-11	f559c088a1	rsx: Dubious fix for broken B8 swizzle remap	2018-03-29 13:52:11 +03:00
pauls-gh	a17025c465	Strict Rendering Mode (SRM) fix. Move old surface copy before texture upload. Fixes the following issues on Tales of Vesperia which requires SRM. - Blacked out scene after the sleeping dog now renders correctly - Ghosting effect. The ghosting was most noticeable as a delay between the character rendering and the cell shading around the character. This appears to be gone with this change.	2018-03-29 11:01:58 +03:00
kd-11	fca6c4fb7f	vk: Improve compatibility workarounds - Disable fence reset on gcn1 - gcn3 - Enable polaris+ workarounds on linux if proprietary driver is used	2018-03-28 11:30:11 +03:00
Kravickas	e6cb4df877	Fix a typo in gcm_enums.cpp (#4333 ) * Update gcm_enums.cpp	2018-03-27 18:43:38 +01:00
kd-11	5aac8aa424	rsx: Clamp negative fog distance	2018-03-25 16:02:47 +03:00
kd-11	887ea43e39	rsx: Fix some texture cache problems - gl/vk: Properly handle remapping temporary resources	2018-03-25 13:31:06 +03:00
kd-11	9fce5b0f7a	gl: Fix leaking occlusion queries - GL queries share the target binding (not asynchronous!) - Discard active queries by closing them, leave closed queries alone (nothing to be done for discard op)	2018-03-25 13:31:06 +03:00
kd-11	22af70d0d0	gl: Always use indexed blend caps to avoid conflict with the state cache. - glEnable/glDisable should not be used with GL_BLEND as the main renderer uses the indexed variant	2018-03-25 13:31:06 +03:00
kd-11	321c360dcb	rsx: Overhaul rendertarget sampling/shuffles - Reimplements render target views used for sampling - Optimizes access using an encoded control token - Adds proper encoding for 24-bit textures (DRGB8 -> ORGB/OBGR) - Adds proper encoding for ABGR textures (ABGR8 -> ARGB8) - Silence some compiler warnings as well - TODO: Real texture views for OGL current method is a hack	2018-03-25 13:31:06 +03:00
kd-11	9bb1ed78f9	gl: Implement video-out calibration for gamma and dynamic range - Seems to be of limited use but if it is determined to be useful, a vulkan implementation can be done	2018-03-25 13:31:06 +03:00
kd-11	9fc1740608	rsx/fp: Fragment program overhaul - Separate TXB from TXL: They are completely different! - Properly perform TMU emulation in the fragment shader. Implemens SRGB conversion and alphakill at the moment - Properly perform ROP emulation in the fragment shader. Implements FRAMEBUFFER_SRGB. While support on the chip looks to be incomplete (and wierd), it does work - Document some more bits in SHADER_CONTROL register	2018-03-25 13:31:06 +03:00
kd-11	c6a2525c9b	video out: Set up video configuration options in fxm - TODO: Actually do something with the values	2018-03-25 13:31:06 +03:00
kd-11	aeebeed0f2	vk: Fix AMD primitive restart emulation when strict mode is active The restart emulation is there to keep the proprietary drivers from randomly crashing when using primitive restart	2018-03-25 13:31:06 +03:00
kd-11	9f416e5ce1	rsx/gl/vk: Obey channel remapping on framebuffer resources if requested	2018-03-25 13:31:06 +03:00
kd-11	27552891ad	rsx/fp: Improvements - Export some debug information in the free texture register space components zw Very useful when analysing renderdoc captures - Enable shadow comparison on depth as long as compare function is active and texture is uploaded for depth read Some engines (UE3) read all the components in the shader and use mul/mad with the result	2018-03-25 13:31:06 +03:00
kd-11	5817f9fe3f	rsx: Texture format fixes - Implement SRGB (gamma corrected) textures (DXT1, DXT3, DXT5, RGBA8 only) - Fix channel map decode for XY data texture formats - Fix remap layout for X16 textures (verified with Mass Effect 3)	2018-03-25 13:31:06 +03:00
scribam	50446f7fef	Partial compilation fixes for osx	2018-03-24 11:14:40 +00:00
pauls-gh	44cddda4b4	Fix VTC source index increment	2018-03-23 12:01:30 +03:00
pauls-gh	d79a544320	VTC tiling - fix source offset increment.	2018-03-23 12:01:30 +03:00
pauls-gh	e5b4710471	Add end condition for VTC copy. This handles the case when depth is not a multiple of 4.	2018-03-23 12:01:30 +03:00
pauls-gh	e6010ba2ca	Fix code formatting	2018-03-23 12:01:30 +03:00
pauls-gh	fd8d2ecbf4	Remove Volume Texture Compression (VTC) tiling for Vulkan, DX12 and ATI (OpenGL).	2018-03-23 12:01:30 +03:00
Megamouse	9d961f620b	rsx/Qt: add option to disable the shader compilation hint	2018-03-22 16:33:37 +04:00
GeniusMage	4e2a8a3cdb	Teepo fix 2D isn't always > 3D	2018-03-21 13:40:34 +03:00
kd-11	92fb828d52	gl: Compat support for mesa drivers Needs CLIENT_STORAGE bit set for persistent buffers to make them useful	2018-03-20 00:11:41 +03:00
kd-11	d13584f858	rsx: fixups gl/vk: Bump shader cache version gl/vk: Disable anisotropic override when strict mode enabled as it is proven to alter some games negatively gl: Clamp buffer view range to not exceed the backing buffer size. Also add assert for the same condition	2018-03-19 12:13:34 +03:00
kd-11	5f047034ae	rsx: Disable async count verification to avoid lockup due to zombie reports in ZCULL	2018-03-13 18:55:03 +03:00
kd-11	ffe6c9ba5a	fix linux builds	2018-03-13 18:55:03 +03:00
kd-11	910fc54ee2	vk: Implement reading from cell if swap image isn't found	2018-03-13 18:55:03 +03:00
kd-11	f00d9a7c7f	rssx" Halfplement alpha-to-coverage AA transparency	2018-03-13 18:55:03 +03:00
kd-11	2dce55d036	rsx: ZCULL synchronization fixes - Track asynchronous operations in RSX core - Add read barriers to force pending writes to finish. Fixes zcull delay flicker in all UE3 titles without forcing hard stall - Increase zcull latency as all writes should be synchronized now	2018-03-13 18:55:03 +03:00
kd-11	315798b1f4	rsx: ZCULL rewrite and other improvements - ZCULL unit emulation rewritten - ZCULL reports are now deferred avoiding pipeline stalls - Minor optimizations; replaced std::mutex with shared_mutex where contention is rare - Silence unnecessary error message - Small improvement to out of memory handling for vulkan and slightly bump vertex buffer heap	2018-03-13 18:55:03 +03:00
kd-11	dece1e01f4	rsx: Improve transform constants management - Removes the duplicate local_transform_constants - Resets the transform constants on every context reset - Simplifies the code abit which should make it faster - NOTE: Transform constants are persistent across context re-init events (VF5)	2018-03-13 18:55:03 +03:00
kd-11	a19ffba8e8	rsx: Simplify MRT blend setup; Enable separable MRT blend on vulkan and fix corner cases for GL	2018-03-13 18:55:03 +03:00
kd-11	e230867492	rsx: Properly implement raster window offsets	2018-03-13 18:55:03 +03:00
kd-11	0c8e4c0887	rsx: Improve FIFO commandlist flattening - TODO: Alot of work is still needed to execute draw commands out of order Thats the only solution to games sending many draw calls with high frequency of state changes	2018-03-13 18:55:03 +03:00
kd-11	84b8a08d26	rsx: Basic performance counters	2018-03-13 18:55:03 +03:00
kd-11	2855869530	vulkan: Support APPLE - Adds support for compilation on MAC with moltenVK. Note that vulkan does not work on MacOS yet. There are two main blockers:- 1) Texture component swizzles are not supported except for RGBA8_UNORM->BGRA8_UNORM. 2) There is a bug in their SPIR-V -> MSL generator. GLSL.std.450.xxxx functions are not implemented which breaks rpcs3 functionality. Trying to compile a vertex shader will throw because unpackHalf2x16 is missing.	2018-03-13 18:55:03 +03:00
kd-11	d41b49d8b4	rsx/fp: Color output registers are always present and zero initialized - According to NV_fragment_program spec, registers are zero initialized always - A program even without writing to these registers will have black (0, 0, 0, 0) output Confirmed behaviour with MotorStorm games. Their engine uses this quirk to clear color buffers when doing depth replace Might be an unfixed game bug	2018-03-13 18:55:03 +03:00
kd-11	4804efc17d	rsx: Clear up confusion on depth writes. According to the NV_fragment_program spec, its not feasible to have 16-bit depth wries NOTE: NV_fragement_program precedes NV_fragment_program2 which is very close to what RSX consumes. It is hardware from that era afterall	2018-03-13 18:55:03 +03:00
kd-11	053ab585f4	gl/vk: Clean up some format casts - TODO: Byte ordering considerations on data casts	2018-03-13 18:55:03 +03:00
kd-11	20d4c09a1c	rsx/vk/gl: Enforce format matching for render target resources. Fall back to raw data copy if match fails - Forces Bitcast of texture data if input format cannot possibly be the same as the existing texture format - rsx: Other minor improvements to texture cache :- - remove obsolete blit engine incompatibility warning. The texture will be re-uploaded if it is indeed incompatible - Implement warn_once and err_once to avoid spamming the log with systemic errors - Track mispredicted flushes - Reswizzle bitcasted texture data to native layout TODO: Also needs reshuffle according to input remap vector	2018-03-13 18:55:03 +03:00
kd-11	68b3229756	rsx/fp: Improve rgister component gather detection - Also avoids clobbering register data by keeping gathered bits in a temp var	2018-03-13 18:55:03 +03:00
kd-11	87741141f1	rsx/vulkan: Add post-compilation key validation and dynamically determine attachment write maks based on decompiled shader - A new step is added between decompilation and pipeline object creation allowing for properties to be updated based on shader contents - Allos masking off attachment writes that are unmodified in the shader	2018-03-13 18:55:03 +03:00
kd-11	705820c430	rsx: Nvidia driver compatibility workarounds - Sanitize NaN values before they reach the driver. On nvidia (X * NaN = X)	2018-03-13 18:55:03 +03:00
kd-11	6b23e733d0	rsx/gl/vk: Improvements - gl: Do not call makeCurrent every flip - it is already called in set_current() - gl: Improve ring buffer behaviour; use sliding window to view buffers larger than maximum viewable hardware range NV hardware can only view 128M at a time - gl/vk: Bump transform constant heap size When lots of draw calls are issued, the heap is exhaused very fast (8k per draw) - gl: Remove CLIENT_STORAGE_BIT from ring buffers. Performance is marginally better without this flag (at least on windows)	2018-03-13 18:55:03 +03:00
kd-11	07cbf3da48	rsx/gl: Minor fixes - Identify depth textures reaching the gpu via shader_read upload path - Use correct timestamp counter for opengl - inline draw_state::test_property because msvc doesnt do it for us	2018-03-13 18:55:03 +03:00
kd-11	af1b13550b	rsx/vk: More optimizations - Do not bother rechecking the dirty sampler pool for hits. Its faster to create new sampler than to search the pool - Reserve some memory on vertex layout struct to reduce reallocation penalty	2018-03-13 18:55:03 +03:00
kd-11	8ccaabb502	vulkan: Optimize vertex data upload - Reuse buffer views as much as possible, vkCreateBufferView is slow on NV Implemented as a large sliding window, reuseable until it is filled	2018-03-13 18:55:03 +03:00
kd-11	01349b8cee	rsx: Texture cache fixes - Optionally attempt to merge framebuffers into an atlas if partial resources are missing - Support for data update requests to the temporary subresource handler This is useful for framebuffer feedback loops where a new copy is needed after every draw call (resource is always dirty)	2018-03-13 18:55:03 +03:00
kd-11	4487cc8e7a	Remove an ugly hack pertaining to partial framebuffer-resident texture data - Its better to fill in the missing information with a wrap or clamp than to fake the texture reads in valid regions - Texture coordinate scaling is used to fill in for the cropped dimension available	2018-03-13 18:55:03 +03:00
Jake	3a9428a88e	rsx: Fix alpha test on VK/GL Moving alpha test outside of fp_main. This enforces alpha test is still done even if shader asm has early return	2018-03-09 18:06:20 +03:00
Jake	7233640cf0	rsx: add vertex data base to offset and mask before translating address	2018-03-07 16:57:20 +03:00
Megamouse	6141bc5598	Fix firmware installation for vfs	2018-02-28 17:27:06 +04:00
kd-11	4c6ceeff11	rsx/vk: Cleanup - Silence some warnings, remove dprints	2018-02-21 14:59:46 +03:00
kd-11	77f2b521e1	vulkan: Swapchains reimplemented - Adds support for abstract implementations - Adds native windowing implementations for WIN32 and X11 as fallbacks when present support is lacking (headless configs)	2018-02-21 14:59:46 +03:00
kd-11	ee88e7f948	rsx/gl: Hotfix for GL format compatibility check when high precision depth buffers is enabled	2018-02-16 16:14:54 +03:00
kd-11	ebb9fc7cd9	vk/drivers: Restore polaris+ patch to disable primitive restart	2018-02-16 16:14:54 +03:00
kd-11	32f0e91893	vulkan: Improve primitive restart workaround for vega/polaris - For some reason the hardware forgets that primitive restart is enabled and tries to actually read vertex index 65535 - Works correctly if uint32 vertex indices are used instead of uint16 for cases where primitive restart is active	2018-02-16 16:14:54 +03:00
kd-11	a8ab408f64	rsx: Account for null blit ops (memcpy) - Do not perform extra memory tasks if no actual image copy was performed	2018-02-16 16:14:54 +03:00
kd-11	661b8b006f	rsx: Add texture readback statistics to the texture cache and debug overlay	2018-02-16 16:14:54 +03:00
kd-11	1bd77c2f51	rsx: Add cache pattern checking to blit engine resources - Feature was implemented long ago but was not functional due to bugs	2018-02-16 16:14:54 +03:00
kd-11	c191a98ec3	vulkan API fixes - Fix for texture barriers - vulkan: Rework texture cache handling of depth surfaces - Support for scaled depth blit using overlay pass - Support proper readback of D24S8 in both D32F_S8 and D24U_S8 variants - Optimize the depth conversion routines with SSE - vulkan: Replace slow single element copy with std::memcpy - Check heap status before attempting blit operations - Bump guard size on upload buffer as well	2018-02-16 16:14:54 +03:00
kd-11	3bbecd998a	infinitesimal fixes	2018-02-16 16:14:54 +03:00
kd-11	3406cc9886	gl: Fix save binding state for cube textures	2018-02-16 16:14:54 +03:00
kd-11	bd297d079d	rsx: Minor optimizations	2018-02-16 16:14:54 +03:00
kd-11	a5500ebfa4	rsx: Fix disjoint draw range splitting - Fixes flickering and missing draws in R&C and other games such as Motorstorm Apocalypse and Okami HD when strict mode is disabled	2018-02-16 16:14:54 +03:00
kd-11	a64bea1286	rsx/fp: Discard shaders with undefined (non-existent) writes. On nvidia+vulkan, undefined writes autofill with blue color	2018-02-16 16:14:54 +03:00
kd-11	b67f28e00d	rsx/texture_cache: Do not access non-existent components	2018-02-16 16:14:54 +03:00
kd-11	89c548b5d3	rsx: fbo fixes 2.5 - Implement flush-always behaviour to partially fix readback from a currently bound fbo - Without this, only the first read is correct, as more draws are added the results become 'wrong' - Fixes WCB and cpublit behviour - Synchronize blit_dst surfaces to avoid data loss when gpu texture scaling is used - Its still faster in such cases to disable gpu texture scaling but some types cannot be disabled without force cpu blit (e.g framebuffer transfers) - Memory management tuning - rsx: on-demand texture cache rescanning for unprotected sections - rsx: Only framebuffer resources are upscaled - Do not resize regular blit engine resources - Lazy initialize readback buffer when using opengl -- These measures should help minimize vram usage	2018-02-16 16:14:54 +03:00
kd-11	02e571adbc	gl: Cleanup some more things - Avoid referencing GL's null texture - Improve texture creation helpers - Remove useless __glcheck from GLHelpers.cpp	2018-02-16 16:14:54 +03:00
kd-11	98e50d3064	gl: Reuse framebuffer textures whenever possible	2018-02-16 16:14:54 +03:00
kd-11	f20fd217f8	rsx: Reorganize framebuffer setup code - Fixes some fast paths for framebuffer creation and binding	2018-02-16 16:14:54 +03:00
Nekotekina	d2652aedd2	VK: add two options (workarounds) Force FIFO present mode option is workaround for recent MESA drivers Force primitive restart flag should also work on said drivers	2018-02-11 19:52:14 +03:00
Nekotekina	cce0ad0c35	Clean vm::ps3 namespace use	2018-02-09 17:49:37 +03:00
Nekotekina	76be7d40ac	Remove PSP2	2018-02-09 15:24:46 +03:00
kd-11	e7537cded5	vk: Also discard background if window is too small in vertical axis	2018-02-02 10:07:55 +03:00
kd-11	ea8bdda9a3	rsx/gl/vk: Support for swizzled? context surfaces - For some surfaces, dimensions are passed via the log2 bits rather than surface pitch -- This is similar to the setup for nv406e and probably means the surfaces are padded and swizzled	2018-02-02 10:07:55 +03:00
kd-11	eeb6e29e39	vulkan: implement proper texture read barriers	2018-02-02 10:07:55 +03:00
kd-11	b9cca71c47	gl: API compliance fixes - Do not assume texture2D when creating new textures - Flag invalid texture cache if readonly texture is trampled by fbo memory. Avoids binding a stale handle to the pipeline and is rare enough that it should not hurt performance	2018-02-02 10:07:55 +03:00
Zion Nimchuk	70fe0cc224	Install and load icons from Icons in executable directory, fixes local builds using Icons	2018-01-31 01:01:33 +04:00
kd-11	4f7d3e5dc1	vk: Stuff - Remove subpass dependencies; transitions are handled via exicit imagememrybarriers - Reuse sampler objects whenever possible; create/delete cycles are not free	2018-01-30 21:16:43 +03:00
kd-11	33bcdd476c	glsl/fp/vp: Avoid shader clutter - Do not add unused subroutines in shaders unless necessary -- makes shaders easier to read and disassembled spir-v has less clutter - glsl: Replace switch block with lookup table	2018-01-30 21:16:43 +03:00
kd-11	2e04dceaf0	rsx: misc fixes - Supply explicit options for spv emit allowing optimizations (not yet compiled into the backend) - Add epsilon fix to glslcommon - Fix shader dialog crash when using qt (race condition)	2018-01-30 21:16:43 +03:00
kd-11	648fc92184	rsx/fp/vp: Epsilon value is too large! - Original epsilon value was 1.E-10 which nvidia linux driver could not read properly -- Restores the original value represented in decimal notation	2018-01-30 21:16:43 +03:00
kd-11	c05d164eee	gl: Implement swap control on linux	2018-01-30 21:16:43 +03:00
Juhn	d37aa466ff	Fix native UI save_dialog when there are no saves (#4113 ) - Also implements single action list view with cancel action only	2018-01-30 16:26:00 +03:00
ZeroZero2018	cd8e97a7c6	Fix to B8 format render target swizzling (#4123 )	2018-01-29 21:58:25 +03:00
Zangetsu38	c8965564e4	vk/ogl: Fix regression for G8B8	2018-01-24 20:20:28 +03:00
Jake	efe6a6cf53	rsx: fixed emulated primitive restart	2018-01-24 16:28:09 +00:00
Jake	2f414f96bf	rsx: fix potential hang during thread close	2018-01-24 16:28:09 +00:00
kd-11	4f01794713	Minor fixes - vulkan: Do not assume an aux frame context must exist in a well defined state as set in init_buffers() since the request might be external (via overlays path) - gl: Do not bother waiting for idle before servicing external flip requests - gl: Queue overlay cleanup requests to ensure only glthread attempts touching the context - overlays: Do not compute size metrics for invalid/unsupported glyphs	2018-01-22 11:43:35 +03:00
kd-11	3d9e3a16f1	rsx/gl/vk: Fixes and optimizations - opengl driver optimization for nvidia. On nvidia glTextureBufferRange performance is horrendous -- Initialize texture buffer to whole buffer at startup and use absolute offsets to read data instead -- Over 2x performance in some cases (Resogun, TNT racers) - gl/vk: Do not flip non-existent display buffers. Fixes spec violation at boot in TNT racers demo - whitespace fixes for sys_rsx	2018-01-22 11:43:35 +03:00
kd-11	fcd702c8a6	rsx: Texture format fixes - Implement low bit decode override flags for 2-component textures - Properly implement alot of texture remaps according to the autotest results rsx: Do not unnecessarily shuffle WZYX->RGBA unless we have proof - From looking at format swizzles, this is incorrect	2018-01-22 11:43:35 +03:00
kd-11	743928b379	vk/gl: Preserve clamped z precision to some extent - Use edges of depth range to map clamped stuff Disable range compression on regular draws vs extended range draws - Some applications require full 0-1 usage without compromises. -- TODO: This leaves the extended range z values to fight with regular draws in the .99 - 1.0 range	2018-01-22 11:43:35 +03:00
Zion Nimchuk	49e64b9e82	install icons to /usr/share and load them from there	2018-01-19 12:44:01 +03:00
kd-11	6828fbf658	rsx/texture_cache: Remove hacks; it has been proven that in offsets are in x16 fixed point	2018-01-19 12:03:57 +03:00
kd-11	0a2992839b	rsx/gl/vk: Simulate z clipping with selective depth clamp - The scale offset matrix is fine but on real hardware the z results seem to be independent of near/far clipping distances -- If depth falls within near/far, clamp depth value to [0,1]	2018-01-19 12:03:57 +03:00
kd-11	1a6e53ec98	overlay: Fixes - Add fallback fonts including attempting to find glyphs in dev_flash - Fix vulkan hang on startup if icons are not present	2018-01-19 12:03:57 +03:00
kd-11	9ec2337192	rsx: Synchronization improvements - Always flush the primary queue and wait if not involking readback from rsx thread -- Should fix some instances of device_lost when using WCB -- Marked remaining case as TODO -- TODO: optimize amount of time rsx waits for external threads trying to read	2018-01-19 12:03:57 +03:00
kd-11	cbc8bf01a1	cell/scheduler: Manage thread placement depending on cpu hardware	2018-01-19 12:03:57 +03:00
Megamouse	d238791b7d	RSX: properly handle disconnected pads	2018-01-18 21:18:56 +03:00
kd-11	71f69d1d48	rsx/overlays: Introduce 'native' HUD UI and implement some common dialogs (#4011 )	2018-01-17 19:14:00 +03:00
TGEnigma	28a96f1543	Add option to disable on-disk shader cache (#4041 )	2018-01-15 19:23:36 +03:00
myfreeweb	15d0bdb7cf	Detect Vulkan Xlib/Wayland surface support at runtime (#4048 ) Detect Vulkan Xlib/Wayland surface support at runtime	2018-01-14 22:16:06 +03:00
Jake	0477f8ed3c	rsx: add log for potential source of error	2018-01-14 20:50:55 +03:00
Jake	7ca2c444cc	rsx: Fix depth clipping	2018-01-14 20:50:55 +03:00
Jake	c5074ba81f	d3d12: fix invalid framebuffer crash and shader compile	2018-01-14 20:50:55 +03:00
Jake	ac53fc54dc	rsx: fix image_in arg and swizzle fix	2018-01-14 20:50:55 +03:00
Greg V	fbceec47b8	Add support for Vulkan on Wayland The variable VK_USE_PLATFORM_WAYLAND_KHR is actually used by the Vulkan header, so use it here too.	2018-01-11 12:26:41 +03:00
kd-11	ee009ec99c	rsx: Robustness fixes - Track last working state and reset to it if RSX starts to desync -- This is especially useful when running vulkan since the renderer will easily outpace the rest of the system when merely recording draw commands - Ignore empty sets -- Mark empty/invalid IB sets as having 0 element counts.	2018-01-02 21:17:56 +03:00
lewmpk	d64e79bd9f	fix clang warning: logical-op-parentheses	2017-12-31 22:08:17 +03:00
kd-11	1ea5e7404a	rsx: Workaround for nvidia linux - For some reason, using 1.E-x notation does not work on nvidia linux. Could be a bug in spir-v generator or the driver itself	2017-12-31 12:43:40 +03:00
kd-11	55c324e062	rsx: Invalidate surface configuration if stencil state is changed - Stencil state afects validity of a depth/stencil surface same as depth state	2017-12-31 12:43:40 +03:00
kd-11	d496dbecad	rsx: Implement depth clamping	2017-12-31 12:43:40 +03:00
kd-11	4872be9de3	rsx/vk: Synchronize access to queue submit calls - Should prevent concurrent access on the present/graphics queue	2017-12-31 12:43:40 +03:00
kd-11	688569e5c8	gl: Remove flag test optimization as it does not work properly - Most texture cache operations will break sampler state and the extra complexity is not worth the very small speed bump	2017-12-31 12:43:40 +03:00
kd-11	8b31a9448a	vulkan: Tweaks to sampler state fallback paths - Make all texture access on non-existent textures return 0 - If border color is closer to 0, then set alpha to 0 as well (might break some corner cases with alpha test) - Zero initialize null sampler	2017-12-22 20:08:14 +03:00
kd-11	4819847c46	rsx: Modify semaphore_acquire timeout detection - Take paused state into account - Make timeout configurable	2017-12-22 20:08:14 +03:00
kd-11	d6bc6ec2c1	rsx: fix initial swizzle ordering for render target data	2017-12-22 20:08:14 +03:00
kd-11	320fb5f94c	rsx/fp: Fix alphakill	2017-12-22 20:08:14 +03:00
kd-11	4a0c4259f0	c++ is hard - Remove unnecessary const definitions	2017-12-22 20:08:14 +03:00
Nekotekina	61de20a633	RSX: remove SSSE3 dependency	2017-12-20 00:04:08 +03:00
kd-11	de5dab35e0	rsx: Raise semaphore timeout duration bacause some games are very slow	2017-12-18 10:45:37 +03:00
kd-11	47060cdc5f	rsx/fp: Fix typo	2017-12-18 10:45:37 +03:00
kd-11	7dd349ae8e	Update FragmentProgramDecompiler.cpp	2017-12-18 10:45:37 +03:00
kd-11	4e80858bed	rsx/fp: Hotfix for TEXBEM/TXPBEM	2017-12-18 10:45:37 +03:00
kd-11	e89a035e8b	rsx/fp: Implement TXPBEM	2017-12-18 10:45:37 +03:00
kd-11	f7c52d3bb7	rsx/fp: Implement TEXBEM (untested)	2017-12-18 10:45:37 +03:00
kd-11	6f8dd20f03	rsx/fp: Stuff - Implement BEM - Add LG2 to special instructions	2017-12-18 10:45:37 +03:00
kd-11	b1a1c0251f	rsx: Implement variable point size	2017-12-18 10:45:37 +03:00
kd-11	0d0821e914	rsx: Pause FIFO queue when changing ctrl registers	2017-12-18 10:45:37 +03:00
kd-11	6891323c18	rsx: framebuffer textures do not have mipmaps! - Force mipmap count to 1 if sampling from an RTV/DSV - TODO: Better wcb flush detection, it should be better to re-upload the texture after it has been dwnloaded if expected mipmaps are > 1	2017-12-18 10:45:37 +03:00
kd-11	7c7cd4153e	rsx: Framebuffer setup fixes - Sometimes square renders are done to surfaces with pitch=64 and re-uploaded with swizzle scanning -- This setup avoids discarding targets if they are square and pitch == 64	2017-12-18 10:45:37 +03:00
kd-11	ff0f1510e5	rsx: Minor fixes - Abort nv406e semaphore acquire if the rsx thread stalls/crashes - Fix texture size approximation to take mipmaps into account. Fixes some games hanging with WCB	2017-12-18 10:45:37 +03:00
kd-11	3338fdb936	rsx: Fix RGB565 blits. Data is byteswapped on input - Fixes messed up BG on retroarch glyphs	2017-12-18 10:45:37 +03:00
kd-11	6dfe32c6d2	fix linux builds	2017-12-18 10:45:37 +03:00
kd-11	95966a467e	rsx: Texture cache fixes - Handle blit resources in a more consistent way - TODO: Handle some corner cases (piyotama)	2017-12-18 10:45:37 +03:00
kd-11	ac0022483a	rsx: Implement delayed swizzle remap for blit engine resources - Fixes remap vectors for memory copied via blit engine as it has no context	2017-12-18 10:45:37 +03:00
Jake	d0013679c0	rsx: fix image_in swizzled texture crash	2017-12-08 15:19:17 +04:00
kd-11	0b3fbf1d4c	rsx: Narrow the race condition window further - Needs aliased paging to be implemented to fix properly or a re-entrant global IO lock	2017-12-06 12:55:49 +03:00
kd-11	69eb483089	rsx: Framebuffer setup fix - Stencil flag is meaningless if surface does not support stencil	2017-12-06 12:55:49 +03:00
kd-11	a2b4cf22b5	rsx: Reimplement invalidate_range_base_impl - Avoid unprotecting memory until just before we have to write the data - Avoids race conditions where the caller thread takes too long to enter the second phase and another thread accesses the "bad" memory	2017-12-06 12:55:49 +03:00
kd-11	970d2a06e0	rsx: Properly fix DATA3F_M register alignment	2017-12-04 18:22:18 +03:00
kd-11	9853027f72	rsx/vp: Decide default return values in case of undefined attributes based on location ID - Different default values should be returned for different attributes	2017-12-04 18:22:18 +03:00
kd-11	90c2324e47	rsx: Program cache fixes - Reorganize storage hash vs ucode hash - Scan for actual fragment program start in case leading NOPed code precedes the actual instructions -- e.g FEAR2 Demo has over 32k of padding before actual program code that messes up hashes	2017-12-04 18:22:18 +03:00
kd-11	f5145943b2	d3d12: Fix fragment shader compile	2017-12-04 18:22:18 +03:00
kd-11	960f26f84e	rsx:: fixes for framebuffer setup - provide a DSV if depth and/or stencil testing is enabled	2017-12-04 18:22:18 +03:00
kd-11	cdd4fd9867	rsx/fp: Explicitly insert global functions. - Functions such as pack/unpack ops must exist before the shared gather functions are declared	2017-12-04 18:22:18 +03:00

... 20 21 22 23 24 ...

3700 commits