rpcsx

mirror of https://github.com/RPCSX/rpcsx.git synced 2026-03-17 18:54:51 +01:00

Author	SHA1	Message	Date
kd-11	0072df7f20	rsx/gl: Add basic interpreter support to OGL - Adds basic interpreter functionality. - Flow control and other instructions not yet implemented.	2020-04-30 15:02:59 +03:00
Eladash	833ace1190	rsx: Fix zcull time to not time travel to the future	2020-04-28 21:07:15 +03:00
Megamouse	18219afbf7	Qt: move rsx capture to Utilities menu	2020-04-22 21:43:03 +02:00
Eladash	b94e4247cc	rsx: More strict zcull stats enabling	2020-04-21 16:18:32 +01:00
rexys	8f3b04cbd6	rsx: Fix is_fifo_idle with hle gcm	2020-04-16 12:59:19 +03:00
Megamouse	cf229a8e9f	some more dynamic settings	2020-04-15 18:25:25 +02:00
scribam	2e397e38a4	Typos	2020-04-14 17:06:58 +03:00
scribam	f37adc4188	Add fallthrough attribute	2020-04-14 17:06:58 +03:00
Nekotekina	4d8bfe328b	Replace rotate utils with std::rotl More include cleanup.	2020-04-14 16:05:58 +03:00
Nekotekina	032e7c0491	Replace utils::cntlz{32,64} with std::countl_zero	2020-04-14 16:05:58 +03:00
Nekotekina	d0c199d455	Replace utils::cnttz{32,64} with std::countr_{zero,one} Make #include <bit> mandatory.	2020-04-14 16:05:58 +03:00
Eladash	cb14805d78	rsx fp/vp analyzers: Fix strict type aliasing and improve codegen	2020-04-12 16:48:43 +03:00
Eladash	e407018bb5	rsx: Write ref+get atomically May contribute to better FIFO synchronization in some cases.	2020-04-11 21:21:15 +03:00
Eladash	ff74c241c7	rsx: Fix get_optimal_blit_target_properties for local memory	2020-04-11 21:21:15 +03:00
Eladash	504ba8d824	rsx: Fix grammer issue (binded -> bound)	2020-04-11 21:21:15 +03:00
Eladash	8228fa1ece	sys_rsx: Warn if RSX is not idle during crucial points	2020-04-11 21:21:15 +03:00
Eladash	36fd1d0f0d	rsx: Optimize transform constants load methods (#7992 )	2020-04-09 15:53:43 +03:00
Eladash	f7536bbce0	sys_rsx: Fix gcm events spam In realhw the events are only sent if they are masked in driver_info->handlers as well.	2020-04-07 20:43:28 +03:00
Eladash	3f48450408	sys_rsx: Minor atomicity fixes	2020-04-07 20:43:28 +03:00
Megamouse	078c31c1da	Qt: fix lupdate warnings (used for translation)	2020-04-06 20:59:58 +02:00
Megamouse	b1fdbc7fcc	Move some format functions	2020-04-06 20:59:58 +02:00
Eladash	e7d5d17fd8	rsx: Adjust FIFO recovery to be a bit more merciful	2020-04-05 17:40:23 +03:00
kd-11	0b6e2b26fa	rsx: Fix DST instruction - It's the old-school distance vector, not the more modern distance() function - There is seemingly no glsl function that maps to it directly.	2020-04-05 16:35:20 +03:00
kd-11	b301fecfd8	gl: Fix async shader compiler - Removes glFinish hack. - Adds proper server-side synchronization. - Adds primary context detection to allow worker threads to be identified.	2020-04-05 16:35:20 +03:00
Eladash	72d1efa383	rsx: Batch transform contants load methods	2020-04-05 15:21:56 +03:00
Eladash	72c0aed4c1	rsx: Reset vertex program/constants at each boot	2020-04-02 20:42:12 +03:00
Eladash	c2c5005278	rsx: Fix and improve fp program data invalidation	2020-04-02 20:42:12 +03:00
Eladash	2ed370093e	rsx: Get rid of invalid_command_interrupt_raised	2020-04-02 20:42:12 +03:00
Eladash	d97e9f7b4a	rsx: Batch vertex program load methods	2020-04-02 20:42:12 +03:00
kd-11	69d90f6fec	vk: Remove NVIDIA workaround for broken partial occlusion queries - This bug has been fixed in the latest drivers.	2020-03-31 20:53:12 +03:00
kd-11	8c847d3a4b	vk: Remove RADV workaround regarding renderpass barriers - The situation was clarified in the official vulkan spec to allow this behavior. Barriers are now only inserted by the driver when layout transitions are requested.	2020-03-31 20:53:12 +03:00
kd-11	b327e329d6	vk: Avoid query log spam if no program is loaded	2020-03-31 20:53:12 +03:00
Eladash	4215499b7f	rsx: Fix typo in NV4097_SET_TRANSFORM_PROGRAM range	2020-03-28 11:07:34 +03:00
unknown	049825812e	RSX: Restrict analyser loop error	2020-03-28 09:42:13 +03:00
xddxd	d96dabcd60	rsx: Rename current_instrution to current_instruction (#7883 )	2020-03-28 02:46:48 +00:00
Eladash	9d971e3b07	rsx: More strict infinite desync detection 6 desyncs per second for 1.5 seconds is pretty bad already.	2020-03-26 17:52:45 +03:00
Eladash	38c8dd98b4	rsx: Implement basic infinite FIFO desync detection	2020-03-26 15:22:45 +03:00
Eladash	158e34faca	rsx: Reset all method registers at rsx_state::init()	2020-03-25 17:51:59 +03:00
Eladash	768b4f8c65	rsx: Improve NV308A_COLOR * Fix NV308A_COLOR methods range. * Batch NV308A_COLOR methods execution together. * Fix termination of bind_range<> in rsx methods binding.	2020-03-25 17:51:59 +03:00
Megamouse	fd3522436a	overlays/osk: add more panels	2020-03-25 03:54:49 +01:00
Eladash	08e66ab14c	Minor warning fixes	2020-03-23 21:37:37 +03:00
kd-11	4965bf7d7a	gl/vk: Refactor draw call handling and stub shader interpreter - Refactors backend draw call management to make it easier to extend functionality. - Stubs shader interpreter functionality.	2020-03-23 14:47:28 +03:00
Eladash	cccc32fa9d	sys_lwmutex/lwcond: track lwcond waiters (#7826 ) In lwmutex destroy syscall, wait for pending waiters.	2020-03-23 10:30:17 +03:00
kd-11	12044bd8b0	rsx: Properly calculate vertex range when divisor is active - The upper bound is to be rounded up, not down.	2020-03-22 10:57:47 +03:00
Eladash	9acf8e283d	Fix OSK thread exit condition	2020-03-21 12:37:29 +03:00
Nekotekina	c577bd2111	Implement thread_state::errored State after calling thread emergency_exit() function. Also default-construct thread result in this case.	2020-03-20 21:31:27 +03:00
Megamouse	fd8cda0f2b	overlays/osk: fix selection after changing panels We now try to keep the current x and y selected after panel changes. Also change some copy to ref	2020-03-19 21:10:08 +01:00
Megamouse	c63f77e3b0	overlays/osk: fix full width characters	2020-03-19 21:10:08 +01:00
Megamouse	a1f70bf96e	overlays/osk: do not change the preview text on empty input This prevents that the placeholder disappears	2020-03-19 21:10:08 +01:00
Megamouse	f1127f1894	overlays: implement osk panels	2020-03-19 21:10:08 +01:00
kd-11	d25ba03e82	vk: Lazy evaluate renderpass scope - Spamming the driver with renderpass open/close cycles is bad for performance.	2020-03-15 18:39:40 +03:00
kd-11	7025985c0d	rsx: Improve section scanning when updating surface cache resources in blit engine.	2020-03-15 16:51:23 +03:00
kd-11	a756c0679e	rsx: Implement cross-aspect slice gathering - Fixes a data leak that can happen when a surface is rejected due to aspect mismatch. - Mismatch can lead to rejection due to area covered excluding the RTT and inevitable upload a texture from CPU at the same location. - Overlapping fbo/shader_read resources are not allowed.	2020-03-15 16:51:23 +03:00
Eladash	377e06a4a2	rsx: Fix unknown Blend equation	2020-03-15 09:53:15 +03:00
kd-11	2ae83782e1	vk: Fix potential MTRSX deadlock in case of a race condition	2020-03-13 22:06:04 +03:00
Eladash	f3877d11e8	rsx: Fix initial boolean state of m_textures_dirty and m_vertex_textures_dirty	2020-03-12 21:36:43 +01:00
Eladash	c04abac630	rsx capture: Fix exceptions handler, fix tiny race condition on capture new capture	2020-03-12 21:36:43 +01:00
kd-11	7e9dbeff7b	vk: Fix MTRSX deadlock (#7766 )	2020-03-12 22:29:58 +03:00
Nekotekina	04dedb17eb	Disable exception handling. Use -fno-exceptions in cmake. On MSVC, enable _HAS_EXCEPTION=0. Cleanup throw/catch from the source. Create yaml.cpp enclave because it needs exception to work. Disable thread_local optimizations in logs.cpp (TODO). Implement cpu_counter for cpu_threads (moved globals).	2020-03-12 16:03:08 +03:00
kd-11	47bbfdd2aa	vk: Change texture cache memory management for disposed textures - Use global resource manager instead of using the 2-frame hold behavior. - Fixes high VRAM usage in some games	2020-03-11 16:29:34 +03:00
kd-11	7989de9d16	vk: Properly release dma resources.	2020-03-10 22:02:02 +03:00
Megamouse	3ea94c286b	input/overlays: fix premature pad interception removal shader compilation and trophy notifications shouldn't cancel the pad interception during proper dialogs	2020-03-10 19:04:32 +01:00
kd-11	12b73c8bdc	rsx: Fix copypasta	2020-03-09 17:20:24 +03:00
Eladash	636ed4a48b	HLE cellGcmSys: Avoid calling sys_rsx syscalls in rsx code	2020-03-09 16:07:14 +03:00
kd-11	2985a39d2e	rsx: Rewrite async decompiler	2020-03-09 14:59:25 +03:00
Nekotekina	9dca2887d8	Fixup for Emu.Pause() Remove some reduntant calls. Don't pause on unknown sys_fs_fcntl operation.	2020-03-08 22:03:16 +03:00
kd-11	8214425a3c	rsx: Fix framebuffer native layout for X32_FLOAT - It was not matching the order laid out for normal textures uploaded from CPU.	2020-03-08 11:43:49 +03:00
kd-11	84a542fbce	rsx: Blit engine improvements - Detect writes to the display output memory and handle it specially. It already defines a known 2D region. - Try and detect situations where raw transfers would be of benefit.	2020-03-08 10:30:13 +03:00
Eladash	892f74d762	rsx: Improve frame-limiter (#7723 ) * rsx: Improve frame-limiter accuracy * lv2: Improve lv2_obj::wait_timeout response time for aborting threads * rsx: Make stretch to display area setting dynamic * rsx: Redefine 'auto' frame limiter to obey vblank rate * rsx: Make frame limiter setting dynamic * rsx: Make frame-limiter compatible with dynamic changes	2020-03-08 01:11:35 +03:00
kd-11	149d550f7e	gl: Restore commented out line - Byte order step was disabled for debugging and not restored	2020-03-07 17:23:25 +03:00
kd-11	70f2577b9e	vk/gl: Use best-fit semantics when scanning texture cache for flippable images - Allows sourcing flip data from the blit engine resources which avoids expensive flush and re-upload	2020-03-07 16:58:35 +03:00
kd-11	93295f7f50	vk: Fix image properties for flip temporary images to be samplable. - In case of gamma correction or other effects, they may require shader access. - BGRA8_UNORM is usually safe to use directly without staging memory.	2020-03-07 16:58:35 +03:00
kd-11	1725f7a34b	rsx: Add anaglyph 3D filter	2020-03-07 16:58:35 +03:00
kd-11	6e3406b3f5	video: Allow selection of 3D stereo resolutions	2020-03-07 16:58:35 +03:00
Nekotekina	e4a81b1d13	Move Log.h to util/logs.hpp	2020-03-07 12:29:23 +03:00
Nekotekina	7a8772dafa	Replace std::string::npos with umax	2020-03-05 14:05:23 +03:00
Nekotekina Aux1	250736ece5	Fix warnings in emucore	2020-03-04 21:23:34 +03:00
Nekotekina Aux1	f2f3321952	Fix warnings in VKGSRender	2020-03-04 21:23:34 +03:00
Nekotekina Aux1	c3f3451269	Fix warnings in GLGSRender	2020-03-04 21:23:34 +03:00
kd-11	54775d91dc	rsx/blit-engine: Account for a rare corner case - It is possible to have a RTV<->DSV transfer with compatible-sized formats. Mark the depth size as typeless in such a situation to avoid crossing the aspect barrier with the API.	2020-03-04 21:21:59 +03:00
kd-11	7fe9802f87	vk: Properly use declared pitch when loading simple images	2020-03-01 00:16:52 +03:00
kd-11	14aebeac58	video-out: Allow applications to successfully change display resolution - Avoids a situation where a game configures output correctly but gets back bogus information later when querying. - Should fix games being broken at some resolutions but not others.	2020-03-01 00:16:52 +03:00
kd-11	76bbbe27f1	vk: Fix dma resource leak - Fix broken check; a relic of the past where flush method would reset the fence	2020-03-01 00:16:02 +03:00
kd-11	9af52d12a8	vk: Improve events - Make events properly managed objects. - Add a workaround for AMD's broken event status query	2020-03-01 00:16:02 +03:00
kd-11	5eb314fbbb	vk: Add execution barriers. - Useful for debugging	2020-03-01 00:16:02 +03:00
Nekotekina	8e5a03f171	Use named_thread_group in rsx_cache.h	2020-02-29 16:55:25 +03:00
kd-11	eb140c52a4	rsx: Reset ZCULL statistics at the end of a frame - Workaround for games that leak zpass/zstats. The information is useless anyway without a clear op so it should be fine.	2020-02-29 14:23:52 +03:00
kd-11	198c84cabf	rsx: Fix zcull clear command; do not clear ZPASS when ZSTATS is cleared.	2020-02-29 14:23:52 +03:00
Eladash	8762f2a588	Use more starts_with	2020-02-29 13:06:14 +03:00
kd-11	08f3460365	vk: Fixup for RCB/RDB in special cases - Images must be in TRANSFER_DST_OPTIMAL or GENERAL layouts to call the image upload routines.	2020-02-29 12:13:11 +03:00
kd-11	cb047fcc75	rsx: Disable zstat checks to avoid unnecessary stream splitting (#7624 )	2020-02-28 20:27:31 +03:00
Nekotekina	ac2581659a	RSXOffload: fix dma_manager::sync() freeze on exit Its logic was completely broken.	2020-02-28 19:55:43 +03:00
Nekotekina	f335d034fc	Fix RSX Offloader thread exit (MTRSX fix) Hangs on exit if MTRSX is enabled.	2020-02-28 19:43:42 +03:00
Nekotekina	ecd68dfc70	overlays: add "thread bits" to wait on and avoid lockup Add TLS variable to store its own bit.	2020-02-27 19:14:08 +03:00
Megamouse	ee46ad1ca9	move overlays code to headers	2020-02-26 23:43:18 +01:00
gamerforEA	93552a5958	Apply some Clang-Tidy fixes	2020-02-27 00:38:55 +03:00
gamerforEA	c0fbf3091e	Remove unnamed namespaces from headers	2020-02-27 00:38:55 +03:00
gamerforEA	49294a3dd2	Add missing include guards	2020-02-27 00:38:55 +03:00
Nekotekina	5094ab8283	Fix RSX Offloader thread name	2020-02-26 21:57:01 +03:00
Nekotekina	b35a5982e8	Fix one bug with MsgDialog thread (freeze on exit) Forgot to check thread state	2020-02-26 21:23:30 +03:00
kd-11	569e1c2df6	rsx: Fix typo. Noted by github user @gamerforEA	2020-02-26 19:40:35 +03:00
kd-11	6e9392fb45	rsx: Restructure ZCULL query triggers - Both ZCULL stats and ZPASS stats require hardware queries, but ZCULL stats should not contribute to ZPASS stats and vice versa! - Disables hardware queries for ZCULL stats by themselves, we cannot generate them correctly anyway and no game so far has been found to actually use them. Should lessen the load on the backend for games that do not actually require it.	2020-02-26 19:40:35 +03:00
Nekotekina	df1813b4e2	overlays: hotfix for waiting on thread_count	2020-02-25 23:43:05 +03:00
Nekotekina	ff16e678a5	Add thread_count instead of former thread pool	2020-02-25 23:16:55 +03:00
Nekotekina	982856e70d	overlays: remove unused threadpool	2020-02-25 22:56:50 +03:00
Nekotekina	144c20649f	Try to fix msg dialog breakage	2020-02-25 22:50:44 +03:00
Megamouse	e719bcf338	overlays: add layer modes to osk	2020-02-25 21:57:49 +03:00
Megamouse	3f4226b70e	overlays: Fix find and replace regression	2020-02-25 21:57:49 +03:00
Megamouse	620cfd5063	overlays: move code to overlay_utils.cpp	2020-02-25 21:57:49 +03:00
Megamouse	2341749485	overlays: add overlay_osk.h	2020-02-25 21:57:49 +03:00
Nekotekina	9c9c2eb2c9	Fix wrong g_fxo->init_crtp name, use just init<>	2020-02-25 14:07:50 +03:00
Nekotekina	318a364d09	Try to fix OSK	2020-02-25 14:03:13 +03:00
Nekotekina	fa02a04baa	Add g_fxo->init_crtp to simplify thread construction	2020-02-25 11:51:41 +03:00
kd-11	cd40bc8c61	overlays: Avoid race condition between rendering and layout operations for system widgets - System widgets are callable from outside RSX code. - Responding to draw requests while setup is in progress can cause malformed cached output - Fixes glitched layouts for system message dialogs	2020-02-24 23:33:47 +03:00
kd-11	f6ebd88687	overlays: Ditch wstring for u32string - Turns out wstring is not the same as u32string on windows.	2020-02-24 23:33:47 +03:00
Megamouse	f7666f44da	Untangle GUI and input includes	2020-02-24 16:31:01 +01:00
Eladash	522daf5eac	rsx: Fix NULL renderer	2020-02-23 19:57:55 +03:00
Nekotekina	8b4b859091	Remove "thread_ctrl::spawn"	2020-02-23 15:03:38 +03:00
Nekotekina	18db020b93	Fix warning in RSXOffload.cpp (rewrite thread)	2020-02-23 14:19:23 +03:00
Nekotekina	7069e7265f	RSX: move g_dma_manager to g_fxo	2020-02-23 13:12:50 +03:00
kd-11	fa41297b27	overlays/trophy: Migrate to multibyte strings	2020-02-22 15:07:14 +03:00
kd-11	b8f51398b7	overlays/save_dialog: Migrate to multibyte strings	2020-02-22 15:07:14 +03:00
kd-11	cb2129c7e4	overlays/osk: Migrate to multibyte encoding	2020-02-22 15:07:14 +03:00
kd-11	703ec9f896	overlays: More unicode utilities	2020-02-22 15:07:14 +03:00
kd-11	19350d024b	overlays: Font system improvements - Add support for Hangul blocks (korean) - Restructure font fallback system to allow the user to 'install' fonts if missing. Should allow fonts to work with no firmware on open systems like linux	2020-02-22 15:07:14 +03:00
kd-11	8e68427daf	overlays: Add basic font substitution system and separate JPN from Latin-1 set - Gets JP glyphs to render correctly, but the generalization may negatively affect other CJK glyph sets. PS3 doesn't seem to use other glyph sets much however.	2020-02-22 15:07:14 +03:00
kd-11	6220206cbc	vk: Implement 2D array textures required for new font subsystem	2020-02-22 15:07:14 +03:00
kd-11	1df1ceb4ea	gl: Support new glyph format with array textures	2020-02-22 15:07:14 +03:00
kd-11	6178a0ab25	overlays: Migrate to wide-char strings	2020-02-22 15:07:14 +03:00
Nekotekina	5e75a0c497	Disable cotire on travis Make some workarounds for clang because it poorly supports -Wold-style-cast	2020-02-21 17:03:54 +03:00
Nekotekina	972e0ab31d	Remove -Wno-reorder and make it an error	2020-02-21 15:20:34 +03:00
Nekotekina	92e3eaf3ff	Fix signed-unsigned comparisons and mark warning as error (part 2).	2020-02-19 22:54:58 +03:00
Nekotekina	771eff273b	First part of fixing sign-compare warning (inside be_t).	2020-02-19 22:54:58 +03:00
Eladash	df8d0cde4a	RSX/SPU: Accurate reservation access	2020-02-19 18:11:30 +00:00
Megamouse	fe75311be2	move config structs to own files and clean up some headers	2020-02-17 15:08:17 +03:00
kd-11	5e6b1003ec	vk: Only declare explicit subpass dependencies for RADV	2020-02-16 18:00:06 +03:00
kd-11	23f1515448	vk: Explicitly declare null subpass dependencies - We do not want any actual dependencies, but it turns out removing them entirely makes the driver add even worse dependencies.	2020-02-15 21:45:25 +03:00
Eladash	9344b21484	rsx: Unify FIFO recovery methods TODO: Maybe consider fifo stack content when recovering.	2020-02-14 17:11:26 +03:00
Eladash	07f300a14e	rsx: ZCULL typo fix	2020-02-14 17:11:26 +03:00
Nekotekina	bcbe324534	geometry.h: make conversion operators explicit It requires static_cast<> to call them.	2020-02-11 13:21:45 +03:00
Eladash	dcb30df7c8	rsx capture: Fix capture recovery after a crash	2020-02-10 21:39:39 +00:00
Eladash	bdab26ec09	rsx: rewrite io mappings Along with some with fixes to cellGcmSys HLE.	2020-02-10 21:39:39 +00:00
kd-11	f47333997f	rsx: Validate memory blocks before checking for overlap	2020-02-10 21:48:35 +03:00
kd-11	3787108ee7	rsx: Typo fix in audit condition	2020-02-10 21:48:35 +03:00
Nekotekina	034267adb2	Compilation fix	2020-02-10 16:57:56 +03:00
kd-11	efc8c3f4a9	vk: Fixup for VK_ERROR_SUBOPTIMAL_KHR - break from a switch does not break out of the external scope!	2020-02-09 13:45:30 +03:00
kd-11	792c481f6d	rsx/overlays: Fix clipped rendering of UI elements - Take viewport offset into account when applying window transforms. This is necessary because gl_FragCoord is based on the framebuffer and not the viewport.	2020-02-09 12:55:56 +03:00
Eladash	9d1bb60ad7	cellGcm HLE: fix cellGcmMapMainMemory Fix arguments order, softcode RsxReports::report offset.	2020-02-08 22:18:56 +03:00
Eladash	b7043ce000	Make rsx::get_address report caller location	2020-02-08 22:18:56 +03:00
kd-11	c64935f9dd	rsx: Clean up graphics state notifications and add notification for change in point size - Adds a backend notification when point size changes. - Refactors all those separate notifiers into one reusable template.	2020-02-08 18:13:05 +03:00
kd-11	54da9ac7e5	overlays: Fixup - Avoid calling join on self thread. - Avoid use-after-free.	2020-02-07 19:28:41 +03:00
kd-11	e45360de2b	overlays: Fix use after free - Overlay can be closed when secondary thread is asleep! Wait for it to wake before proceeding with deletion.	2020-02-07 16:15:02 +03:00
kd-11	d59c449ff6	vk: Remove an overzealous assert	2020-02-07 16:15:02 +03:00
kd-11	0bba04ef8d	vk: Fix a bug in RCB/RDB when MSAA is set to disabled. - Initially MSAA option was hardcoded to be always enabled, this bug is a remnant of that time.	2020-02-06 17:54:05 +03:00
kd-11	43dae6c14d	gl: Implement RCB/RDB	2020-02-06 17:54:05 +03:00
kd-11	2b5c24b304	gl: Fix memory barrier implementation and stub for RCB/RDB - It's a miracle it even compiled	2020-02-06 17:54:05 +03:00
kd-11	50b1e26b17	gl: Fix a long-standing regression with typeless transfer caused by a typo. - The parameters for the final upload should be 'unpack_info' not 'pack_info'!	2020-02-06 12:44:46 +03:00
kd-11	18e0559438	gl: Fix per-level sub-image sizes to comply with OpenGL guidelines for compressed textures	2020-02-06 12:44:46 +03:00
kd-11	3cc42c1bf8	gl: Fix broken image transfer operations	2020-02-05 18:18:09 +03:00
kd-11	b6422c9a33	rsx: Fixup - Destination Y coordinate must be 'rebased' onto the current slice by subtracting its offset. Only the local path was affected this time	2020-02-05 18:18:09 +03:00
Nekotekina	c0f80cfe7a	Use attributes for LIKELY/UNLIKELY Remove LIKELY/UNLIKELY macro.	2020-02-05 10:42:34 +03:00
Nekotekina	1a78e0e80c	Make RPCS3 compile in C++2a mode	2020-02-04 23:43:55 +03:00
kd-11	9d9b5c4d66	rsx: Rewrite coverage test to take sum of areas into account. - TODO: A proper sweep algorithm to calculate sum of overlapping rectangles	2020-02-04 16:20:52 +03:00
kd-11	b9ec012922	rsx: Allow for proper data checks when WCB/WDB is enabled	2020-02-04 16:20:52 +03:00
Nekotekina	c4a01875d0	Space fix commit	2020-02-03 11:16:26 +03:00
Silent	7f4e546f19	Protect m_storage.find(key) to fix a race	2020-02-02 22:28:14 +03:00
kd-11	7d2ed9200d	rsx: Remove sections that are wholly inherited by new blocks - Allows sections reclaimed by the surface store due to overlap/inheritance to be identified and removed. - Additionally, potentially lowers the number of flushes required per block with multiple overlaps improving efficiency and theoretically performance.	2020-02-01 15:14:29 +03:00
Nekotekina	15391f45d0	Modernize RSX logging (rsx_log variable)	2020-02-01 11:52:22 +03:00
Nekotekina	1d0f359406	logs: add more log channels instead of GENERAL	2020-01-31 16:44:48 +03:00
kd-11	36d5db7f30	rsx: Plug texture data leak in the 'exact match' path. - Followup to previous texture data leak fix for the replaced section path.	2020-01-31 14:56:53 +03:00
kd-11	c9e35926f5	rsx: Preserve pixel data when splitting sections - Ironically rhis data leak is caused by trying to fix another type of data leak	2020-01-30 21:07:36 +03:00
Eladash	92466165f6	Increase Maximum Vblank Rate and Clocks Scale Allow x30 times the speed of vblank rate + clocks scale of original PS3. In theory a 60 fps limit game which scales frame limit perfectly with vblank rate can be played at up to 1800 fps with this change. And: * Fixed lv2 sleep with Clocks Scaling * Make these settings dynamicaly adjustable. * Avoid code duplication	2020-01-29 21:42:41 +01:00
kd-11	1206a5d4b7	rsx: Tweak blit engine heurestics a bit - Reject writes to RTT if the source data is of unknown origin. non-RTT data and only 1 line in length is suspicious and often GPU data like programs or other rendering inputs.	2020-01-29 12:54:06 +03:00
Nick Renieris	1e69de1205	overlays/perf: Graph label tune-up Place graph text on top, split in 2 lines, center it horizontally. Also if it's wider than the graph, match up graph's width to it.	2020-01-26 17:55:11 +01:00
kd-11	79216917b3	rsx: Workaround for broken rtt resampling - Avoids WCB requirement for now to keep res scaling working correctly. - TODO: Fix this properly	2020-01-26 13:58:48 +03:00
kd-11	698702cd4a	vk: Fix DMA data leak - There still does not exist a ranged flush implementation which is required. - TODO: Implement this properly	2020-01-26 13:58:48 +03:00
kd-11	1166ae19bb	vk: Use appropriate layouts depending on use case when creating new textures to avoid needless barriers	2020-01-26 13:58:48 +03:00
kd-11	44f2cacf7b	rsx: Blit engine tuning - Attempt to identify blit operations that will be flushed immediately after and just do them on CPU instead if the transformation is trivial. - If only a single blit section is contributing to an atlas merge op, the threshold should be 100%. The only acceptable result here is a truncation.	2020-01-26 13:58:48 +03:00
kd-11	7a275eaa3a	rsx: Fix incomplete blit operations getting used as texture inputs - Raise passing 'score' from 50% to 90% to filter out very incomplete merge operations. - Catch unfit sections passing the match test; possible for blit_dst data but will likely be always harmless. Disabled in release builds by default.	2020-01-26 13:58:48 +03:00
Maksim Derbasov	1abdee242a	small improvement (#7288 ) * small improvement * comments addressed Co-authored-by: kd-11 <15904127+kd-11@users.noreply.github.com>	2020-01-22 12:28:48 +00:00
kd-11	adcc3e9c4b	rsx: Optionally sync on texture read semaphore - Some games use texture semaphore for zcull sync which is rather bizzare. However, it works on realhw as the depth test happens before fragment shader completion - Due to the high performance penalty incurred by this act, this behavior is only enabled by the "strict rendering mode" option.	2020-01-21 22:21:51 +03:00
Megamouse	4dbad6cce6	fix some random warnings	2020-01-19 16:38:17 +01:00
kd-11	22ca2827de	rsx: Improve window border detection and clearing - Improves logic to detect if the frame requires letterboxing and properly clears the background appropriately.	2020-01-18 19:52:52 +03:00
kd-11	5e0ca4c0c4	rsx: Fixup for missing visuals when framebuffer is larger than requested display dimensions.	2020-01-18 19:52:52 +03:00
kd-11	48407752a6	formatting: Unify indentation type in the newly added files to tabs	2020-01-18 19:52:52 +03:00
kd-11	bad4d1ff05	rsx: Improve present image scanning - Adds support for partial (letterboxed) source images by taking insets into account. - Bugfix for potential access violation when capturing screenshot on vulkan	2020-01-18 19:52:52 +03:00
kd-11	7453e46a7c	rsx: Refactor out complex present code into separate files - Also restructures present code to have image lookup in a separate re-usable function.	2020-01-18 19:52:52 +03:00
kd-11	b36b9e4822	vk: Fixup for total number of combined samplers using the dynamic binding structure	2020-01-18 11:17:19 +03:00
kd-11	0a2b6a290d	vk: Fixup - Scaling is not needed for a direct typeless transfer!	2020-01-17 14:31:14 +03:00
Megamouse	449cbb7281	Qt: use persistent_settings for playtimes	2020-01-17 07:43:10 +01:00
kd-11	9b34f00241	vk: Optimize image transfers - Adds the same optimization/simplification steps to complex image transfer routines. Whenever possible, multi-step transfers are collapsed into a single operation.	2020-01-16 22:29:26 +03:00
kd-11	82af17beb1	gl: Optimize image operations - Avoid double transfers where a transfer to a temp image is done without scaling and then a secondary transfer follows. Combines the two steps into one whenever possible which can significantly alleviate bandwidth problems at higher resolutions. Significant speedup, upto 90% in some cases (PDF, PDF2)	2020-01-16 22:29:26 +03:00
kd-11	47b196e9d0	rsx: Fix uninitialized variable	2020-01-16 17:57:31 +03:00
kd-11	db014d8a58	rsx: Fix section length calculations when generating new blit targets.	2020-01-16 17:57:31 +03:00
kd-11	621fab2ad9	vk: Fix D32S8 interpolation by using integer interpolation instead of floating point - Interpolating floats is not the same as interpolating their bits! Use integer format to interpolate linearly for D32F formats instead of using R32F as intermediary	2020-01-16 11:12:08 +03:00
kd-11	086ecf4ba6	vk: Add some missing image memory barriers causing artifacting on AMD cards - There needs to be a memory barrier after each step. - TODO: Optimize scale_typeless_safe function	2020-01-16 11:12:08 +03:00
kd-11	309251ce7a	rsx: Touch locked dst memory after blit transfer operations in case it is locked by WCB/WDB	2020-01-16 11:12:08 +03:00
kd-11	74ad525566	vk: Fixup for cs_scatter job - Access to the stencil output has to be atomic as each 'word' is shared among 4 adjacent texels - TODO: Can be optimized using mirrored buffer views	2020-01-15 21:12:51 +03:00
Eladash	85695c8bac	rsx: FIFO wake-up pause control	2020-01-15 19:54:23 +03:00
kd-11	2984300385	vk: Fix invocation alignment to support non-power-of-2 alignment	2020-01-15 15:42:36 +03:00
kd-11	ac4cadf538	vk: Fix word index counting for shuffle tasks	2020-01-15 15:42:36 +03:00
kd-11	175f78f5b3	vk: Lower default compute heap size to 64M - There is no need to guess and use a large memory footprint as the heap is now dynamic.	2020-01-15 15:42:36 +03:00
kd-11	3d96fe79cc	vk: Implement dynamic sized compute heap - Implements a dynamically sized compute heap to allow growing up the size if it is too small.	2020-01-15 15:42:36 +03:00
Eladash	1ccb3c4492	rsx: Verify local memory offset	2020-01-15 13:23:56 +03:00
kd-11	8bbda3dedb	vk: Restructure command queue flushing behavior to avoid deadlock - Queueing commands on the offloader is a good idea but unfortunately page faults can still happen causing a cyclic dependency and eventual deadlock. Characterized by a vk::wait_for_event timed out error accompanied by severe hitching. - Drain the fault-able commands before pushing a submit operation to the queue. If a fault is in progress, bypass the queue system and submit raw. Technically this is incorrect but there isn't much that can be done about it right now.	2020-01-14 14:32:40 +03:00
kd-11	db5d03c340	vk: Generate dynamic binding table based on the capability of the drivers - This alleviates constraints imposed on shaders to allow running on some not-so-great platforms.	2020-01-09 15:38:23 +03:00
kd-11	ef3b0db7d8	vk: Workaround for NVIDIA occlusion query failure - When using partial results on NVIDIA, a non-zero result is returned even when the draw is fully occluded. This, I believe, violates spec which says the partial result shall be between 0 and the final result.	2020-01-08 19:02:45 +03:00
kd-11	3f34a0196c	overlays/osk: Add linear fade-in/out effect to OSK	2020-01-07 21:31:19 +03:00
kd-11	ecf00be155	rsx: Add color interpolation animation - Adds color interpolation and modulation pass and refactors the code a bit. Elements with this pass applied have their color modulated by the animated color from the pass. Modulation transform is multiplicative.	2020-01-07 21:31:19 +03:00
Nick Renieris	5bace118a7	overlays: Redesign animation system (add easing functions, fix bugs) Instead of speed, direction and distance, the user now specifies start/end offsets and how much time the transition should take. Fixes: - Stuttering caused from framerate estimation. - An edge case where animations would go over their supposed limit. Adds: - The ability to specify arbitrary easing functions for the animations - Implemented quadratic ease in and ease out and cubic ease in/out. - Usage of cubic ease in/out in the trophy notification	2020-01-06 22:42:07 +03:00
Nick Renieris	28770c1580	overlays: Move vertex & vector utility classes to new file	2020-01-06 22:42:07 +03:00
Nick Renieris	192912131e	rsx: Update vblank count in LLE mode	2020-01-06 22:42:07 +03:00
Dravonic	94d2f97f27	Multithreaded shader compliation follow-up (#7190 ) * Multithreaded load pipeline entries shader compliation stage Co-authored-by: kd-11 <15904127+kd-11@users.noreply.github.com>	2020-01-06 21:59:59 +03:00
kd-11	7f09def94e	rsx/vp: Properly initialize output registers. - All registers tested on hw show contents to be 0, 0, 0, 1. Make default output registers match this pattern.	2020-01-05 18:06:08 +03:00
kd-11	bdb5115c7f	rsx/overlays: Improve space usage on trophy dialog - Slightly increases the size of the trophy dialog and the font size. The old dimensions did not work with some libre fonts causing alignment errors and other problems.	2020-01-04 16:36:49 +03:00
kd-11	3ada97d2d3	rsx/overlays: Implement trophy notification queue - Allows to display more than one trophy at a time. Trophy notifications will simply get queued up and displayed at appropriate time.	2020-01-04 16:36:49 +03:00
kd-11	31b07fece5	rsx/overlays: Add support for animations - Adds animation support. This commit adds the base framework and implements a translate animation used to slide elements around the screen. This is then used to implement the sliding animation for the trophy notification.	2020-01-03 20:33:32 +03:00
Megamouse	5e7d25ad35	overlays: refactor shader loading dialogs	2020-01-03 14:22:40 +01:00
Megamouse	d94d094a7e	overlays: fix non-interactive dialog loops	2020-01-03 14:22:40 +01:00
Megamouse	c9aee27d48	VK: remove unused init function declaration	2020-01-03 14:22:40 +01:00
kd-11	d12762414a	vk: Change default vertex output value - Prefer w!=0 to avoid a situation where xyz/w = nan. More of a theoretical problem, but some calculations break down in such a situation.	2020-01-03 10:35:53 +03:00
kd-11	7786681954	rsx: Improve MTRSX synchronization - Properly synchronize DMA transfers when handling RSX pipeline barriers. Texture read barrier is used to signify completion of DMA routines and is often used to signal that Cell can overwrite vertex data!	2020-01-03 10:35:53 +03:00
kd-11	c4e59b5115	vk: Clamp depth export in FS - PS3 matches OGL behavior where writing to the depth export register results in clamping.	2020-01-01 22:39:20 +03:00
Eladash	9690854e58	Some cleanup * Prefer default initializer over std::memset 0 when possible and more readable. * Use std::format in trophy files name obtaining. * Use vm::ptr<>::operator bool() instead of comparing vm::ptr to vm::null or using addr(). * Add a few std::memset calls in hle where it matters (or in some places just to document an actual firmware memcpy call).	2019-12-31 22:27:27 +03:00
kd-11	915cf0bae8	vk: Do not leak mapped memory	2019-12-31 13:56:14 +03:00
Megamouse	c4b4ce46b8	cellSaveData: don't pause apps during dialogs	2019-12-29 14:22:58 +01:00
kd-11	24cb48971e	vk: Fix cb chunk synchronization deadlock	2019-12-29 13:49:46 +03:00
kd-11	e1b734fd12	rsx: Fix linux build	2019-12-29 13:49:46 +03:00
kd-11	ed2bdb8e0c	rsx: Zcull synchronization tuning - Also fixes a bug where sync_hint would erroneously update the sync tag even for old lookups (e.g conditional render using older query)	2019-12-29 13:49:46 +03:00
kd-11	fdb638436f	rsx: Add toggle for zcull sync behaviour - Adds a relaxed sync mode where ZCULL reports are lazily nudged into flushing and the main core does not actually wait for the event to finish before proceeding - Can drastically improve performance in cases where the game actually does not utilize the report data	2019-12-29 13:49:46 +03:00
kd-11	9f94a6dc11	vk: Refactoring and optimizations to query handling - Caches query results when looking up report availability to avoid entering driver code twice. - Minor code restructuring	2019-12-29 13:49:46 +03:00
kd-11	55ad9244c0	vk: Switch occlusion pool to FIFO rather than LIFO to avoid hard stall	2019-12-29 13:49:46 +03:00
kd-11	cdd9c12132	vk: Emulate conditional rendering for AMD	2019-12-29 13:49:46 +03:00
kd-11	93895838c7	vk: Implement hw conditional rendering	2019-12-29 13:49:46 +03:00
kd-11	a51395370e	vk: Implement multithreaded command submission - A few nagging issues remain, specifically that partial command stream largely caused by poor synchronization structures for partial CS flush and also the fact that occlusion map entries wait on a command buffer and not an EID!	2019-12-29 13:49:46 +03:00
kd-11	5be7f08965	rsx: Restructure ZCULL report retirement - Prefer lazy retire model. Sync commands are sent out and the reports will be retired when they are available without forcing. - To make this work with conditional rendering, hardware support is required where the backend will automatically determine visibility by itself during rendering.	2019-12-29 13:49:46 +03:00
kd-11	8dfea032f2	rsx: Remove deprecated do_method path that has been superceded by c++ inheritance for many years	2019-12-29 13:49:46 +03:00
Megamouse	ef6f565dbd	silence some annoying warnings	2019-12-28 15:40:57 +01:00
Emmanuel Gil Peyrot	9b77febd10	RSX: Remove two empty cpp files	2019-12-23 00:02:57 +03:00
linkmauve	e9c5c6e6bf	Move input to its own directory (#7126 )	2019-12-22 17:39:42 +01:00
Eladash	db4041e079	Implement rounded_div Round-to-nearest integral based division, optimized for unsigned integral. Used in sceNpTrophyGetGameProgress. Do not allow signed values for aligned_div(), align().	2019-12-20 14:47:04 +03:00
Emmanuel Gil Peyrot	e30173a835	rsx: Make X11 optional on Linux This makes it possible to build rpcs3 on a pure Wayland system, without the Xlib installed.	2019-12-20 10:48:03 +00:00
Nekotekina	321f7e7197	Fix missing-braces warnings	2019-12-13 03:21:43 +03:00
kd-11	73236efe58	vk: Remove some outdated code (#7060 )	2019-12-12 16:29:55 +03:00
Eladash	6a926daee7	rsx: Delay FIFO recovery point creation if is in in_begin_end scope (#7080 )	2019-12-12 15:38:56 +03:00
Eladash	7260af032e	rsx: Ignore or recover from unknown primitives This also fixes a bug when recovering FIFO or creating such recovery point inside in_begin_end == true scope.	2019-12-11 00:11:12 +03:00
Nekotekina	835892aa51	C-style cast cleanup VII	2019-12-05 02:10:15 +03:00
Nekotekina	d2fd3c6bc4	Commit `377e7d2a73`	2019-12-04 21:32:08 +03:00
Nekotekina	377e7d2a73	C-style cast cleanup VI	2019-12-04 17:56:22 +03:00
scribam	2eaaf5b132	vk: Add sampleRateShading to the list of device enabled features	2019-12-04 12:59:38 +03:00
Nekotekina	185c067d5b	C-style cast cleanup V	2019-12-03 17:23:00 +03:00
Nekotekina	28eacc616a	C-style cast cleanup III	2019-12-01 00:32:44 +03:00
Nekotekina	5b9df53c13	C-style cast cleanup (partial) Replace C-style casts with C++ casts.	2019-11-29 00:35:23 +03:00
Megamouse	f2b530823b	overlays: add dynamic switch for perf overlay	2019-11-27 10:34:03 +01:00
kd-11	8ca53f9c84	rsx: Remember to min-max the anchor indices of a polygon or triangle fan	2019-11-24 19:01:57 +03:00
kd-11	429a76a140	rsx: Remove redundant check	2019-11-23 16:11:18 +03:00
kd-11	41e7d2aa0a	rsx: Select correct image aspect for blit engine targets.	2019-11-19 13:18:15 +03:00
kd-11	fd751e3e7b	rsx: Improve blit format mismatch detection	2019-11-19 13:18:15 +03:00
kd-11	41c3180276	rsx: Fix invalid format checks for DMA sections which are typeless	2019-11-19 13:18:15 +03:00
kd-11	9dab0575fa	rsx: Add missing format check for the RTV<->DSV transfer case - TODO: Rewrite resource handling routines	2019-11-18 13:17:00 +03:00
kd-11	4a0e1c79ed	rsx: Improve format validation for blit engine - Check all possible cases where format mismatch is possible. - Warn if a slow path is going to be taken. Should help with future optimizations.	2019-11-18 13:17:00 +03:00
kd-11	c415578e79	vk: Clamp buffer row length to never be less than declared width - Fixes some games with broken textures	2019-11-18 13:17:00 +03:00
kd-11	2408922806	rsx: Do not ignore clamping for some routines that do not have implied range	2019-11-18 13:17:00 +03:00
kd-11	c10aa360b1	rsx: Remove more deprecated methods	2019-11-18 13:17:00 +03:00
Megamouse	a17a5a76a0	overlays: avoid division by zero	2019-11-15 14:53:18 +01:00
Megamouse	fb96047d2f	overlays: add settings for overlay graphs	2019-11-15 14:53:18 +01:00
Megamouse	dd1707bd46	overlays: fix center options when graphs are shown	2019-11-15 14:53:18 +01:00
Megamouse	d6b0361a02	overlays: perf_metrics_overlay to seperate header this is done to prevent severe conflicts with upcoming changes	2019-11-15 14:53:18 +01:00
Anuskuss	7e31c30133	Intel iGPU needs workaround on Windows	2019-11-15 12:08:16 +03:00
Nick Renieris	cc59d319e1	overlay: Performance graphs	2019-11-12 20:43:09 +01:00
kd-11	8234bdb8f0	vk: Check for heap change events after a grow to avoid spec violations - Avoid referencing the old buffer in stale views. Status can be set globally if requested during heap creation.	2019-11-10 17:53:12 +03:00
kd-11	5968427a2f	vk: Initialize queries before use - The spec does not guarantee that queries are initialized. In fact, it now says all queries must be reset before they are used for the first time.	2019-11-10 17:53:12 +03:00
kd-11	8ea9bc9874	vk: Reduce memory allocation sizes of default heaps - The heaps will grow as desired, no need to overallocate to cater to the most resource-hungry games	2019-11-10 17:53:12 +03:00
kd-11	0a32d478df	vk: Enable auto-growing of the data heaps for the performance case	2019-11-10 17:53:12 +03:00
kd-11	357e0d2097	vk: Implement explicit runtime flags to manage events like heap sync	2019-11-10 17:53:12 +03:00
kd-11	f359342721	rsx: Implement mutable ring buffers with grow support	2019-11-10 17:53:12 +03:00
kd-11	5f39a594ac	rsx: Clean up some unused legacy methods unnecessary after d3d removal	2019-11-10 17:53:12 +03:00
Emmanuel Gil Peyrot	56f82d2701	rsx: Wrap gsl::span definition into Utilities/span.h	2019-11-09 20:00:50 +01:00
Emmanuel Gil Peyrot	f76720ceb0	Remove extraneous ::narrow<int>() calls GSL’s gsl::span didn’t use the correct type for its index_type, which is why they were needed.	2019-11-09 19:30:06 +01:00
Emmanuel Gil Peyrot	72cdf0b04c	Replace gsl::span’s implementation with tcbrindle’s This implementation optimises correctly on all relevant compilers, unlike GSL’s which gave extremely slow code on any compiler other than MSVC. Supersedes #6948.	2019-11-09 19:30:06 +01:00
Emmanuel Gil Peyrot	ef368c5171	rsx: Replace gsl::byte with C++17’s std::byte	2019-11-09 19:30:05 +01:00
kd-11	7072489a6e	rsx: Implement point sprite coordinate generation - When the point sprite flag is set, overrides the input similar to the 2D mask. The returned X and Y values are always the gl_PointCoord values for the fragment. - Stacks with the 2D mask to override the z and w coordinates.	2019-11-09 12:50:53 +03:00
kd-11	63673b1a9f	rsx: Implement full color remap for the D24S8->ARGB8 converter	2019-11-08 19:11:59 +03:00
kd-11	8d1505752f	rsx: Validate depth test setup to avoid address contention	2019-11-07 11:32:44 +03:00
kd-11	508ffcb775	vk: Compute kernel fixups - Adhere to workgroup count limits as exposed by the GPU vendor. They already execute properly even when going beyond the limits but this removes validation noise. - Fix invocation counts for deswizzle kernel. The count was incorrect if blocksize was not 4, causing a bunch of useless work to be done.	2019-11-05 22:07:22 +03:00
kd-11	99d71fdc2a	vk: Implement layer batching for the GPU swizzle decoder - Handles all LODs per layer meaning cubemaps are now fully handled in 6 passes instead of 6 * (log2(width)) passes. - Handles all LODs of a 3D texture in one pass as well. - The improvements do warrant dropping down the number of allowed compute invocations a bit	2019-11-05 22:07:22 +03:00
kd-11	7a0b94f343	vk: Minor compute optimizations - Remove use of uniform buffers for compute static data. Use push constants instead. - Minor touchups to the deswizzle code to avoid redundant data copies.	2019-11-05 22:07:22 +03:00
kd-11	1266b63135	vk: Enable gpu deswizzling	2019-11-05 22:07:22 +03:00
kd-11	9cd3530c98	rsx: Set up framework for hw deswizzle	2019-11-05 22:07:22 +03:00
kd-11	57d3c9e171	rsx: Take empty queries into account for engines that spam report reads. - Some games will spam the report queue with requests but have zpass statistics enabled.	2019-11-04 18:48:41 +03:00
kd-11	2a8f2c64d2	rsx: Implement report transfer deferring - Allow delaying report flushes triggered by image_in or buffer_notify - When the report is ready, all the delayed transfers will automatically be done. - TODO: Make this configurable?	2019-11-04 18:48:41 +03:00
kd-11	3e0f9dff4d	vk: Improve zcull synchronization - Use zcull sync hints more aggressively	2019-11-04 18:48:41 +03:00
kd-11	fe3c290d03	vk: Reimplement occlusion result reading - Implement partial result reads	2019-11-04 18:48:41 +03:00
kd-11	51e0eaaddc	rsx: Implement backend notification for upcoming zcull reads	2019-11-04 18:48:41 +03:00
kd-11	df63de8f16	rsx: Allow u32 restart index with full index width	2019-11-04 16:56:34 +03:00
kd-11	6b3af09fa5	vk: Improved crash message for missing MSAA features	2019-11-04 16:56:34 +03:00
kd-11	bbed791ee0	vk: Add explicit support for identity image views - Allows bypassing all remap shenanigans to make some operations that rely on the raw image to work correctly.	2019-11-01 19:35:46 +03:00
kd-11	63bbf11a76	vk: Add video out calibration pass - Adds gamma correction and RGB range filters to output to match PS3	2019-10-31 14:43:24 +03:00
kd-11	78aefe5b5e	rsx/overlays: Add support for other primitive types other than triangle_strips	2019-10-31 14:43:24 +03:00
Nekotekina	e3e7051ed3	Minor optimization in BufferUtils.cpp Don't use PSHUFB for horizontal operations. Utilize PHMINPOSUW to compute max as well: + sse41_hmin_epu16 + sse41_hmax_epu16	2019-10-30 18:52:34 +03:00
Nekotekina	b1968769b7	Minor cleanup in BufferUtils.cpp Replace inline asm with intrinsic using target attribute trick.	2019-10-30 17:53:51 +03:00
linkmauve	cfd5cf6bdb	Optimise primitive_restart::upload_untouched() (#6881 ) * rsx: Optimise primitive_restart::upload_untouched() with SSE4.1 This optimisation is only applied when skip_restart is false. I’ve only tested the u16 codepath, as it is the one used in NieR. In some very unscientific profiling, this function used to take 2.76% of the total frame time at the save point of the port town, it now takes about 0.40%. * rsx: Mark all SSE4.1 functions with attributes on gcc and clang This assures the compiler we will take care of only calling these functions after having checked that the CPU does support these instructions. * rsx: Add an AVX2 implementation of primitive restart ibo upload * rsx: Remove redefinition of SSE4.1 instructions Now that clang is aware that our functions are compiled with SSE4.1, it lets us generate this code using its intrinsics. * rsx: Optimise vector to scalar conversion This is done using minpos and srli intrinsics and generate less code than before. Thanks Nekotekina for the suggestion!	2019-10-30 16:42:44 +03:00
kd-11	35794dc3f2	vk: Add checks for alphaToOne support - This feature is very rarely used, as alphaToCoverage is commonly used as a replacement for blending, not in addition to it.	2019-10-30 01:06:28 +03:00
kd-11	eda09489b2	vk: Optionally ignore depth bounds testing on hardware that does not support it.	2019-10-29 20:03:54 +03:00
kd-11	7a5c20ef85	vk: Minor spec touchups - Simplify active instance management. While multicontext support will be required in future, this is better done with multiple logical devices rather than multiple instances. - Destroy the WSI surface on exit - Enable depthBoundsTest explicitly. TODO: Properly check for supported features.	2019-10-29 20:03:54 +03:00
kd-11	aa3eeaa417	rsx: Separate subresource_layout:dim_in_block and subresource_layout::dim_in_texel - These two are not always linked when working with compressed textures. The actual texels extend past the actual size of the image if the size is not aligned. e.g if height is 1, the real height is 4, but its not possible to determine this from the aligned size. It could be 1, 2, 3 or 4 for example. - Fixes image out-of-bounds writes when uploading from CPU	2019-10-29 20:03:54 +03:00
Eladash	42fc698186	rsx: Enable primitive restart index only when needed (#6889 ) * rsx: Enable primitive restart index only when needed * rsx: Use if with initializer in read_put()	2019-10-28 23:16:27 +03:00
kd-11	479d92d075	vk: Fix uninitialized (and wrong) variable access	2019-10-28 15:20:45 +03:00
kd-11	b0708367c2	vk: Round lod bias to the nearest 0.5 to lower number of permutations when nearest mipmap sampling is used - The lambda values will be rounded to the nearest integer anyway	2019-10-28 15:20:45 +03:00
kd-11	3e8dfede1c	vk: Modify sampler cache to uniquely identify all the input parameters - Avoids iteration when variable mipmap counts or lod bias parameters change	2019-10-28 15:20:45 +03:00
kd-11	ad2add9574	rsx:: Use fcmp correctly	2019-10-28 15:20:45 +03:00
kd-11	d04241ad25	rsx: Allow compressed textures to be unaligned in size - Align based on row length but let the texture itself be of arbitrary dimensions	2019-10-28 15:20:45 +03:00
Emmanuel Gil Peyrot	69e9ee26f6	rsx: Make input_is_swizzled a template parameter This lowers the relative cost of this function from ~2.25% to ~1.80% on gcc 9 which I found quite surprising, some of it probably gets inlined better in the callers, but I haven’t been able to isolate which parts.	2019-10-28 13:28:51 +03:00
kd-11	d53d7bb598	vk: Restore vega native use of FP16 in shaders - AMD proprietary drivers should work fine	2019-10-23 12:20:06 +03:00
Emmanuel Gil Peyrot	54d95373d0	Support fullscreen properly on Wayland The current behaviour when going fullscreen from windowed was to keep the previous size of the swapchain, with black borders on all sides, which looks quite ugly. The root of this issue is that rpcs3 only checks for frame resize if vkQueuePresent() returns VK_SUBOPTIMAL_KHR, which drivers can’t do on Wayland, see https://gitlab.freedesktop.org/mesa/mesa/issues/1979	2019-10-23 12:19:46 +03:00
kd-11	e04b6cd7c0	rsx: Copypasta fix - r1 is always float4 never half4. Its a full-width register unlike the other outputs which are optionally half-width.	2019-10-23 00:50:24 +03:00
kd-11	00bc3fe658	Drop d3d12 backend	2019-10-22 21:45:14 +03:00
Emmanuel Gil Peyrot	14c63ec014	Fix misleading indent.	2019-10-22 16:11:43 +03:00
Eladash	586fe11e22	Fix cellGcm HLE regression Also correct flags.	2019-10-22 13:45:09 +03:00
Eladash	945abcc6cd	rsx: Align down index array offset * Also use improved to_be_t<> template (recetly ignoring one byte long types) for vm gsl::byte referencing, remove redundent narrow<> cast (same type)	2019-10-22 13:45:09 +03:00
kd-11	3bb70e837a	vk: Silly copypasta	2019-10-22 13:44:49 +03:00
kd-11	0b2f9f0f17	rsx: Add support for delayed shader discard. - Noticed a glitch on AMD hw and windows drivers where discard seems to affect entire 4x4 cells. - Dead fragments (outside the primitive boundary) could have their discards trigger as they do not have proper access to variables. - This introduces dead fragments along triangle edges, causing a diagonal line pattern across the screen that is very annoying.	2019-10-22 13:44:49 +03:00
kd-11	901942f24a	rsx: Replace pointless f32[4] restriction on texture parameters. - Use a struct instead to improve readability and remove pointless OpBitCast	2019-10-22 13:44:49 +03:00
kd-11	f7842b765f	rsx: Implement packed format renormalization - Renormalizes arbitrary N-bit values as 8-bit normalized. - NV hardware performs integer normalization at 8 bits if the size is less than 8. - This can cause significant arithmetic drift because the error is multiplied by a huge number when sampling.	2019-10-22 13:44:49 +03:00
Eladash	29cddc30f0	rsx: Fix vblank signals flood after Emu.Resume()	2019-10-21 15:31:45 +03:00
Eladash	5de0005f5a	rsx: Report full method range on invalid methods Also report full command on fifo desync event for the first time	2019-10-21 15:31:45 +03:00
eladash	730e9cde84	sys_rsx: Improve allocations and error checks * allow sys_rsx_device_map to be called twice: in this case the DEVICE address retrived from the previous call returned * Add ENOMEM checks for sys_rsx_memory_allocate and sys_rsx_context_allocate * add EINVAL check for sys_rsx_context_allocate if memory handle is not found * Separate sys_rsx_device_map allocation from sys_rsx_context_allocate's * Implement sys_rsx_memory_free; used by cellGcmInit upon failure * Added context_id checks * Throw if sys_rsx_context_allocate was called twice.	2019-10-21 15:31:45 +03:00
kd-11	3c44065684	gl: Fix copypasta - MSAA is still unimplemented in OGL	2019-10-20 21:38:40 +03:00
kd-11	f40f2c6215	vk: Fix minification filter description for NEAREST_MIPMAP_NEAREST. Just a typo. - Also remove mipmap filter for CONVOLUTION	2019-10-20 21:38:40 +03:00
kd-11	09de3b7974	rsx: Tweak behaviour of the "Use GPU texture scaling" option - If either source data or dest is a render target, do image operations on the GPU same as before - If swizzle is desired, use CPU fallback - If no scaling and no format conversion is required, use CPU fallback - If scaling is desired and the transfer target is in local memory, use the GPU - When doing trivial copies, use the routine in rsx_methods instead of duplicating code. Also has the benefit of better range checking.	2019-10-20 21:38:40 +03:00
kd-11	868547aec8	rsx: Minor improvement to fbo region invalidation - When commiting a block as fbo, keep blit_dst data as well. - Avoids removing (and losing data from) blit targets that just happen to share a page with a framebuffer.	2019-10-20 21:38:40 +03:00
kd-11	996534c559	rsx: Fixup for aspect mismatch	2019-10-20 15:25:07 +03:00
Eladash	d4ba7f37b6	rsx util: Implement decode_fxp<>	2019-10-18 15:41:39 +03:00
kd-11	299b98b30a	vk: Disable mipmap sampling if sampling mode is does not have a mipmap filtering mode. - GL_LINEAR and GL_NEAREST always sample LOD0 so make vulkan behave the same way	2019-10-18 14:46:37 +03:00
kd-11	404073c74a	rsx: Force-align compressed formats to 4x4 texel blocks and disable 1D compressed textures. - The PS3 allows defining 1D compressed images but this obviously doesn't work well on desktop.	2019-10-18 14:46:37 +03:00
kd-11	eff4e95c99	rsx: Minor cache fixup for cyclic references. - Logic was broken by mipmaps PR. Do not issue a texture barrier if a temp copy is being done.	2019-10-18 14:46:37 +03:00
kd-11	bd1bcc6be7	vk: Remove a redundant memory barrier	2019-10-18 14:46:37 +03:00
kd-11	70642484cd	vk: Check for cyclic references if sampler is marked as do-not-cache. - Usually an indication of surface/texture cache interaction.	2019-10-18 14:46:37 +03:00
kd-11	eee2237e19	rsx: Track uncached cache resources - Uncacheable resources can be reused as soon as they're made visible to the draw call. - Since they're likely to be reused every draw call until the shader changes, it is important to reuse as much as possible	2019-10-18 14:46:37 +03:00
kd-11	decf9cfcf6	rsx: Notify the backend to release or delete temporary surfaces after we're done with them.	2019-10-18 14:46:37 +03:00
kd-11	97ed95d21b	vk: Add video memory manager to monitor VRAM usage	2019-10-18 14:46:37 +03:00
kd-11	1046184dd0	rsx: Fix some uninitialized variables flagged by valgrind	2019-10-18 00:32:38 +03:00
kd-11	5af8a9fbbc	rsx: Fix decoding of some fixed point texture parameters - Checked envydocs and found the correct format as fixed-point 4.8 with optional sign bit	2019-10-17 18:18:00 +03:00
kd-11	a936e43ff6	rsx: Fixup for slice gathering for structures with multiple mipmap levels - TODO: Proper multi-level assembly for non-2D structures	2019-10-17 18:18:00 +03:00
kd-11	e47b4ffb8f	rsx: Fix rsx capture crash. - Pixel coordinates are top-left not bottom-right - Solves out of bounds access	2019-10-17 18:18:00 +03:00
kd-11	e166dbccc8	rsx: Fix visibility of blit destination targets	2019-10-17 18:18:00 +03:00
kd-11	0c35595ce2	rsx: Remove the alpha-to-coverage hack that was added to hide the missing mipmaps in games - Moves to a purely stochastic function using dithering to simlulate coverage	2019-10-17 18:18:00 +03:00
kd-11	f0ed0285f3	rsx: Implement range-based subresource descriptor cache - The previous address-based approach was pretty awful when it comes to invalidating	2019-10-17 18:18:00 +03:00
kd-11	fbb9ed4e25	rsx: Add explicit range to cached subresource descriptors	2019-10-17 18:18:00 +03:00
kd-11	c9e3a321b2	rsx: Fixup for surface cache scanning - Fix regression when gathering cubemaps	2019-10-17 18:18:00 +03:00
kd-11	1ac976771c	rsx: Add some texture search options for the cache - Potentially optimizes texture cache searching using explicit options	2019-10-17 18:18:00 +03:00
kd-11	840b52fe80	rsx: Implement mipmap gathering from texture cache	2019-10-17 18:18:00 +03:00
kd-11	d6d8766f8d	rsx: Refactoring - Move some helper routines out of the cache core - Prep for multi-layered image search	2019-10-17 18:18:00 +03:00
kd-11	cb362b4085	rsx: Runtime check on RTT cast	2019-10-17 02:30:03 +03:00
kd-11	5c7bbb3354	vk: Fixup - Removes incorrect line writing stencil flags to a regular texture.	2019-10-17 02:30:03 +03:00
kd-11	d29b6cdb59	vk: Proper workaround for VEGA float16_t bugs	2019-10-16 22:40:50 +03:00
kd-11	a6e143254a	vk: Add workaround for broken format conversion in older GeForce cards	2019-10-16 22:40:50 +03:00
kd-11	4f088a102c	vk: Add kepler and maxwell tables	2019-10-16 22:40:50 +03:00
plappermaul	2171ffdab2	minor optimization for FIFO_control::read_put() (#6768 )	2019-10-14 21:26:31 +03:00
kd-11	42aa4c5000	gl: Vendor-specific tuning	2019-10-13 19:00:05 +03:00
kd-11	776fa54d22	gl: Fix missing case	2019-10-13 19:00:05 +03:00
kd-11	27f48fbc06	gl: Rewrite image transfer operations to support image subregions - Working exclusively with full sized images is very expensive	2019-10-13 19:00:05 +03:00
kd-11	d9a9766e41	gl: Refactoring and fallback support for compute acceleration	2019-10-13 19:00:05 +03:00
kd-11	b39bfa02a6	gl: Windows bringup	2019-10-13 19:00:05 +03:00
kd-11	105d4b51e6	gl: Use compute shaders for typeless texture decode	2019-10-13 19:00:05 +03:00
kd-11	7a6e2e716f	gl: Add a framework for compute shaders	2019-10-13 19:00:05 +03:00
Markus Stockhausen	4d99169d51	Patch v2 for vkCreateInstance() as requested	2019-10-11 21:16:36 +03:00
Markus Stockhausen	8adcb8046b	Patch for vkCreateInstance() patch as requested	2019-10-11 21:16:36 +03:00
Markus Stockhausen	f5817cb430	Error handling for vkCreateInstance() Cry in log if initialization failed.	2019-10-11 21:16:36 +03:00
Eladash	397007cf8b	rsx: Fix FIFO_DRAW_BARRIER substituation	2019-10-11 12:34:53 +03:00
Eladash	9242f16560	rsx: Improve FIFO recovery from flip	2019-10-10 19:34:23 +03:00
Eladash	06017cb14e	rsx: Recover from invalid writes to CELL_GCM_NV4097_SET_INDEX_ARRAY_DMA Also: Trigger a FIFO recovery when encountering an invalid method.	2019-10-10 19:34:23 +03:00
Eladash	2eaf5df60b	rsx: Register some more methods	2019-10-10 19:34:23 +03:00
kd-11	305a5bd717	typo fix	2019-10-05 12:01:46 +03:00
kd-11	4a19a2dd24	rsx: Explicity describe transfer regions for both source and destination blocks	2019-10-04 18:10:46 +03:00
kd-11	7aed9c3f13	gl: Add missing input declarations for 2-sided lighting	2019-09-30 21:52:43 +03:00
kd-11	88229f4716	gl: Remember to unbind attachments from active framebuffer after clear - If a stale reference is left lying around (e.g the texture bound to depth has been deleted and we attach a color image) no operations actually take place. glCheckFramebufferStatus also does not catch this problem.	2019-09-30 21:52:43 +03:00
Eladash	0b2fa6ffdc	rsx: Flush FIFO GET before smeaphore_acquire	2019-09-30 17:30:15 +03:00
Eladash	70b4ae6bd6	rsx: Optimize FIFO PUT masking	2019-09-30 17:30:15 +03:00
kd-11	bcf8799079	rsx: Fix missing point size export - Sometimes program-point-size is enabled, but the vs does not actually write to the point size register. In this case, pass the incoming point size along instead of the default register init.	2019-09-30 01:40:04 +03:00
Eladash	319fc8c55d	rsx: Mask FIFO PUT on rsx execution	2019-09-29 13:05:24 +03:00
Eladash	822287b418	rsx: Avoid unsigned/signed mismatch with fifo ret addr	2019-09-29 13:05:24 +03:00
kd-11	8cfd3b56d6	vk: Increase wait timeout in case of problematic GPU loads causing heavy stutter - When compiling LLVM objects, it is possible to starve the driver thread and cause the timeouts to trigger - Observed in RE6 when using SPU LLVM since the game generates a very large number of objects "infinitely"	2019-09-29 11:39:22 +03:00
kd-11	ef5b56bc48	rsx: Align width properly when normalizing to avoid fractional results being lowered to 0	2019-09-29 11:39:22 +03:00
kd-11	69c090b14a	vk: Check frame descriptors before rendering in case of a flip request between begin() and end() - There is no reason to delay async flip requests since most of the work can be handled during rendering anyway	2019-09-29 11:39:22 +03:00
kd-11	1464069476	rsx: Restructure deferred flip queue handling - Allows frameskipping to occur naturally if RSX thread is bombarded with flip requests but just jumping to the last one if possible - See request_emu_flip() for async frame submission and implicit skipping - Also allows display queue to fill faster than the flip thread can drain the queue	2019-09-28 21:13:56 +03:00
Nekotekina	bd1a24b894	Tidy endianness support (se_t) implementation Move se_t and se_storage to util/endian.hpp Use single template instead of two specializations. Add minor optimization for MSVC. Remove v128 dependency. Try to enable intrinsics for unaligned data. Fix minor bug in u16/u32/u64 specializations.	2019-09-28 15:39:50 +03:00
kd-11	2275259bf5	rsx: Properly scale overlay passes to match drawable area	2019-09-28 13:24:14 +03:00
kd-11	28534e8833	gl: Remove a debug print	2019-09-28 13:24:14 +03:00
kd-11	e53e98749f	rsx: Add missing initialization	2019-09-27 21:07:56 +03:00
Nekotekina	3c72069ae6	cellOskDialog: use g_fxo	2019-09-26 23:26:36 +03:00
Nekotekina	5f9c5e8765	Use g_fxo for rsx::thread	2019-09-26 23:26:36 +03:00
kd-11	ee0633f43a	vk: Add turing workaround - Turing crashes if using the depth->color transfer hack	2019-09-26 20:12:25 +03:00
kd-11	acc986be3f	vk: Add chip family detection	2019-09-26 20:12:25 +03:00
Jan Beich	5ec35c7daa	rsx: unbreak build with Clang 9 ld: error: rpcs3/CMakeFiles/rpcs3.dir/main_application.cpp.o: unable to find library from dependent library specifier: opengl32.lib ld: error: rpcs3/Emu/librpcs3_emu.a(GLGSRender.cpp.o): unable to find library from dependent library specifier: opengl32.lib ld: error: rpcs3/Emu/librpcs3_emu.a(GLRenderTargets.cpp.o): unable to find library from dependent library specifier: opengl32.lib ld: error: rpcs3/Emu/librpcs3_emu.a(GLVertexBuffers.cpp.o): unable to find library from dependent library specifier: opengl32.lib	2019-09-24 01:00:45 +03:00
Megamouse	7193d407b9	Input: Remove unused flush member	2019-09-20 22:12:40 +02:00
kd-11	1a892c6b1b	rsx: Avoid recursion in flip handler	2019-09-20 15:08:41 +03:00
Megamouse	aa7eb1536a	overlays: fix enter button assignment in osk	2019-09-20 10:53:09 +02:00
kd-11	e0005ec347	rsx: Refactoring and improvement - Separate displayed statistics from actual backend statistics. Allows asynchronous flipping to work correctly as it just uses display stats. The real stats are used by the frame scope marker to determine behavior like engaging the FIFO optimizer or skipping draw calls correctly.	2019-09-19 23:10:09 +03:00
kd-11	2c76f47eec	rsx: Restructure flip code and frame scoping - Add an explicit frame scope marker tied in with the queue_prepare command Since queue_prepare is emitted at the end of a frame, it can be used as end-of-frame in games that emit this - If this command is not emitted, fifo flatenner and frameskip will not work	2019-09-19 23:10:09 +03:00
Nekotekina	a4951ec407	Use g_fxo for global lv2_memory_container	2019-09-18 21:24:04 +03:00
kd-11	bd4d86f87a	vk: Properly test MSAA sample mask when switching between states inside a RSX renderpass. - Before, these changes would be lost if the same RTT config was used with varying mask setups	2019-09-18 15:42:59 +03:00
kd-11	c59cb1bdd3	rsx: Allow only sse4.1 capable CPUs to take the accelerated index path - Older sets lack the required min/max functionality	2019-09-13 12:28:52 +03:00
kd-11	52e8747b83	rsx: Workaround for exit deadlock - Avoids games locking up when the stop button is pressed	2019-09-12 23:32:21 +03:00
kd-11	cc313b052f	rsx: Improve hit testing when scanning for overlapping surfaces - Calculate exact sizes when doing hit tests to avoid false negatives - Defer page checking until actually require to do memory setup - Introduce align2 helper to do non-pow2 alignments	2019-09-12 23:32:21 +03:00
kd-11	9842823a8c	rsx: Check if memory actually exists when overallocating blit targets	2019-09-12 23:32:21 +03:00
kd-11	cd1345b6bb	rsx: Do not use nul section if resolution scaling is active on a surface	2019-09-12 23:32:21 +03:00
kd-11	858014b718	rsx: Experiments with nul sink	2019-09-12 23:32:21 +03:00
kd-11	212ac19c11	vk: Reimplement DMA synchronization	2019-09-12 23:32:21 +03:00
kd-11	f06559412e	vk: RDB fixup	2019-09-12 23:32:21 +03:00
kd-11	7fdb4976d8	rsx: Remove log spam for cond render	2019-09-12 14:08:21 +03:00
kd-11	60845daf45	rsx: Improve use of CPU vector extensions - Allow use of intrinsics when SSSE3 and SSSE4.1 are not available in the build target environment - Properly separate SSE4.1 code from SSSE3 code for some older proceessors without SSE4.1	2019-09-12 14:08:21 +03:00
kd-11	27af75fe71	rsx: Fixup for blit engine when moving inverted regions - Properly calculate overlap range when sections are inverted - Simplify transfer logic for inverted regions	2019-09-11 23:30:55 +03:00
kd-11	412c620b9d	rsx: Allow sampling from shader_read resources for blit engine - With harmonization between all texture types implemented, there is no difference between blit_engine_src and shader_read for supported formats - Adds extra format filtering to ensure no conflicts when copying data	2019-09-10 16:54:02 +03:00
kd-11	75fcfac00e	rsx: Modify find_cached_texture to respect gcm_format. Can pass 0 for "dont care"	2019-09-10 16:54:02 +03:00
kd-11	d1603fbb0b	vk: Crop malformed image descriptors - Some image descriptors (lle vdec?) are malformed with pitch being smaller than width - Crop these for now pending hardware tests	2019-09-08 18:22:27 +03:00
kd-11	f53361b966	rsx: Fix fast texture copy when src_pitch != width * block_size - Happens on mipmapped linear images	2019-09-08 18:22:27 +03:00
kd-11	0af9685381	rsx: Deprecate surface_transform::argb_to_bgra which is no longer required. - vulkan now uses native swizzle mapping for both surface and texture	2019-09-08 13:56:41 +03:00
kd-11	312bf6840e	vk: Fix surface_transform::argb_to_bgra transfers when no scaling is requested	2019-09-08 13:56:41 +03:00
kd-11	cbce309199	vk: Fix depth_stencil scaling	2019-09-08 13:56:41 +03:00
kd-11	48a5cd545f	gl: Do not byteswap uint24_8 as it needs a custom 8_24 decoder	2019-09-08 13:56:41 +03:00
kd-11	440d58f2ff	vk: Batch compute jobs when doing texture upload - Reduces overall number of invocations	2019-09-07 16:23:20 +03:00
kd-11	6aa0b49dbc	vk: Prefer using native alignment when uploading. - Allows using fast copy paths and reduces memory and compute footprint	2019-09-07 16:23:20 +03:00
kd-11	a3a0cb8c17	rsx: Minor texture optimizations	2019-09-07 16:23:20 +03:00
kd-11	efa501dac6	rsx/vp: Set default inputs to (0, 0, 0, 1) - From some hw tests, it seems this is the default.	2019-09-06 17:08:28 +03:00
kd-11	f8dbe281a5	glsl: Explicitly declare const inputs as such - Avoids copying the values to temp variables before invoking function calls - Generates shorter, cleaner AST and SPV bytecode	2019-09-06 17:08:28 +03:00
kd-11	14aa3b3360	vk: Remember to allocate enough vertex layout storage objects! - vertex_layout_storage descriptors were added but the descriptor count was not updated	2019-09-05 19:43:39 +03:00
kd-11	360c0e9af6	vk: Restructure commandbuffer scoping to allow faults in vertex upload - Defer renderpass open to allow recovery after fault in the middle of vertex upload	2019-09-05 19:43:39 +03:00
kd-11	9dc06cef7f	rsx: Do not include ro data when attempting to do section merge - Avoids crazy situations like trying to merge from a 3d or cubemap in memory	2019-09-02 16:49:04 +03:00
kd-11	e99e8460fe	rsx/texture_cache_utils: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	27fabd7607	rsx/ring_buffer: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	33609717f8	rsx/cache: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	f8617500b5	rsx/methods: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	7f7b499303	rsx/util: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	0158a88c88	rsx/textures: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	401bd9112a	rsx/prog: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	652f18ebaa	rsx/buffers: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	6504daa713	overlays: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	94656ac1e3	rsx/vp: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	0ee9d7b46d	rsx/fp: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	756fdedbf6	vk: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	a7b9ff33d8	gl: Warnings cleanup	2019-09-01 18:59:50 +03:00
kd-11	7f99de36c1	rsx: Fixup for surface_target_a flag being broken - While the mask for surface_a is at index 0, the surface cache expects the order to be maintained correctly! Set the correct mask since surface store now checks each RTT individually	2019-08-30 21:46:19 +03:00
kd-11	61af2b7dfc	vk: Workgroup tuning for different vendors	2019-08-30 21:46:19 +03:00
kd-11	99fb6d6a5d	rsx: Allow GPU-accelerated stream manipulation when doing texture uploads	2019-08-30 21:46:19 +03:00
kd-11	e0a7912d7c	rsx: Check for stencil writes when determining zeta_write flag	2019-08-30 21:45:41 +03:00
kd-11	04c808b8ab	rsx: Fixup for MRT color write lookup and surface_target_a	2019-08-28 16:12:10 +03:00
kd-11	e55d216619	rsx: Workarounds for some buggy games - Replace assert with log message until hardware testing confirms findings	2019-08-28 14:54:51 +03:00
kd-11	e334a43169	rsx: Fix surface cache hit tests - Avoid silly broken tests due to queue_tag being called before pitch is initialized. - Return actual memory range covered and exclude trailing padding. - Coordinates in src are to be calculated with src_pitch, not required_pitch.	2019-08-28 14:54:51 +03:00
JohnHolmesII	cca6a19cdd	Fix improper string concatenation in rsx_decode.	2019-08-28 01:26:14 +03:00
kd-11	2962e05f26	rsx: Implement per-RTT color masks - Also refactors and simplifies some common code in surface store and rsx core	2019-08-27 21:59:02 +03:00
kd-11	27aeaf66bc	gl: Restructure buffer objects to give more control over usage - This allows creating buffers with no MAP bits set which should ensure they are created for VRAM usage only - TODO: Implement compute kernels to avoid software fallback mode for pack/unpack operations	2019-08-27 21:59:02 +03:00
Nekotekina	dd79a5efb7	Remove fxm::make_always	2019-08-27 03:50:15 +03:00
Nekotekina	d2eba2387b	Use g_fxo for display_manager	2019-08-27 03:50:15 +03:00
Nekotekina	928719b658	Use g_fxo for rsx::avconf	2019-08-27 03:50:15 +03:00
Nekotekina	38a06c4b14	Use g_fxo for SysRsxConfig Rename to lv2_rsx_config	2019-08-27 03:50:15 +03:00
kd-11	3317e13b64	rsx: Hotfix for semaphore timeout bug - Add pending flip requests as a reason to invoke the RSX local task handler and release the vblank semaphore	2019-08-26 22:33:29 +03:00
Megamouse	32b5b11a83	cellSaveData/overlays: prevent possible array out of bounds in list view	2019-08-26 21:09:20 +02:00
kd-11	eed32cf3a4	rsx: Decompiler fixups and improvements - Fix 2D coordinate sampling of W coordinate. W is actually HPOS.w and not 1. Z is however always 0. - Optimize register usage a bit Disassembling compiled SPV shows that global declaration results in less ops than using inout modifiers. Modifiers generate extra mov instructions.	2019-08-26 20:03:31 +03:00
kd-11	3e28e4b1e0	rsx/decompiler: Restructure program register behavior - Fix reading of varying registers in FP Different registers have different behavior - Always write to varying registers. If a register is not written to, it is initialized to (0, 0, 0, 1) - Reimplements two-sided lighting correctly without hacks - Also bumps shader cache version	2019-08-26 20:03:31 +03:00
kd-11	fe6ff8622a	rsx: Decompiler fixups for conditional execution - Cond actually obeys vector mask	2019-08-26 20:03:31 +03:00
kd-11	f9aea076ae	rsx: Implement depth_buffer_float support. - Since this is transparent to the application at all time, it only becomes a problem when doing memory transfer or DEPTH->RGBA conversion in shaders.	2019-08-26 20:03:31 +03:00
kd-11	9d981de96d	rsx: Fix offloader deadlock - Do not allow offloader to handle its own faults. Serialize them on RSX instead. This approach introduces a GPU race condition that should be avoided with improved synchronization. - TODO: Use proper GPU-side synchronization to avoid this situation	2019-08-25 22:09:20 +03:00
Megamouse	896cfd2ade	cellSaveData/cellMsgDialog: implement cellSaveDataDelete	2019-08-22 08:05:12 +02:00
Megamouse	2d29a33ea8	cellSaveData/overlays: initialize with focused entry	2019-08-22 08:05:12 +02:00
Megamouse	b3c1759853	cellSaveData/overlays/Qt: fix some warnings and a possible nullptr deref	2019-08-22 08:05:12 +02:00
kd-11	7c5bde4aeb	rsx: Update tag timestamp to match newest inherited data - Avoids memory appearing older when used for depth test without depth write The write_barrier before the call will inherit new data but the tag will not update as no new information is added.	2019-08-21 21:17:15 +03:00
kd-11	c67c97844e	rsx: Fixup for blit engine range calculations	2019-08-21 21:17:15 +03:00
kd-11	5d1b7eb945	rsx: Fix reference leaks in texture_cache<->surface_cache communication - Properly commit orphaned blocks not invalidating existing cache structures - Do not ignore overwritten objects when commiting as unprotected fbo. Avoids stale references to invalidated surface objects.	2019-08-21 21:17:15 +03:00
kd-11	ca8b0da141	gl: Invalidate range before reading to prevent deadlock	2019-08-21 21:17:15 +03:00
kd-11	141072023b	rsx: Fix handling of ARGB8 memory - Load into memory as straightforward BGRA - Fixes a bug in vulkan caused by byte shuffling in blit engine vs shader access - Removes the need for memory shuffling when transferring into a rendertarget	2019-08-21 21:17:15 +03:00
kd-11	9cd5325962	rsx: Free memory 'held hostage' by storage sections in the surface cache - Once the memory has been captured by another surface, release the allocation	2019-08-21 21:17:15 +03:00
kd-11	be98554b40	rsx: Fix surface split logic - Calculations are supposed to be done based on the properties of the outgoing surface	2019-08-21 21:17:15 +03:00
kd-11	67dac94704	rsx/fp: Zero-initialize FragDepth register to match hw	2019-08-21 21:17:15 +03:00
kd-11	35e61c77e0	gl: Fixup for D24S8 readback	2019-08-21 21:17:15 +03:00
kd-11	dca29def5e	rsx: Temporary workaround for race condition in blit engine	2019-08-18 20:45:48 +03:00
kd-11	5e299111cc	rsx/vk: Restructure surface access barriers and implement RCB/RDB - Implements render target data load (aka Read Color Buffer/Read Depth Buffer) - Refactors vulkan surface barrier to be much cleaner. - Removes redundant surface barrier invocations after doing a merged load from surface cache. - Adds explicit access modes when gathering surfaces from cache.	2019-08-18 20:45:48 +03:00
kd-11	dfe709d464	rsx: Surface cache restructuring - Further improve aliased data preservation by unconditionally scanning. Its is possible for cache aliasing to occur when doing memory split. - Also sets up for RCB/RDB implementation	2019-08-18 20:45:48 +03:00
Eladash	500a4fa2fb	rsx: Fix potential out of range methods execution (can result in segfaults)	2019-08-17 17:26:04 +01:00
Pierre-Loup A. Griffais	56011cbddd	vk: don't die on VK_SUBOPTIMAL_KHR in AcquireNextImage, and recreate swapchain vkAcquireNextImageKHR can also return VK_SUBOPTIMAL_KHR and is non-fatal. However, it's a good idea to still recreate the swap chain later to maintain optimal presentation paths after temporary occlusion.	2019-08-16 20:09:37 +03:00
kd-11	a0f0c418d7	gl:Implement proper support for packed 16-bit rendertargets - Also some minor refactoring	2019-08-15 14:00:17 +03:00
kd-11	7f85b18b46	gl: Add support for 4444 typeless texture	2019-08-15 14:00:17 +03:00
Megamouse	8debdfcd09	handle empty callback returns	2019-08-14 23:54:09 +02:00
RipleyTom	87bf0386c4	Screenshot function	2019-08-14 19:24:42 +02:00
Eladash	7fda07eb5b	rsx: UB fix (signed vs unsigned mismatch)	2019-08-13 20:48:50 +01:00
Eladash	519fe9309e	rsx: Fix nv0039::buffer_notify	2019-08-13 20:48:50 +01:00
Eladash	527b1bb071	rsx: Fix overlapping transfer of nv3089::image_in when out_pitch != in_pitch or out_pitch != out_bpp * out_w	2019-08-13 20:48:50 +01:00
kd-11	8866a3d6a9	rsx: Cleanup for blit engine fixes	2019-08-10 16:45:02 +01:00
kd-11	033836d88c	rsx: Minor fixup for nv3089::image_in - Typo scale_x->scale_y - Remove convoluted temp buffer creation and just use vector instead	2019-08-08 15:48:22 +03:00
kd-11	f0bd0b5a7c	rsx: Conditional render sync optimization - ZCULL queue was updated to one-per-cb but the conditional render sync hint was not updated. - Do not unconditionally flush the queue unless the upcoming ref is contained in the active CB. - This avoids spamming queue flush, which frees up resources and improves performance	2019-07-30 21:13:42 +03:00
Malcolm Jestadt	d689a6e47b	vk: Don't warn RADV users on LLVM 8.0.1 - The 'back screen' issue on RADV was resolved with LLVM 8.0.1	2019-07-30 19:56:05 +03:00
Nekotekina	f63e89f9b4	Implement waitable atomics Moved Atomic.h to util/atomic.hpp List source files in CMakeLists.txt	2019-07-29 03:04:55 +03:00
Nekotekina	ec2db8edbc	Correct get_int_t to get_uint_t. Add get_sint_t.	2019-07-29 00:12:07 +03:00
kd-11	1de90bdb1f	rsx: Improve aliased data preservation - Carve out inherited region if any - Perform pitch compatibility test before assigning old_surface	2019-07-27 16:09:21 +03:00
Eladash	230c3d55b6	Fixup	2019-07-27 04:03:29 +01:00
Eladash	fcc75c8b0f	rsx: Write atomically semaphore updates and fix zcull timestamp	2019-07-26 21:27:55 +03:00
Eladash	c53f0dd7b5	rsx: Fix gcm unmap events	2019-07-26 21:27:55 +03:00
Megamouse	71c56b719c	Emu/overlays: fix background picture path	2019-07-25 08:53:07 +02:00
Eladash	85b1152e29	Timers scaling and fixes	2019-07-23 00:09:01 +01:00
kd-11	9a7c2784f0	rsx: Do not clip scissor to viewport when doing buffer clear	2019-07-20 16:39:32 +03:00
kd-11	e2574ff100	rsx: Support CSAA transparency without multiple rasterization samples enabled	2019-07-19 15:49:08 +03:00
kd-11	b5a2f0df68	rsx: Implement separate viewport raster clipping - Merge viewport raster window and scissor into one clipping region - Viewport raster clip is different from viewport geometry clipping in hardware as the latter is configurable separately	2019-07-19 14:21:19 +03:00
scribam	a268415121	vk: Use macros from Vulkan SDK	2019-07-17 17:56:29 +03:00
kd-11	ea2f4d57fa	rsx: Fixups	2019-07-17 13:29:42 +03:00
kd-11	113a49e00c	rsx: Handle cyclic references when doing memory inheritance	2019-07-17 13:29:42 +03:00
kd-11	34b06453f9	rsx: Handle lost data due to unused data sections - After splitting, the sections may not be referenced at all for anything other than just pixel storage - In such cases, either merge down or sample from the upstream source instead	2019-07-17 13:29:42 +03:00
kd-11	998717659f	rsx: Fix reference leak when cloning surfaces	2019-07-17 13:29:42 +03:00
kd-11	009e01a347	rsx: Set up for multi-section inheritance	2019-07-17 13:29:42 +03:00
Malcolm Jestadt	94af3b3f03	vk: Fix Linux Vega float16_t workaround - It was disabling float16_t for non Vega cards	2019-07-12 12:25:46 +03:00
Eladash	17c8ac9ab8	rsx: Debugger output text fix	2019-07-12 00:19:56 +03:00
Eladash	c4d8ef4340	rsx: Allow to configure vblank rate Removed "HLE protection" hack from sys_rsx_context_attribute	2019-07-12 00:19:56 +03:00
Silent	f3551cedb7	rsx: Swap R and B channels in SET_BLEND_COLOR since this color is BGRA, not RGBA	2019-07-11 22:51:01 +03:00
kd-11	2898309f68	vk: Silence some debug prints - This message confuses some users	2019-07-11 13:22:13 +03:00
kd-11	fc09572648	rsx: Implement texel border decode - Texel borders are no longer actually supported in modern APIs - Removes the border texels and uses border color instead which is incorrect but should work fine	2019-07-11 13:22:13 +03:00
kd-11	d8f753f1e8	rsx: Do not allow framebuffer surfaces that exceed their allocated pitch dimensions - Truncate surfaces to forcefully fit inside the declared region	2019-07-11 13:22:13 +03:00
Eladash	78e447e28c	rsx: Typo fix	2019-07-09 22:47:55 +03:00
kd-11	2548057ea0	vk: Improve AMD driver support - Workaround broken fp16 in AMDVLK/RADV - Do not disable primitive restart as the issue seems to have been fixed	2019-07-09 16:27:59 +03:00
kd-11	956270d9be	gl: Add readback/writeback config for format GL_R16	2019-07-09 16:27:59 +03:00
kd-11	c072c511a1	rsx: Add support for slice padding rows when gathering slices for cubemap/3d	2019-07-09 16:27:59 +03:00
kd-11	9ca6546dec	vk: When reusing resources, make sure to reinitialize the component layout	2019-07-09 16:27:59 +03:00
kd-11	0cc672dcb3	vk: "Improve" initialization hack - Change default alpha to 1 from 0 - TODO: Implement memory tagging for synchronizing this	2019-07-09 16:27:59 +03:00
kd-11	219a5382f7	rsx: If no array streams are enabled, mark inline array as disabled (null render)	2019-07-09 16:27:59 +03:00
kd-11	7840cd914e	rsx: Fixup nv3089::image_in - Correct pitch when sourcing from temp block - Remove obsolete? double transfer that also introduced a stale pointer reference to freed memory	2019-07-09 16:27:59 +03:00
kd-11	c47f4fd59e	vk: Fix frame skipping	2019-07-09 16:27:59 +03:00
Nekotekina	b9130dd663	Remove redundant const on return value in rsx_methods.h	2019-07-09 12:09:21 +03:00
Eladash	d57b4dc8f3	rsx: Refactor rsx_decode.h and bugfixes	2019-07-09 11:52:34 +03:00
kd-11	50736263d2	gl: Fix native pitch computation	2019-07-08 18:04:56 +03:00
Eladash	6d65d3424f	rsx: Clamp fragment shaders address	2019-07-06 20:58:18 +03:00
kd-11	ad10eb391e	vk: Reuse discarded memory whenever possible instead of recreating new objects - Memory allocations are surprisingly expensive when spammed	2019-07-03 15:52:16 +03:00
kd-11	71e809a78b	rsx: Implement dma abort in case of a reset after misprediction	2019-07-03 15:52:16 +03:00
kd-11	0f11939faf	vk: Refactor gc	2019-07-03 15:52:16 +03:00
kd-11	ae93b417ec	vk: Handle emergency commandbuffer close with dangling queries - TODO: Refactoring	2019-07-03 15:52:16 +03:00
kd-11	d69e8288ad	vk: Restructure commandbuffer submission into tagged event IDs - Tagged eventIDs can be used to safely delete resources that are no longer used - TODO: Expand gc to collect images as well - TODO: Fix the texture cache to avoid over-allocating image resources	2019-07-03 15:52:16 +03:00
kd-11	ce04a797c3	vk: Fix event signal race when speculation fails to avoid a cache miss - TODO: Proper GC for stale events	2019-07-03 15:52:16 +03:00
Malcolm Jestadt	b5d5113803	gl: Workaround slow PBO usage with Mesa -Mesa is currently fastest with GL_STREAM_COPY -See `a338dc0186` -Also see https://bugs.freedesktop.org/show_bug.cgi?id=111043	2019-07-03 11:28:29 +03:00
msuih	690cdff0d3	Minor fixes - Fix a typo in OpenAL - Fix typo in cellHttp.h - Unused variables in catch - Use 64-bit shifts - Use use_count with shared pointers, unique is depracated and getting removed - Explicitly cast boolean to int - Signed/unsigned issues with loop variables - Fix missing return statement (the code path is unreachable, but compiler wants a return) - */ ouside of comment - Fix duplicate layout name	2019-07-01 04:33:23 +03:00
msuih	d57124d075	Explicitly cast size_t to integer types	2019-07-01 04:33:23 +03:00
msuih	146e43b6ec	Do not use negative unsigned literals	2019-07-01 04:33:23 +03:00
Eladash	2bce367488	Fixup for fixup (#6153 ) * Fixup for fixup * Fix memory ordering for MTRSX volatile doesnt block reordering. * ugh	2019-06-30 12:47:42 +03:00
Eladash	43f919c04b	Fixup after #6143 (#6146 ) vm::spu max address was overflowing resulting in issues, so cast to u64 where needed. Fixes #6145. Use vm::get_addr instead of manually substructing vm::base(0) from pointer in texture cache code. Prefer std::atomic_thread_fence over _mm_?fence(), adjust usage to be more correct. Used sequantially consistent ordering in semaphore_release for TSX path as well. Improved memory ordering for sys_rsx_context_iounmap/map. Fixed sync bugs in HLE gcm because of not using atomic instructions. Use release memory barrier in lwsync for PPU LLVM, according to this xbox360 programming guide lwsync is a hw release memory barrier. Also use release barrier where lwsync was originally used in liblv2 sys_lwmutex and cellSync. Use acquire barrier for isync instruction, see https://devblogs.microsoft.com/oldnewthing/20180814-00/?p=99485	2019-06-29 18:48:42 +03:00
Eladash	1ee7b91646	Refactoring (#6143 ) Prefer vm::ptr<>::ptr over vm::get_addr. Prefer vm::_ptr/base over vm::g_base_addr with offset. Added methods atomic_t<>::bts and atomic_t<>::btr . Removed obsolute rsx:🧵:Read/WriteIO32 methods. Removed wrong check in semaphore_release. Added handling for PUTRx commands for RawSPU MFC proxy. Prefer overloaded methods of v128 instead of _mm_... in VPKSHUS ppu interpreter precise. Fixed more potential overflows that may result in wrong behaviour. Added io/size alignment check for sys_rsx_context_iounmap. Added rsx::constants::local_mem_base which represents RSX local memory base address. Removed obsolute rsx:🧵:main_mem_addr/ioSize/ioAddress members.	2019-06-29 01:27:49 +03:00
JohnHolmesII	232a35b6fc	Various small warning fixes -Indentation warnings -prevent shift overflow -This was declared extern in all contexts. Remove this for initialization -Fix main return types. OH CANADA! -Silence extraneos 'unused expression' warning -Force use return value (warning) -Remove tautological compare copy-pasta (char always < 256)	2019-06-28 01:45:29 +03:00
JohnHolmesII	948c1df969	Remove unecessary vulkan loader check var, per kd	2019-06-28 01:45:29 +03:00
JohnHolmesII	a124ec4a26	Remove braces around shader source strings (warnings)	2019-06-28 01:45:29 +03:00
JohnHolmesII	ebb1ae6408	Properly ignore SIMD macros to avoid warning	2019-06-28 01:40:52 +03:00
JohnHolmesII	23094b48bb	Fix warnings related to -Wswitch Add default cases. Move default breaks to newline Add proper handling in some instances. Add missing enums to switches	2019-06-28 01:40:52 +03:00
JohnHolmesII	be521ff0ab	Fix warnings related to parentheses	2019-06-25 20:36:32 -07:00
kd-11	9ce7b8a401	vk: Add LLVM8 warning for RADV drivers	2019-06-25 20:50:54 +03:00
kd-11	009c55fcba	vk: Fix broken layout stream on first draw call	2019-06-25 20:50:54 +03:00
kd-11	4ff77a8555	rsx: Improve balancing of the offloader thread - Use two counters to avoid atomic operations - Yield instead of sleeping because some games are very sensitive to timing	2019-06-25 20:50:54 +03:00
kd-11	8249d51aa8	vk: Optimize occlusion pool management - Do not consume a slot every draw call, instead batch as many draws as possible - Since renderpasses are dispatched per-draw-clause, keeping occlusion queries outside the renderpasses works fine - If renderpasses are reorganized, occlusion tasks will have to be reorganized again	2019-06-25 20:50:54 +03:00
kd-11	1ee675e1f4	facepalm of the year - Typo fix - This check leads to forever relocating memory if size never exceeds capacity!	2019-06-25 20:50:54 +03:00
kd-11	2b9c315374	rsx: Use rpcs3 thread construct for the offloader thread	2019-06-25 20:50:54 +03:00
kd-11	d26b25816d	rsx: Improve profiling setup - Avoid spamming QPC when not needed - Free performance when debug overlay is not enabled	2019-06-25 20:50:54 +03:00
kd-11	b893a75002	rsx: Rework RSX offloading - Use a lockless queue - Do not enqueue small transfers	2019-06-25 20:50:54 +03:00
kd-11	c32c1b0a62	gl: Minor API tweaks - Avoid spamming the driver with samplerParameter calls unless the parameters have actually changed	2019-06-25 20:50:54 +03:00
kd-11	6a32f716db	rsx: Reimplement vertex layout streaming - Remove string comparisons from the hot-path! - Use attribute streaming and push constants to avoid forcing a descriptor block copy every other draw call/pass. While this isn't so bad on nvidia cards, it makes AMD cards a slideshow.	2019-06-25 20:50:54 +03:00
kd-11	59ee74a275	rsx: Disable vertex cache if multithreaded memory access is enabled - When multithreaded RSX is enabled, the vertex cache just lowers performance - The small cost of upload is paid by the asynchronous thread, allowing RSX to work optimally	2019-06-25 20:50:54 +03:00
kd-11	0fa3bcc336	rsx: Asynchronous data transfer	2019-06-25 20:50:54 +03:00
kd-11	358169507c	rsx: Use SSE to accelerate index buffer uploads	2019-06-25 20:50:54 +03:00
kd-11	b645ebdb04	vk: Refactor device management and improve driver detection	2019-06-25 20:50:54 +03:00
kd-11	25bba9bf56	vk: API update - use KHRONOS_validation instead of LUNARG_standard_validation which is deprecated	2019-06-25 20:50:54 +03:00
kd-11	f113cfe5c0	vk: Avoid some useless memory barriers - Do format conversions only when necessary	2019-06-25 20:50:54 +03:00
kd-11	c9501b60ab	rsx: Use explicit fma for MAD emulation	2019-06-25 20:50:54 +03:00
kd-11	6be7c58fa4	glsl: Refactoring, cleanup and optimizations - Avoid generating unused code - Reduce GPR usage in emitted code	2019-06-25 20:50:54 +03:00
Lassi Hämäläinen	c963c51a60	Remove unnecessary header includes - Manually removed lot of unneeded #includes to clean code and reduce compilation time - Reordered some of the #includes to be in more logical order	2019-06-25 17:11:10 +03:00
Lassi Hämäläinen	a070a414a6	Move rsx::constants and rsx::limits to rsx_utils.h	2019-06-25 17:11:10 +03:00
Lassi Hämäläinen	e9e87b8bd9	Add missing #includes to header files - Multiple header files where missing #includes to other headers that where used in the header. Correct header was included in correct order in source files which caused everything to compile. - Added missing #includes so header files correctly include all their dependencies and fixes problems with IDEs being unable to parse headers correctly due to missing symbols	2019-06-25 17:11:10 +03:00
Lassi Hämäläinen	499035512b	Split Emu/Memory into more logical headers - Add vm_locking.h and vm_reservation.h and move relevant functions and types to these headers. - Change include order and make vm_ptr.h, vm_var.h and vm_ref.h headers usable invidually and them including vm.h instead of other way around - Because usage of vm::ptr now requires including vm_ptr.h instead of vm.h updated multiple #includes - Added additional #includes to vm_reservation.h and vm_locking to where vm::reservation_* and locking related functions are used	2019-06-25 17:11:10 +03:00
Eladash	cd0ef99df5	Fix BE endianess arch support in semaphore_406e (#6116 ) Add raw() methods for endianness support types and make use of it.	2019-06-21 19:29:49 +03:00
scribam	185fd3d257	rsx: Minor cleanup after #6055	2019-06-17 00:31:38 +03:00
Megamouse	3f00b485a0	cellMsgDialogAbort: don't call on_close and properly re-enable pads	2019-06-15 00:24:10 +02:00
kd-11	e515d9b83a	vk: Fixup for missing resource reference - Missing ref increment when using framebuffer could lead to use-after-free. How master was not crashing is surprising	2019-06-14 16:19:52 +03:00
kd-11	c90186cf35	vk: Do not use pixel_center_origin as its use is explicitly restricted by spec	2019-06-14 16:19:52 +03:00
kd-11	98156d2a2c	vk: Avoid submitting wrong sample count in overlay passes	2019-06-14 16:19:52 +03:00
kd-11	4104d7a6a1	vk: Simplify WCB heuristics and fix out-of-bounds access	2019-06-14 16:19:52 +03:00
kd-11	86119f58d6	rsx: Typo fix	2019-06-14 16:19:52 +03:00
kd-11	9d166c5bed	rsx: Force invalidate of children by issuing a resolve notification whenever the parent is written to - Fixes successive reads of an antialiased surface that is still bound between reads	2019-06-14 16:19:52 +03:00
kd-11	296e0105c4	vk: Fix WCB for antialiased memory	2019-06-14 16:19:52 +03:00
kd-11	9d0f5aedf3	vk: Add some missing barriers	2019-06-14 16:19:52 +03:00
kd-11	e4671c29a6	rsx: Fix typo - Arguments to the transform function are xxyy not xyxy	2019-06-14 16:19:52 +03:00
kd-11	8a1cf2c913	rsx: Attempt to reduce stencil load overhead for nvidia cards	2019-06-14 16:19:52 +03:00
kd-11	ca82dd7200	vk: Improve overlay passes for resolve/unreolve - Refactor overlays and resolve passes to support use of push constants instead of relying buffer map/unmap - Add support for nvidia resolve (NV is the only vendor not supporting shader_stencil_export)	2019-06-14 16:19:52 +03:00
kd-11	c655036920	rsx/fp: Ease pressure on fragment shaders when emulating clamp16 - TODO: Option to completely skip clamping in some architectures as it is not needed in most games - Mostly affects older GPUs that do not have access to native fp16	2019-06-14 16:19:52 +03:00
kd-11	5f34c0c59a	vk: Clean up WCB readbacks when resource is multisampled - Resolve image first before performing any transfer operations	2019-06-14 16:19:52 +03:00
kd-11	9d314ca4ca	rsx: Correctly count number of valid entries if there are broken entries in the cache	2019-06-14 16:19:52 +03:00
kd-11	bca5f94b3f	rsx: Add option to toggle MSAA	2019-06-14 16:19:52 +03:00
kd-11	ea8409dcfd	rsx: Re-enable optional sample-to-pixel transformation	2019-06-14 16:19:52 +03:00
kd-11	acb14320da	rsx: Fixup for resolution scaling support	2019-06-14 16:19:52 +03:00
kd-11	4a5bbba277	rsx: Enable MSAA - vk: Enable depth buffer resolve+unresolve - vk: Add AMD stenciling extension support - rsx: Temporarily disables MSAA-compatible hacks such as transparency AA - TODO: Add paths to optionally disable MSAA	2019-06-14 16:19:52 +03:00
kd-11	f6f3b40ecc	rsx: Fix AA coordinate transforms - Requires native_pitch value to take samples into account	2019-06-14 16:19:52 +03:00
kd-11	655eff29e8	rsx: Refactoring and cleanup after d3d12 separation - Remove deprecated functionality - Refactor to share code between common routines	2019-06-14 16:19:52 +03:00
kd-11	db5d56a22d	d3d12: Remove all shared code with other backends	2019-06-14 16:19:52 +03:00
kd-11	0d906d6974	rsx: Remove surface aa_mode hacks	2019-06-14 16:19:52 +03:00
scribam	13671d9684	rsx: Apply Clang-Tidy fix "modernize-loop-convert" + const when relevant	2019-06-12 15:11:52 +03:00
scribam	1e327ad31b	rsx: Apply Clang-Tidy fix "readability-avoid-const-params-in-decls"	2019-06-12 15:11:52 +03:00
scribam	370dcd9d6e	rsx: Apply Clang-Tidy fix "readability-simplify-subscript-expr"	2019-06-12 15:11:52 +03:00
scribam	0b97d12a7b	rsx: Apply Clang-Tidy fix "modernize-use-using"	2019-06-12 15:11:52 +03:00
scribam	f1e939936a	rsx: Apply Clang-Tidy fix "modernize-use-override"	2019-06-12 15:11:52 +03:00
scribam	44265aa27d	rsx: Apply Clang-Tidy fix "modernize-use-equals-default"	2019-06-12 15:11:52 +03:00
scribam	635695ac78	rsx: Apply Clang-Tidy fix "modernize-use-emplace"	2019-06-12 15:11:52 +03:00
scribam	a555504142	rsx: Apply Clang-Tidy fix "modernize-deprecated-headers"	2019-06-12 15:11:52 +03:00
scribam	cba828384d	rsx: Apply Clang-Tidy fix "modernize-pass-by-value"	2019-06-12 15:11:52 +03:00
scribam	a02a8642b0	rsx: Apply Clang-Tidy fix "modernize-make-unique"	2019-06-12 15:11:52 +03:00
scribam	b91bcdbbca	rsx: Apply Clang-Tidy fix "modernize-use-bool-literals"	2019-06-12 15:11:52 +03:00
scribam	349e7c8708	rsx: Apply Clang-Tidy fix "readability-non-const-parameter"	2019-06-12 15:11:52 +03:00
scribam	35dc98be06	rsx: Apply Clang-Tidy fix "readability-string-compare"	2019-06-12 15:11:52 +03:00
scribam	ac7e89660f	rsx: Apply Clang-Tidy fix "readability-redundant-smartptr-get"	2019-06-12 15:11:52 +03:00
scribam	801fa0113f	rsx: Apply Clang-Tidy fix "readability-inconsistent-declaration-parameter-name"	2019-06-12 15:11:52 +03:00
scribam	c9b0a4afd0	rsx: Apply Clang-Tidy fix "performance-type-promotion-in-math-fn"	2019-06-12 15:11:52 +03:00
scribam	8f2647555a	rsx: Apply Clang-Tidy fix "readability-redundant-string-init"	2019-06-12 15:11:52 +03:00
scribam	331fe01762	rsx: Apply Clang-Tidy fix "performance-for-range-copy"	2019-06-12 15:11:52 +03:00
scribam	db926ee671	rsx: Apply Clang-Tidy fix "performance-unnecessary-value-param"	2019-06-12 15:11:52 +03:00
scribam	81a3b49c2f	rsx: Apply Clang-Tidy fix "readability-container-size-empty"	2019-06-12 15:11:52 +03:00
scribam	c4667133c4	gl/vk: Add constexpr to varying_registers and sync functions between the two backends	2019-06-12 10:59:31 +01:00
scribam	65581acbf9	rsx: Use constexpr for flattening_helper::m_register_properties	2019-06-12 10:59:31 +01:00
kd-11	d361eedbec	rsx: Clean up window management code - Removes a lot of wm_event code that was used to perform window management and is no longer needed. - Significantly simplifies the vulkan code. - Implements resource management when vulkan window is minimized to allow resources to be freed.	2019-06-10 14:57:03 +03:00
kd-11	57196f0504	vk: Move frame present synchronization to the driver - Just use a semaphore and let the driver handle it instead of manual framepacing. We lose framepace control but drivers have matured in the past few years so it should work fine.	2019-06-10 14:57:03 +03:00
scribam	39fa1d7031	ci/vk: Bump Vulkan version (1.1.73.0/1.1.97.0 => 1.1.106.0) VULKAN_SDK_MIRROR removed as the server is down	2019-06-09 23:43:57 +01:00
scribam	f9ad635856	rsx: TextGlyphs optimizations	2019-06-09 23:09:11 +01:00
Nekotekina	dfd50d0185	Implement std::bit_cast<> Partial implementation of std::bit_cast from C++20. Also fix most strict-aliasing rule break warnings (gcc).	2019-06-02 23:22:16 +03:00
scribam	99c1f87289	vk: Fix memory value in comments to match with the code below	2019-06-01 22:59:23 +03:00
scribam	09c9996f31	Use empty() instead of comparing size() with 0 Recommendation from Clang-Tidy: https://clang.llvm.org/extra/clang-tidy/checks/readability-container-size-empty.html	2019-06-01 22:59:23 +03:00
scribam	bf557ea6e6	Use the more efficient character literal overload for find_first_of/find_last_of Recommendation from Clang-Tidy: https://clang.llvm.org/extra/clang-tidy/checks/performance-faster-string-find.html	2019-06-01 22:59:23 +03:00
scribam	78c7ef3039	rsx: Use clear() instead of resize(0) The result is the same but clear [1] has slightly less code than resize [2] and signals better the intent IMHO. [1] `fb7fb646fa/libstdc%2B%2B-v3/include/bits/stl_vector.h (L1495)` [2] `fb7fb646fa/libstdc%2B%2B-v3/include/bits/stl_vector.h (L934)`	2019-06-01 22:59:23 +03:00
scribam	d9d7634f8b	vk: remove duplicate condition in pipeline_props struct equal operator	2019-06-01 00:01:32 +03:00
msuih	ef587d4cdc	Limit shaderlog writing behind log_programs setting	2019-05-31 19:49:32 +03:00
kd-11	f2cac26154	rsx: Refactor out GLSLTypes from GLSLCommon to avoid warning spam due to unused functions when included in settings dialog code	2019-05-31 13:27:43 +03:00
kd-11	6e92516070	vk: Do not reset descriptors from the aux buffer when things are running slow - The aux buffer borrows its descriptors from the lagging frame, so they are still in use until the frame completes.	2019-05-31 13:27:43 +03:00
kd-11	e118c9e5da	update glslang	2019-05-30 11:48:38 +03:00
Megamouse	34964e0e4f	handle some warnings	2019-05-28 21:47:49 +02:00
kd-11	d9ab2c7104	vk: Bump shaders cache version - Pipeline properties changed with the renderpass update	2019-05-28 15:28:30 +03:00
kd-11	57eb892153	vk: Refactor framebuffers - Refactor out framebuffers from the renderer core - Use a proper cache with sorted queues for faster searching	2019-05-28 15:28:30 +03:00
kd-11	507ec8252b	vk: Refactor renderpass management - Ensures the current renderpass matches the image properties even when a cyclic reference is detected - Solves SDK debug output error spam due to mismatching layouts and renderpasses	2019-05-25 14:07:29 +03:00
Malcolm Jestadt	c348fec84b	Warn AMD linux users about potential performance loss if not using RADV	2019-05-24 17:16:29 +03:00
Malcolm Jestadt	6ab3011eef	vk: Check_window_status fixups Intel ANV has been tested and verified to work without workaround AMDVLK and the proprietary AMD driver have been confirmed to require workaround for window resizing	2019-05-24 17:16:29 +03:00
kd-11	370b9e196d	vk: Improve descriptor pool management - Add double-buffered descriptor pools to avoid use-after-free situations - Make descriptor pools more configurable - Also adds in a hack to allow renderdoc to capture properly	2019-05-22 01:18:46 +03:00
kd-11	46ba53f122	vk: Propagate more information to the driver - Pass "correct" layout to descriptors - TODO: Fix renderpass attachment descriptors which are inadvertently doing silent transitions	2019-05-22 01:18:46 +03:00
kd-11	c3b234f972	gl: Fix staging buffer size calculation	2019-05-22 01:18:46 +03:00
Malcolm	9a26c0abda	Overlays: Fix timing	2019-05-21 13:01:38 +03:00
Nekotekina	9abb303569	vm: expand reservation lock bit area to 7 bit This is minor change.	2019-05-19 17:46:55 +03:00
kd-11	8009e53642	rsx: Fix upload block range optimization - The 'max' index should take the first assigned ID; fixes problems with divisors	2019-05-19 17:33:21 +03:00
kd-11	a245d9fb24	vk: DOuble general-purpose heap allocation to 128M and add a better diagnostic message for OOM	2019-05-19 17:33:21 +03:00
kd-11	0ef7b2aaff	rsx: Use a saner model for swap queue handling - Use a simple queue to avoid redundant checks over all the contexts - Poll queue if RSX pipe is idle - Only check the queue when the frame context is dirty (after a queue operation) - Reset descriptors at the start of the frame context to avoid having to synchronize mid-frame - Fully synchronize if a descriptor reset is required mid-frame (spec compliance; also fixes flickering verts on some hardware)	2019-05-19 17:33:21 +03:00
kd-11	dc749d3975	vk: Bump max number of allocated draw calls from 4k to 16k	2019-05-19 17:33:21 +03:00
kd-11	e3f68c66d8	rsx: Use a shared sampler pool instead of relying on the drivers	2019-05-17 22:51:40 +03:00
Megamouse	edb1a32bb1	overlays: use L1 and R1 to step by 10 in the save data list	2019-05-17 20:21:23 +02:00
Megamouse	32bdd8ef7b	overlays: move some code to cpp files	2019-05-17 20:21:23 +02:00
kd-11	4037225e98	vk: Workaround for cyclic feedback loops - Transition attachments to LAYOUT_GENERAL in case of a feedback loop - Fixes appearance of garbage along polygon edges in some post-processing passes. - Also reverse this transition when rendering goes back to normal	2019-05-17 16:41:17 +03:00
kd-11	cb78522620	rsx: Fixup for uninitialized surface antialiasing mode	2019-05-16 19:25:26 +03:00
kd-11	45a13d0319	rsx: Fixup for lost aliased surfaces - Intersection routines were changed and require explicit identification of the "old surface"	2019-05-16 19:25:26 +03:00
kd-11	05eb1e9193	rsx: Fix zombie image references from inside the texture cache - Do not add locked orphans to the flush_always cache! They will not remove their cache entries as they are not bound	2019-05-16 19:25:26 +03:00
kd-11	214bb3ec87	rsx: Always initialize memory unless it is guaranteed to be wiped	2019-05-16 19:25:26 +03:00
kd-11	88290d9fab	rsx: Hack around using data regions as transfer targets	2019-05-16 19:25:26 +03:00
kd-11	4182f9984d	rsx: Propagate split section information back to the texture cache	2019-05-16 19:25:26 +03:00
kd-11	3c7d8a1099	rsx: Minor texture/surface scanning optimization - Also re-enable optimization in blit engine accidentally disabled during debugging	2019-05-16 19:25:26 +03:00
kd-11	9f0090772a	rsx: Fix write tagging when comments are transferred in by blit engine	2019-05-16 19:25:26 +03:00
kd-11	4b443be881	rsx: Fix self-intersection with previous occupant of the address being replaced	2019-05-16 19:25:26 +03:00
kd-11	b840f6da28	[WIP] rsx: Use a sane reference counting model	2019-05-16 19:25:26 +03:00
kd-11	e3cf3ab6b8	rsx: Minor fixes - Fix transfer scaling (inverted) - Fix under-estimated typeless acquisition when doing depth format scaling	2019-05-16 19:25:26 +03:00
kd-11	e02e27b2b3	rsx: Prevent out-of-bounds writes when resolving shader input textures - The target area can also have padding!	2019-05-16 19:25:26 +03:00
kd-11	1c439f6198	vk: Fix some spec violations	2019-05-16 19:25:26 +03:00
kd-11	88c20afd3a	rsx: Implement unaligned surface inheritance with hierachial contribution - Allows render targets to behave like stacked 3D views same as shader inputs are resolved - Basically implements most of 'Read Color/Depth Buffers" option for 'free'. - Allows splitting RTV/DSV resources if they are superceded by a partial surface - Also allows intersecting new resources through the surface cache for proper inheritance from other scattered data - TODO: Refactor bind_surface_as_rtt and bind_surface_as_ds to reduce asinine code duplication	2019-05-16 19:25:26 +03:00
scribam	22f61caf9f	GLTexture: add missing #pragma once directive	2019-05-12 18:32:11 +03:00
scribam	6c5ea068c9	Remove redundant semicolons Fix "-Wextra-semi" warnings	2019-05-12 18:32:11 +03:00
scribam	3623f4343f	gl/vk: clear scissor_setup_invalid bit along with scissor_config_state_dirty bit	2019-05-11 13:13:49 +03:00
eladash	7ead021aa7	rsx: Fix 3d swizzled texture to linear conversation	2019-05-08 23:48:39 +03:00
Megamouse	5141590729	overlays: add separate timestamp for the start of the d-pad interval	2019-05-06 22:00:40 +02:00
Malcolm Jestadt	fd2bc95a7b	overlays: Double dpad repeat rate	2019-05-06 22:00:40 +02:00
kd-11	9c346c92f3	gl: undo an accidental deletion	2019-05-05 13:37:55 +03:00
kd-11	2bec304cca	vk: Allow some drivers to bypass window polling if not needed	2019-05-05 13:37:55 +03:00
kd-11	6b7cd458e3	rsx: Silence some diagnostics unless compiled with debugging options	2019-05-01 15:36:21 +03:00
kd-11	1d5c52f476	rsx: Ignore stencil clear flag if the stencil write mask is disabled	2019-05-01 15:36:21 +03:00
kd-11	48cb265c2c	rsx: Bounds check on local resource for atlas merge. - Local resources can also have padded pitch dimensions and false-positives on range overlap tests	2019-05-01 15:36:21 +03:00
kd-11	63f9b8e0c6	gl/vk: Minor cleanup	2019-05-01 15:36:21 +03:00
kd-11	ec9aa74008	rsx: Fix section base offset calculation for blit_dst targets which affects confirmed memory range - Fixes flushes only writing partially to target memory	2019-05-01 15:36:21 +03:00
kd-11	4e3ec162e2	rsx: Fix broken texture cache search when flipping	2019-05-01 15:36:21 +03:00
kd-11	6feffe6ff6	rsx: Ignore transfer offsets when wrapping behaviour is expected	2019-05-01 15:36:21 +03:00
kd-11	f56a6548b0	gl: Remove workaround for AMD driver bug fixed in driver 19.4.3	2019-05-01 15:36:21 +03:00
kd-11	243df38360	rsx: Fix VP writes to CC with a MOV instruction - When moving to CC, the operation has VEC flag disabled and also temp regs disabled. Looks to be the catch-all ELSE in the selection logic.	2019-04-25 16:23:05 +03:00
kd-11	3cbccdd760	rsx: Fragment shader decompiler cleanup TODO: Investigate the _s input modifier behaviour further, in case it can avoid generating zeroes from a MAD instruction. x = MAD(+ve, -ve, -ve) with _s input modifier in BFBC expects result to be Non-zero	2019-04-25 16:23:05 +03:00
kd-11	4cd1c25729	"rsx: Ignore argument sign for SQRT operations"	2019-04-25 16:23:05 +03:00
kd-11	32396ba366	rsx: Simplify use of some mixed input functions using OPFLAGS to avoid implicit conversions	2019-04-25 16:23:05 +03:00
kd-11	f12bd8068c	rsx: Fragment decompiler fixups - Properly test for NaN and Inf when clamping down to fp16 - Optimize divsq a bit; mix(vec, vec, bvec) emits OpSelect which is what we want here, instead of component-wise selection which is much slower.	2019-04-25 16:23:05 +03:00
kd-11	abe7188acf	rsx: Proper workaround for broken DIVSQ instruction on realhw - While mul(0, nan) = nan and 0 / 0 = nan, 0 / sqrt(0) = 0 because of hw gremlins. normalize(0) is also nan so this behaviour does not work around that particular case either which makes it even more baffling.	2019-04-25 16:23:05 +03:00
kd-11	60f3059d22	rsx: Compensate for nvidia's low precision attribute interpolation - The hw generates inaccurate values when doing perspective-correct interpolation of vertex output attributes and makes the comparison (a == b) fail even when they are a fixed constant value. - Increase equality tolerance when doing comparisons in fragment shaders for NV cards only to work around this issue. - Teepo fix	2019-04-25 16:23:05 +03:00
kd-11	463b1b220d	rsx: Improve accuracy of shadow compare Ops when non-integer depth formats are used - The fixed-point D24S8 format does special Z clamping during compare which matches PS3 behaviour - D32S8 is a floating point format and comparison with Dref > 1 always fails causing black edges/borders	2019-04-25 16:23:05 +03:00
kd-11	7ad1646c2c	vk: Skip feature check if extension is not supported	2019-04-25 16:23:05 +03:00
kd-11	06a85f00d1	rsx: Shader decompiler cleanup and improvements - Improve support for float16_t by minimizing mixed inputs to functions (ambiguous overloads) - Minimize amount of downcasts in code by using opcode flags - Re-enable float16_t support for vulkan	2019-04-25 16:23:05 +03:00
kd-11	a668560c68	rsx: Use native half float types if available - Emulating f16 with f32 is not ideal and requires a lot of value clamping - Using native data type can significantly improve performance and accuracy - With openGL, check for the compatible extensions NV_gpu_shader5 and AMD_gpu_shader_half_float - With Vulkan, enable this functionality in the deviceFeatures if applicable. (VK_KHR_shader_float16_int8 extension) - Temporarily disable hw fp16 for vulkan	2019-04-25 16:23:05 +03:00
kd-11	ee319f7c13	rsx: Implement strict clamp16 operation needed for NVIDIA cards	2019-04-25 16:23:05 +03:00
eladash	6f76e34104	rsx: Fix race on clearing native_ui vs emu_requested flag	2019-04-20 01:04:41 +03:00
eladash	888cb9d673	Remove reader_lock executed in every instruction by RSX Use optimistic double check instead, use one load instruction for the check to be atomic + Read emu status once every FIFO iteration	2019-04-20 01:04:41 +03:00
eladash	f25587d24c	rsx: Write vblank semahpre, minor semaphore acquire optimization	2019-04-20 01:04:41 +03:00
Megamouse	b929c13c45	implement get_firmware_version add firmware version to the first line in the log	2019-04-16 22:13:28 +02:00
kd-11	df3b46a611	rsx: Improve texture sourcing and clipping when reverse scanning is enabled - When reverse scanning, offsets are inverted and offset value of 0 is logically equivalent to an offset of -1 - Add an explicit message if clipping happens to avoid silent errors/bugs	2019-04-12 15:36:21 +03:00
kd-11	12dc3c1872	vk: Dynamic heap management to potentially fix ring buffer overflows - Allows checking one heap type at a time, on demand - Should avoid OOM situations unless inside an uninterruptible block	2019-04-09 13:40:54 +03:00
kd-11	a4495c35b7	rsx: Fixups for swizzled texture scanning - Revert to using block metrics, but with optional per-channel decode stage for the final transfer. Much cleaner than hacking in the width to be in channels instead of blocks.	2019-04-09 13:40:54 +03:00
kd-11	a5ed30a8c0	rsx: Fixups for data cast operations via typeless transfer	2019-04-09 13:40:54 +03:00
kd-11	f04a0a2bb6	rsx: Remove some old restrictions affecting memory persistence	2019-04-09 13:40:54 +03:00
kd-11	0a604e39f1	rsx: Implement RGB655 decode	2019-04-09 13:40:54 +03:00
kd-11	cc3809fbfe	gl: Register a few more missing formats for conversion	2019-04-09 13:40:54 +03:00
kd-11	e4e86455f2	rsx: Fix temporary subresource caching behaviour - Do not cache if a gathered subresource contains a bound RTT - Change op to dynamic copy if parent is still bound	2019-04-09 13:40:54 +03:00
kd-11	3249000511	rsx: Improvements to texture scanning - Removes CPU-only transforms that broke GPU-side code. -- Channels in GPU compute are laid out in cell-order, but CPU was uploading in favorable order and compensating with swizzles. -- This leads to 2 different layouts depending on the location of the data (CPU vs GPU) - Implement R8G8_R8B8 interleaved format decode - General improvements	2019-04-09 13:40:54 +03:00
kd-11	0f7af391d7	vk: Implement copy-to-buffer and copy-from-buffer for depth_stencil formats - Allows D24S8 and D32S8 transport via typeless channels - Allows uploading and downloading D24S8 data easily - TODO: Implement optional byteswapping to fix flushed readbacks with the same method	2019-04-09 13:40:54 +03:00
kd-11	366e4c2422	rsx: Preliminary support for format conversions using typeless resolve	2019-04-09 13:40:54 +03:00
kd-11	b7470cfc1a	rsx: Tighten format checks in cache hit tests	2019-04-09 13:40:54 +03:00
kd-11	443fde760f	rsx: Blit engine clipping fixes - Do not round up sub-pixel offsets, round down instead - Do not allow incomplete sources for hw blit transfer - Reimplement src clipping (slice_h) - Check 'area' of incoming texels and correct for them before RTT lookup/transfer - Filter out incomplete targets when performing RTT lookup (1 texel or less contribution)	2019-04-09 13:40:54 +03:00
eladash	8185ef7610	rsx: Improve vblank accuracy	2019-03-31 14:57:21 +03:00
eladash	801e6114b6	rsx: Use relaxed store on fifo ctrl registers	2019-03-31 14:57:21 +03:00
kd-11	41b87cf577	rsx: Blit engine fixes - If a transfer writes to a RTT and depth mismatch happens, create a local target and the upload function will likely resolve between the two - If a surface is rejected, reset the target region!	2019-03-22 21:27:15 +03:00
kd-11	86ad204636	rsx: Rebase output region when using upload-fallback path	2019-03-22 21:27:15 +03:00
kd-11	dbc8e70ddd	rsx: Silence some compiler noise	2019-03-22 21:27:15 +03:00
kd-11	3a4e3fa53a	rsx: Fix use-after-modify condition when inserting a draw command out of order - Fixes barrier->range rebase after the insert	2019-03-22 21:27:15 +03:00
kd-11	d731c07ade	vk: Fix typeless resource management - Fixes bugs that appear with high resolution scaling	2019-03-22 21:27:15 +03:00
kd-11	adc59f9810	rsx: Fix blit transfers when texel sizes mismatch - Also refactors some bpp handling code - Simplify texture intersection test to use a normalized/uniform coordinate space - Fix broken bounds checking as well	2019-03-22 21:27:15 +03:00
kd-11	b879b32271	rsx: Fix bpp calculation taking resolution scaling into account - Do not rely on image->width(), use surface_width() instead for unscaled values - Refactor/clean GL rendertarget class a bit	2019-03-20 10:05:54 +03:00
kd-11	03fca73cf4	rsx: Fix blit intersection falling outside the available texture - Just becaue we have a hit inside the tile of interest does not guarantee that it sits inside the texture!	2019-03-20 10:05:54 +03:00
kd-11	3ef16bee47	rsx: Fix texture lookups and avoid out-of-bounds copies/transfers	2019-03-17 21:50:11 +03:00
kd-11	bb65e45614	rsx: Implement GPU acceleration for rotated images	2019-03-17 21:50:11 +03:00
kd-11	5260f4b47d	rsx: Improvements to memory flush mechanism - Batch dma transfers whenever possible and do them in one go - vk: Always ensure that queued dma transfers are visible to the GPU before they are needed by the host Requires a little refactoring to allow proper communication of the commandbuffer state - vk: Code cleanup, the simplified mechanism makes it so that its not necessary to pass tons of args to methods - vk: Fixup - do not forcefully do dma transfers on sections in an invalidation zone! They may have been speculated correctly already	2019-03-17 21:50:11 +03:00
kd-11	385485204b	vk/gl: Omit unlocked data when grabbing flip sources from texture cache	2019-03-17 21:50:11 +03:00
kd-11	74eeacd091	vk/gl: Improve memory tag sync and test - Properly pass parameters such as rsx-pitch to the surface store - Do not crash if a surface fails verification in flip, use fall-back instead	2019-03-17 21:50:11 +03:00
kd-11	1a44446250	rsx: Fix dst upload block region - The section needed starts at image origin, not transfer origin!	2019-03-17 21:50:11 +03:00
kd-11	a49a0f2a86	vk/gl: Synchronization improvements - Properly wait for the buffer transfer operation to finish before map/readback! - Change vkFence to vkEvent which works more like a GL fence which is what is needed. - Implement supporting methods and functions - Do not destroy fence by immediately waiting after copying to dma buffer	2019-03-17 21:50:11 +03:00
kd-11	85cb703633	rsx/cache: Debugging bugs introduced by the atlas coverage check - Figured out why it breaks things, ofc can't actually check for coverage when there is no proper fbo data persistence	2019-03-17 21:50:11 +03:00
kd-11	3a4083263e	rsx: Fix texture transfer when pitch does not match exactly	2019-03-17 21:50:11 +03:00
kd-11	612160a8ff	rsx: Fix zero-pitch textures - Assumption here is that only texel (0, 0) is accessible. Inline with other pitch 0 operations. - TODO: Verify pitch 0 does not advance in Y either	2019-03-17 21:50:11 +03:00
kd-11	17c49d21a5	rsx/blit: Remove workarounds/hacks added for master. Start implementation/stubs for blit engine rotations in GPU	2019-03-17 21:50:11 +03:00
kd-11	745f8f9627	rsx: Remove pointless assert	2019-03-17 21:50:11 +03:00
kd-11	1875dc3f18	gl: Fix buffer size calculations	2019-03-10 16:09:05 +03:00
kd-11	358558aaa7	cleanup and fixups	2019-03-10 16:09:05 +03:00
kd-11	04dda44225	rsx: Properly generate render target data with all parameters provided - Build-up to variable-sized framebuffers and AA implementation - Also allows accurate range calculation for our hit testing	2019-03-10 16:09:05 +03:00
kd-11	21bc6c7a87	rsx: Properly resolve data for upload when needed. - Avoids blindly reusing blit dst sections as they may contain garbage. If a section was unlocked for a flush, just discard it as its reuse introduces potential data corruption. Since the data needs to be reuploaded anyway (for now), its better to start afresh - In case of format mismatch, reset the calculated dst block - Add a bounds check to determine if data contained in an atlas is good enough for sampling the cache. If not enough data is provided, fall back to full upload	2019-03-10 16:09:05 +03:00
kd-11	9d4d3d9443	rsx: Reimplement render target intersection tests when using hw accelerated blit engine - Properly collapse memory tree when scanning in case of overlaps!	2019-03-10 16:09:05 +03:00
kd-11	f4ebcb0029	rsx: Properly decode packed renders from the type flag - Seems to occupy bits [8-9]	2019-03-10 16:09:05 +03:00
kd-11	7c379432dd	rsx: Implement proper pitch compatibility lookup - When a single row is required or is all that is available, pitch has no meaning as the coordinate space changed to 1D	2019-03-10 16:09:05 +03:00
kd-11	dccb4a4888	rsx/texture_cache: fixes to commit_framebuffer_memory	2019-03-10 16:09:05 +03:00
kd-11	b9e7b085fe	rsx/texture_cache: Fixups for local resource hit and fast-path added	2019-03-10 16:09:05 +03:00
kd-11	a80f1a6ed4	gl: Fix memory tag sampling - Also fixes a bad arg passed to glClearBuffer	2019-03-10 16:09:05 +03:00
kd-11	0395fb9955	rsx/tecture_cache: Addendum - fix data cast with scaling conversion (AA emulation) - Blit operations do format conversion automatically which is NOT what we want! - Scale onto temp buffer with similar format before performing data cast.	2019-03-10 16:09:05 +03:00
kd-11	10dc3dadee	rsx/texture_cache: Improve framebuffer memory locking when WCB/WDB is not enabled - Adds a new mode that removes non-framebuffer stuff inside framebuffer range	2019-03-10 16:09:05 +03:00
kd-11	563e205a72	rsx/texture_cache: Fix 'AA' scaling hack and restore collection template selection	2019-03-10 16:09:05 +03:00
kd-11	fa628f0ac4	rsx/surface_store: More aggressive tag sampling - Use a 5-point tap with an X pattern across the target's memory space to reduce chances of false positives - TODO: Potential false positives identified, requires some minor restructuring of surface_store	2019-03-10 16:09:05 +03:00
kd-11	3a071a9c07	rsx: Texture search rewrite - Perform a full search across all resource types as needed without taking too many shortcuts/hacks	2019-03-10 16:09:05 +03:00
kd-11	6ef9dcd62e	rsx: Handle mismatched/invalidated framebuffer sections when WCB is enabled	2019-03-10 16:09:05 +03:00
kd-11	ef071ebb6b	rsx: Synchronize surface cache and texture cache data - TODO: The whole upload_texture thing is a big hack, fix it properly	2019-03-10 16:09:05 +03:00
elad	bd259c8ae4	vulkan zcull: Fix deadlock in zcull flush waiting Block adding additional flush requests until the first ones are treated (by adding missing lock)	2019-03-08 23:44:46 +03:00
elad	fc253165e2	Correctness fix for RSXIOMem - Make RSXIOMem volatile. - Hint the compiler to check only once the address returned.	2019-03-08 23:44:46 +03:00
elad	ce8c92262d	Treat X8R8G8B8 format as A8R8G8B8 in image_in, Fixes #5510	2019-03-08 23:44:46 +03:00
eladash	d82362fa1d	Use sys_memory_allocate on rsx replayer to fix it	2019-03-05 21:23:24 +03:00
German	4c72f7c1de	Fix clear string container in CgBinaryFragmentProgram.cpp	2019-02-18 16:34:16 +03:00
Megamouse	17a5e0bc98	cellGame: add error_code	2019-02-12 21:06:10 +03:00
kd-11	19ff95da70	vk: Fix usage of VK_IMAGE_LAYOUT_GENERAL - Properly synchronize when transitioning to/from GENERAL layout. - General layout requires full pipeline dependency since its used in a 'general' sense. As such, its use is to be largely avoided.	2019-02-07 11:40:17 +03:00
kd-11	38887bc03e	gl/vk: Improvements to overlay rendering - gl: Properly initialize and manage sampler states - gl/vk: Snap overlay elements to pixel grid by aligning to pixel centers - overlays: Disable grid snapping in stb since its now handled in the backend	2019-02-05 12:15:12 +03:00
kd-11	4c593959fd	overlays/save_dialog: Layout improvements - Make detail a separate text entity as it often contains a lot of noise - Properly pad the entry if needed to avoid text sitting too close to the edge	2019-02-03 22:26:46 +03:00
kd-11	67cdec577f	overlays/util: Add support for glyph set lowering when mapping utf8 to ascii8 - Lower fullwidth glyphs to halfwidth counterparts - Lower CJK punctuation glyphs - Lower general punctuation glyphs	2019-02-03 22:26:46 +03:00
kd-11	a36d3af3b4	vk: Minor frame management improvements	2019-02-02 11:54:01 +03:00
kd-11	27af05da1a	osk: Fixup attempt for hang in close callback where a sysutil_callback fails to fire.	2019-02-02 11:54:01 +03:00
kd-11	b36cb66129	overlays: Allow use of extended ascii8 - Use custom string conversion to ensure overlay deals with extended ascii whenever possible - Improves language compatibility greatly and avoids empty spaces for unknown glyphs	2019-02-02 11:54:01 +03:00
kd-11	12990f3ca3	overlays/util: Strip extended codes from utf-16 encoded strings	2019-02-02 11:54:01 +03:00
kd-11	9e39e2d2c4	gl/vk: Fix clip region scaling for overlay elements	2019-02-02 11:54:01 +03:00
kd-11	3653c2eb0d	overlays/osk: Add support for edit text control and disabled cells - Allows to disable cells from being selectable. - Edit text control adds proper support for multiline and a functioning caret	2019-02-02 11:54:01 +03:00
kd-11	faf5221b0d	overlays: Implement edit_text control	2019-02-02 11:54:01 +03:00
kd-11	c434e0ce27	overlays/osk: Add more buttons to native dialog and other improvements - Adds all the major buttons to native dialog input options - Adds more button options for the native osk - Brighten osk cell backgrounds a bit to improve visibility	2019-02-02 11:54:01 +03:00
kd-11	9ed9d7e947	overlays/osk: Implement native osk interface	2019-02-02 11:54:01 +03:00
kd-11	9d4b19b97a	vk: Increase number of draw calls per frame for overlays to 1024 - Allows for more complex interface design	2019-02-02 11:54:01 +03:00
kd-11	f47d3a761b	vk: Hotfix for fullscreen not working on non-windows platforms	2019-02-01 00:22:11 +03:00
kd-11	09a8f7ae53	vk: Use FIFO mode for vsync - Avoids tearing and also hides some driver bugs causing fullscreen bugs with mailbox mode	2019-01-31 21:53:02 +03:00
kd-11	3bfa564ef8	vk/windows: Try to keep msq thread from ever stopping - NVIDIA drivers hook into the msq before our nativeEvent handler. This means NV is aware of events before rpcs3 is aware of them and sometimes stops until a new event is triggered. If rpcs3 is inside a driver call at this time, the system will deadlock since the driver waits for msq which waits for the renderer which waits for the driver. - Use explicit hook management to control window events - Add fence timeout to attempt detection of surface loss events	2019-01-31 21:53:02 +03:00
eladash	6f770c8e35	Fix potential crash in begin_occlusion_query() while closing the Emu	2019-01-30 18:44:29 +03:00
kd-11	660bfeabae	gl: Fixup - inline arrays	2019-01-25 14:34:22 +03:00
kd-11	fa9b448686	vk: Spec fixups - Disable DEPTH<->RGBA typeless transfers for now as they require a lot more work to work for all vendors - Do not allow switching layouts to UNDEFINED/PREINITIALIZED formats	2019-01-25 14:34:22 +03:00
kd-11	2163a59649	rsx: Typo fix	2019-01-25 14:34:22 +03:00
kd-11	521969bcc3	gl: Remove GL_R 'format'. There is no GL_R format, it part of the S-T-Q-R enums for texture coordinate space	2019-01-25 14:34:22 +03:00
kd-11	5a4bea8c4f	gl: Blit fixup - Typo fix. I meant to disable scissor test, not stencil test - Also clean up and simplify/optimize the core logic	2019-01-25 14:34:22 +03:00
kd-11	7e33cdcb08	rsx: simple_array<T> improvements - Implement move and copy ctors	2019-01-25 14:34:22 +03:00
kd-11	fb778e4821	rsx: Reimplement attrib divisor	2019-01-25 14:34:22 +03:00
kd-11	736415fcd9	rsx/fp: Detect broken/NOP shaders automatically - Do not compile body if the shader is of no consequence, leave as a passthrough shader	2019-01-25 14:34:22 +03:00
kd-11	6fdc0fd7f0	rsx: Reimplement MSAA transparency - Apply dither to edges that almost fail the straight-up alpha test - Significantly improves alpha tested geometry far from the camera - Also removes blend factor overrides/hacks as they give incorrect results due to background bleeding	2019-01-25 14:34:22 +03:00
kd-11	10a17feda2	rsx: Avoid potential deadlock in FIFO_ctrl	2019-01-25 14:34:22 +03:00
kd-11	7eec702c6d	gl: Fix silly regression with blit dst resource readback	2019-01-25 14:34:22 +03:00
kd-11	8093c9b573	rsx: Disable rtt side-effects when async compilation is ongoing. Only real renders should promote buffer state from underined to drawn, otherwise keep previous contents intact.	2019-01-25 14:34:22 +03:00
kd-11	417a2e6731	rsx: Refactor index buffers - Index offset is ignored anyway and only used to calculate vertex attribute divisor index - Specialized optimization for untouched xfer without primitive restart	2019-01-25 14:34:22 +03:00
Megamouse	8d5d44141e	rsx/Qt: fix some undefined behavior in progress_dialog CallAfters	2019-01-22 12:04:01 +03:00
eladash	688d5a9919	rsx: Fix unknown vertex base types Clamp vertex type field into 3-bits instead of 4-bit value Case 0 is UB256	2019-01-21 22:28:20 +03:00
Nekotekina	59e0296281	cellMsgDialog: fix error spam on CELL_OK	2019-01-18 16:49:17 +03:00
Nekotekina	a419e98acb	Move PPU and shader cache New hash-based location (already used for SPU) Bump PPU cache version, improve naming and decrease size Remove fs::get_data_dir Disable boot.elf cache	2019-01-14 01:24:05 +03:00
Nekotekina	bd9131ae1c	Implement fs::get_cache_dir Win32: equal to config dir for now Linux: respect XDG_CACHE_HOME if specified OSX: possibly incomplete	2019-01-13 14:45:36 +03:00
eladash	bc27f5f75c	Implement invalid NV4097_NOTIFY context handling	2019-01-13 12:59:00 +03:00
Megamouse	d9d5f45e9e	rsx/input: fix rsx replay	2019-01-10 13:05:48 +01:00
kd-11	52ac0a901a	rsx: improve memory coherency - Avoid tagging and rely on read/write barriers and the dirty flag mechanism. Testing is done with a weak 8-byte memory test - Introducing new data when tagging breaks applications with race conditions where tags can overwrite flushed data	2019-01-06 10:44:40 +03:00
kd-11	89c9c54743	rsx: Minor hot-fix - Pitch 0 makes sense if width == 1 and height == 1	2019-01-06 10:44:40 +03:00
kd-11	95245bdd83	rsx: Improve ARGB8->D24S8 casting - Set up partial transfers - Force clear of target before starting the transfer	2019-01-06 10:44:40 +03:00
kd-11	475cc99117	rsx: Fix dirty flag reset after a partial attachment initialization - D24S8 targets have 2 aspects that are dealt with separately; Forcefully initialize the remaining data if a partial init is done. Its 'free' anyway - It seems that the stencil mask matters when clearing unlike the depth mask and color mask	2019-01-06 10:44:40 +03:00
kd-11	c80c7f06bb	rsx: Typo fix - This silly typo broke the flip improvements in the GT fixes PR	2019-01-06 10:44:40 +03:00
kd-11	2a62fa892b	rsx: Texture cache refactor - gl: Include an execution state wrapper to ensure state changes are consistent. Also removes a lot of required 'cleanup' for helper methods - texture_cache: Make execition context a mandatory field as it is required for all operations. Also removes a lot of situations where duplicate argument is added in for both fixed and vararg fields - Explicit read/write barrier for framebuffer resources depending on usage. Allows for operations like optional memory initialization before reading	2019-01-06 10:44:40 +03:00
kd-11	0f64583c7a	rsx: Reimplement pitch lookup - Remove the required_xxx_pitch constraint as it makes no sense. The pitch controls what can be written per line. - It is possible to have a huge surface width but only render to a small region at the beginning and have a smaller pitch than can fit the surface (NFS carbon)	2019-01-06 10:44:40 +03:00
kd-11	1ffadbe086	rsx: Reorganize write barrier implementation (either clear or memory barrier)	2019-01-06 10:44:40 +03:00
kd-11	9c45ce6d37	vk: Reimplement typeless memory allocation to handle resolution upscaling	2019-01-06 10:44:40 +03:00
kd-11	d8e45c86e6	vk: Remove old useless hack that interferes with memory inheritance	2019-01-06 10:44:40 +03:00
kd-11	3be4b474d9	rsx: Handle rsx-self-tripping in draw call and triggering invalid invalidation - If draw call resources consume memory that intersects with NA parts of the texture cache, we get a framebuffer test mismatch. This mismatch is false and happens because the thread has not yet reached the point of relocking the pages	2019-01-06 10:44:40 +03:00
kd-11	a95a44cf66	rsx: Strictness cleanups - Also account for variable pitch textures (swizzled scan)	2019-01-06 10:44:40 +03:00
kd-11	474d0f61a2	minor typo fix	2019-01-06 10:44:40 +03:00
kd-11	362eea09a1	whitespace fix only	2019-01-06 10:44:40 +03:00
kd-11	15d5507154	rsx: Rewrite memory inheritance transfers - Implicitly invoke a memory barrier if actively reading from an unsynchronized texture - Simplify memory transfer operations - Should allow more games to work without strict mode	2019-01-06 10:44:40 +03:00
kd-11	97704d1396	rsx: Fix texture size calculations	2019-01-06 10:44:40 +03:00
kd-11	50c07833e4	rsx: Do not force upload for missing data - TODO: Finish implementing GPU RCB for mem-sync - TODO: Refactor mem-sync	2019-01-06 10:44:40 +03:00
kd-11	6d932b042b	vk: bump max number of compute jobs from 120 to 1024 - It is possible without bugs to have a very high number of compute invocations.	2019-01-06 10:44:40 +03:00
kd-11	64a8829614	rsx: Minor cleanup	2019-01-06 10:44:40 +03:00
kd-11	15488eb247	rsx: Avoid unnecessarily touching framebuffer memory - Do not bind companion framebuffer when clearing single aspect; let the contest mechanism sort it out instead - Do not prematurely tag framebuffers, instead only do so at write-confirmation time. Should avoid false tagging if setup does not allow a render to occur.	2019-01-06 10:44:40 +03:00
kd-11	a13986ec5c	vk: Spec fixups - Forgot to update descriptor pool init sizes over time - Also clamp swapchain resources to allowable surface extents	2019-01-05 21:31:12 +03:00
Megamouse	bb464b0b64	fix some warnings	2019-01-05 04:03:18 +01:00
Megamouse	6f7b25de90	implement CELL_PAD_INFO_INTERCEPTED	2019-01-02 15:45:51 +01:00
Megamouse	8ad14c4ada	Overlays: fix input loop when no controllers are detected	2019-01-02 15:45:51 +01:00
eladash	db784556aa	rsx: Evaluate cond render test at set_render_enabled	2018-12-30 15:04:59 +01:00
eladash	568206d11a	Fix rsx capture (again)	2018-12-30 15:04:59 +01:00
Jan Beich	8d308bb4c0	overlays: don't use /proc on BSDs as it may not be mounted	2018-12-29 18:07:45 +03:00
kd-11	9c46386dd4	rsx: Check av configuration when selecting display buffers! - Some applications have mismatch between video output configuration and display buffer sizes	2018-12-24 09:05:19 +03:00
kd-11	7555be232f	rsx/vp: Fix double dst commands - Test the vec_result mask before assigning to actual output Sometimes, VEC op is used to write to Rx, and SCA op is used to write to o[x]!	2018-12-24 09:05:19 +03:00
kd-11	10d96a60f1	rsx/capture: Force flip if no flip event was recorded	2018-12-24 09:05:19 +03:00
kd-11	f48abde14b	rsx: Fixups for immediate rendering mode - Immediate mode is isolated from the rest of the vertex configuration - TODO: Verify register behaviour when immediate mode is used Check if per-primitive const register values are supported (likely are)	2018-12-24 09:05:19 +03:00
kd-11	4b79ef1ad9	rsx: Implement stencil mirror views - Implements a mirror view of D24S8 data that accesses the stencil components. Finishes the implementation of TEX2D_DEPTH_RGBA as the stencil component was previously missing from the reconstructed data - Add a few missing destructors Image classes are inherited a lot and I forgot to make the dtors virtual	2018-12-24 09:05:19 +03:00
kd-11	696b91cb9b	rsx: Reimplement conditional execution in shaders - Per-channel conditional execution introduces RAW hazards all over the place - Its cheaper to process both branches and select between the two - Also improves ShaderVariable functionality to allow functionality such as match_size and taking complex variables as inputs	2018-12-24 09:05:19 +03:00
kd-11	c75749f8ce	rsx: fix flip logic when grabbing output from the surface cache	2018-12-24 09:05:19 +03:00
Megamouse	bc3ab7a9d9	Input: Enable In-Game Pad Config Reset	2018-12-17 19:41:18 +01:00
vit9696	5a40c1802b	Support macOS bundling for binary distribution	2018-12-16 18:17:21 +03:00
eladash	c50d459b1e	cleanup rsx fifo debugger command display	2018-12-15 19:40:18 +03:00
eladash	098d634328	rsx fifo: Fix call cmd offset mask highest 3 bits are masked according to tests, also filter certainly invalid jumps with offset higher than max	2018-12-15 19:40:18 +03:00
eladash	c2aa10cccd	reduce register_pair container	2018-12-15 19:40:18 +03:00
eladash	45ed58cdaf	Fix rsx capture replay Allow to capture non-increment cmd flag that was missing in command.reg	2018-12-15 19:40:18 +03:00
eladash	87988e9da8	rsx fifo: Stability improvements * Restore stack in fifo error handling * Update get register after the cmd execution * Fix put pause in the middle of command * Add restore points when branching to self * Precise nopcmd detection * Test all invalid cmds for early treatment of queue corruption	2018-12-15 19:40:18 +03:00
eladash	835a552d8d	rsx: Implement cellGcmSetNotify	2018-12-15 19:40:18 +03:00
eladash	415b995a54	log rsx get ctrl	2018-12-15 19:40:18 +03:00
eladash	8cbaa8627c	Do not rely on cellPadInit in native ui	2018-12-15 13:51:16 +01:00
Rui Pinheiro	54bfe2e102	Add log warning on slow flush path	2018-12-11 22:37:10 +03:00
Rui Pinheiro	18b9ee4541	Reimplement overlapping fbo "hack" To avoid the need (and performance hit) of Read Color/Depth Buffers, we may not invalidate overlapping fbos inside lock_memory_region unless they are guaranteed to be superseded by the new one. This avoids e.g. issues with overblooming, among others.	2018-12-11 22:37:10 +03:00
Rui Pinheiro	5ab7296665	Fix xcode build	2018-12-11 22:37:10 +03:00
Rui Pinheiro	bcdf91edbb	Misc. Texture Cache fixes	2018-12-11 22:37:10 +03:00
Rui Pinheiro	9d1cdccb1a	Implement dedicated texture cache predictor	2018-12-11 22:37:10 +03:00
Rui Pinheiro	af360b78f2	Texture cache section management fixups Fixes VRAM leaks and incorrect destruction of resources, which could lead to drivers crashes. Additionally, lock_memory_region is now able to flush superseded sections. However, due to the potential performance impact of this for little gain, a new debug setting ("Strict Flushing") has been added to config.yaml	2018-12-11 22:37:10 +03:00
Nekotekina	476090a747	Detach VBlank and RSX Decompiler threads Should fix exception handling in RSX Thread	2018-12-04 23:41:54 +03:00
eladash	45942c4962	Fix segfault when scaled image dimension is less than clip's	2018-12-04 13:01:29 +03:00
eladash	fa5652fceb	rsx image_in: Implement negative scaling	2018-12-04 13:01:29 +03:00
eladash	ce500c75c4	throw exceptions in case of invalid/unknown operations in image_in	2018-12-04 13:01:29 +03:00
eladash	6ecf2fb3d0	rsx: default lv2 semaphore context + dma_4097 extracted from vsh	2018-12-04 13:01:29 +03:00
eladash	28e4a9e0d0	rsx image_in: Fix in_pitch 0 The hw doesnt fix pitch, when specifying src pitch 0 it copies the same pixels line to dst. keep in mind out_pitch = 0 is not allowed in image_in. Same goes for buffer_notify, though it allows out_pitch to be 0.	2018-12-04 13:01:29 +03:00
eladash	d1d3ac984e	rsx image_in: Fix src size calculation when in_pitch != line_lengh	2018-12-04 13:01:29 +03:00
eladash	0a1da14a15	rsx image_in: remove clip h and w hack If clip region is empty, dont execute	2018-12-04 13:01:29 +03:00
eladash	4ddafc481e	remove unreachable code	2018-12-04 13:01:29 +03:00
eladash	b48a4b6459	rsx-capture: reduce capture size * Dont bother capturing 'destination' blocks with no data. instead premap all main memory to ensure allocated * Capture zcull and tile state as their compressed gcm forms * Fix index array capturing, ignore empty sets * hle gcm: Fix byteswaping in cellGcmSetZcull	2018-12-04 13:01:29 +03:00
NicknineTheEagle	32059bfaa2	Properly get PARAM.SFO and icons for C00 games (#5370 ) * Added a helper function for fetching game's PARAM.SFO path This should properly get SFO path for unlocked C00 games * Normalized line endings * Refresh game list after installing a RAP file	2018-12-04 01:46:01 +03:00
kd-11	a56ba737b5	vk: Silence log spam	2018-12-03 20:01:23 +03:00
kd-11	504ab5a6d4	rsx: Minor cleanup to silence stupid compiler warnings	2018-12-03 20:01:23 +03:00
kd-11	f4c28eceef	rsx: Fix null renderer	2018-12-03 20:01:23 +03:00
kd-11	9d0042f509	rsx: Fixup for the flattener - Reset the flattener before use - Better detection of FIFO misalignment	2018-12-03 20:01:23 +03:00
kd-11	ec768afbd9	rsx: Flip workarounds for applications that flip via syscall - Do not assume flip marks end-of-frame if executed via syscall - Also disables skip_frame for these applications as there is no frame boundary - NOTE: QUEUE_HEAD cannot be relied on as it is seemingly possible to flip the same head and not need to queue it	2018-11-30 23:51:25 +03:00
kd-11	2168159d03	gl: Fix flip regression - Restore graphics state after flip (including active fbo) because flip can be made through a syscall	2018-11-30 23:51:25 +03:00
kd-11	b96ed5cd4e	gl: Do not rely on driver statistics for s3TC textures; they are inconsistent.	2018-11-30 23:51:25 +03:00
kd-11	f1c3b46d60	rsx: Fixup - undo vertex cache 'improvements'	2018-11-30 23:51:25 +03:00
kd-11	5b6e1420f3	rsx: Pipeline barriers fixed up - Ensure barriers are invoked even if no draw occurs! -- Ensures that deferred commands are executed eventually	2018-11-30 23:51:25 +03:00
kd-11	8a186bb97e	rsx: Fix insertion of execution barriers - Ignore barriers inserted after BEGIN but before any draw commands are emitted - Properly process tail barriers inserted before END but after draw commands are submitted - Ignore execution barriers with no effect (same register value written)	2018-11-30 23:51:25 +03:00
kd-11	1d19f71a46	rsx: Re-enable fifo error reset	2018-11-30 23:51:25 +03:00
kd-11	718a04c84f	fixup: Clear disabled attrib entries	2018-11-30 23:51:25 +03:00
kd-11	833c25894f	[WIP] rsx: Rebase cleanup	2018-11-30 23:51:25 +03:00
kd-11	5193c99973	rsx: Enable dynamic FIFO preprocessing - Tries to detect when FIFO preprocessing is beneficial and only enables optimizations if the benefit outweighs the cost - Current threshold is at least 500 draw calls saved at over 2000 draw calls to justify the overhead - TODO: More tuning for other CPUs	2018-11-30 23:51:25 +03:00
kd-11	7b065d7781	rsx: Fixup; input attributes blob decoding - Use an unstructured blob and index into the vec4 structures to extract the real data	2018-11-30 23:51:25 +03:00
kd-11	846daadd5d	rsx: Fixups - Improve vertex attribute layout format. Allows for full 16-bit attribute divisor - Use actual pitch when declaring framebuffer rsx pitch instead of register value in case of swizzle? rendering	2018-11-30 23:51:25 +03:00
kd-11	2e32777375	rsx: Scrap the prebuffered queue approach - Basically starting over - The cost of making command copies into the queue has a measurable impact	2018-11-30 23:51:25 +03:00
kd-11	9deecd506a	fixup: It is possible for NOP commands to contain other garbage	2018-11-30 23:51:25 +03:00
kd-11	26a56ef1f1	vk: Spec compliance. - TODO: Implement push_constants path instead of copy + bind descriptor sets	2018-11-30 23:51:25 +03:00
kd-11	d6b4440ef9	gl: Separate vertex env from program env	2018-11-30 23:51:25 +03:00
kd-11	435afcb865	rsx: Fix fifo draw barriers	2018-11-30 23:51:25 +03:00
kd-11	2d88e41583	rsx: Fix some checks when using inlined array rendering	2018-11-30 23:51:25 +03:00
kd-11	54ec363e88	rsx: Critical pipeline fixes - Fix scissor and viewport binding behavior - Fixes recovery if empty scissor is specified and then 'fixed' later - Optimizes state binding a bit	2018-11-30 23:51:25 +03:00
kd-11	1ad76ad331	rsx: Restructure programs - Also re-enable pipeline optimizations	2018-11-30 23:51:25 +03:00
kd-11	b0a6b72ce8	rsx: Optimizations - Replace a few more vectors with simple_array<T> - Avoid unnecessary string comparisons in backends. We already know referenced textures from the program analysers!	2018-11-30 23:51:25 +03:00
kd-11	677b16f5c6	rsx: Fixups - Also fix visual corruption when using disjoint indexed draws - Refactor draw call emit again (vk) - Improve execution barrier resolve - Allow vertex/index rebase inside begin/end pair - Add ALPHA_TEST to list of excluded methods [TODO: defer raster state] - gl bringup - Simplify - using the simple_array gets back a few more fps :)	2018-11-30 23:51:25 +03:00
kd-11	e01d2f08c9	rsx: Refactor FIFO - Removes fifo structures from common RSXThread - Sets up a dedicated FIFO controller - Allows for configurable queue optimizations	2018-11-30 23:51:25 +03:00
eladash	37b6afaf2c	rsx: inlined array stride fix	2018-11-11 23:17:07 +03:00
eladash	75221a6078	rsx: Fix inlined vertex array validation	2018-11-04 22:57:18 +03:00
eladash	fb30c8a937	rsx enums: fix typos	2018-10-30 22:33:59 +03:00
eladash	4069470585	rsx-debugger: ignore invalid cmds basically ignore all non method cmds when scrolling to the next command, not only branches.	2018-10-30 22:33:59 +03:00
Megamouse	d56c85fe01	RSX/Capture: fix filePath and remove strict mode check (#5283 ) - Fixes regression introduced by kd-11 when merging in jarves' flip rework.	2018-10-27 13:06:50 +03:00
eladash	5ee351234c	rsx-capture: unbreak	2018-10-23 18:02:03 +03:00
elad	6829fa0286	rsx: Improve inlined arrays (#5248 ) * rsx: Implement register reads in inlined arrays * rsx: Check for disabled streams in inlined arrays	2018-10-20 16:00:53 +03:00
Nekotekina	1b37e775be	Migration to named_thread<> Add atomic_t<>::try_dec instead of fetch_dec_sat Add atomic_t<>::try_inc GDBDebugServer is broken (needs rewrite) Removed old_thread class (former named_thread) Removed storing/rethrowing exceptions from thread Emu.Stop doesn't inject an exception anymore task_stack helper class removed thread_base simplified (no shared_from_this) thread_ctrl::spawn simplified (creates detached thread) Implemented overrideable thread detaching logic Disabled cellAdec, cellDmux, cellFsAio SPUThread renamed to spu_thread RawSPUThread removed, spu_thread used instead Disabled deriving from ppu_thread Partial support for thread renaming lv2_timer... simplified, screw it idm/fxm: butchered support for on_stop/on_init vm: improved allocation structure (added size)	2018-10-19 22:22:35 +03:00
elad	623f1b35f6	rsx_capture/gcm: Fix tile binding (#5246 ) * gcm: Fix tile offset setting highest bit signifyies location, so ignore that while reading the offset. * rsx-capture: Fix tile binding fixes division by zero when dividing by pitch when the tile is not bound. * rsx-capture: Fix zcull binding	2018-10-12 19:05:08 +03:00
eladash	83b6c98563	rsx: Fix u16 index arrays overflow Force u32 index array destinations to avoid overflows when adding vertex base index.	2018-10-08 16:39:47 +03:00
eladash	e361e0daa6	rsx: Fix restart index check for u16 index arrays Dont ignore upper bits of the restart index with u16 types	2018-10-08 16:39:47 +03:00
Megamouse	49e5212a8f	RSX/Overlays: Add option for japanese button layout	2018-10-03 23:08:33 +02:00
Megamouse	76da3fa907	RSX/Overlays: don't press buttons on every iteration	2018-10-03 21:37:05 +02:00
Megamouse	9693d1c3a3	RSX/Overlays: formatted comments	2018-10-03 21:37:05 +02:00
eladash	348db050ae	rsx: Fix texture height read	2018-10-03 20:57:46 +03:00
eladash	62f97f2e5f	rsx: Fix default texture dimensions haha	2018-10-03 20:57:46 +03:00
eladash	fa723f6dc4	rsx: Fix texture depth read	2018-10-03 20:57:46 +03:00
eladash	a92ae827c1	rsx: Remove texture mipmap hack	2018-10-03 20:57:46 +03:00
eladash	6586090307	rsx: Remove texture size hack	2018-10-03 20:57:46 +03:00
eladash	eacd1b8f13	rsx: Remove texture address hack	2018-10-03 20:57:46 +03:00
Nekotekina	da6ce80f4f	Make vm::get_super_ptr return contiguous memory Cleanup RSX code complexity	2018-09-27 23:37:13 +03:00

... 16 17 18 19 20 ...

3700 commits