rpcsx

mirror of https://github.com/RPCSX/rpcsx.git synced 2026-03-26 07:05:21 +01:00

Author	SHA1	Message	Date
Nekotekina	16401722f1	SPU LLVM: fix $SP passing in functions, write PC on halt Allows to skip updating $SP in optimizable functions.	2019-05-15 15:42:03 +03:00
Nekotekina	b2d0ca83fb	LLVM DSL: simplify value_t template for array	2019-05-15 15:17:36 +03:00
Nekotekina	09eb633f69	SPU ASMJIT: increase stack frame size It seems Windows has minimal stack frame size 0x28.	2019-05-15 02:16:08 +03:00
Nekotekina	3753d27aba	SPU: fix Giga mode (kinda) Don't scan before the entry point. Disable stack mirror in SPU LLVM. Improve analyser logic for holes.	2019-05-14 22:15:04 +03:00
Nekotekina	c481472faf	SPU ASMJIT: add PIC support (fix) Also cleanup and adapt for GHC CC.	2019-05-14 22:15:04 +03:00
Nekotekina	82295d131a	SPU LLVM: split LLVM IR dump to spu-ir.log Also move disasm to spu_recompiler_base::dump. Interleave disasm with block target info for convenience.	2019-05-14 22:15:04 +03:00
Nekotekina	ea554ae828	Implement 'Max SPURS Threads' option (hack) Pauses SPURS threads beyond limit automatically if set.	2019-05-14 22:15:04 +03:00
Nekotekina	1eed421774	SPU LLVM: use branch patchpoints again Renewed and adapted for PIC and all branch types. This may address performance degradation after #5923.	2019-05-14 22:15:04 +03:00
Nekotekina	2f6707d0a0	SPU LLVM: regain some efficiency Avoid returns from the recompiler gateway, favoring tail calls. This may address performance degradation after #5923.	2019-05-14 22:15:04 +03:00
Nekotekina	f33b81545e	SPU: implement recompiler gateway function in assembly Use GHC calling convention directly for SPU object entry points. This may address performance degradation after #5923.	2019-05-14 22:15:04 +03:00
Nekotekina	a74fd27e3d	SPU LLVM: fix SPU termination (spu_escape) on Windows Adjust restored stack pointer for the lack of tail call.	2019-05-14 22:15:04 +03:00
Nekotekina	cc8c635855	SPU: PIC support preview SPU ASMJIT not supported yet. Giga mode not supported properly.	2019-05-14 22:15:04 +03:00
scribam	22f61caf9f	GLTexture: add missing #pragma once directive	2019-05-12 18:32:11 +03:00
scribam	6c5ea068c9	Remove redundant semicolons Fix "-Wextra-semi" warnings	2019-05-12 18:32:11 +03:00
Rui Pinheiro	1f82a26a9c	SPU LLVM: Fix Mega	2019-05-12 00:39:42 +03:00
scribam	3623f4343f	gl/vk: clear scissor_setup_invalid bit along with scissor_config_state_dirty bit	2019-05-11 13:13:49 +03:00
Nekotekina	8194c92f1c	SPU LLVM: disable GHC CC for chunks on Windows Causes fatal error inside LLVM.	2019-05-11 02:35:16 +03:00
Nekotekina	5d33d9a3d9	Enable most warnings in GCC	2019-05-11 02:13:19 +03:00
Nekotekina	7492f335e9	SPU analyser: basic function detection in Giga mode Misc: fix EH frame registration (LLVM, non-Windows). Misc: constant-folding bitcast (cpu_translator). Misc: add syntax for LLVM arrays (cpu_translator). Misc: use function names for proper linkage (SPU LLVM). Changed function search and verification in Giga mode. Basic stack frame layout analysis. Function detection in Giga mode. Basic use of new information in SPU LLVM. Fixed jump table compilation in SPU LLVM. Disable broken optimization in Accurate xfloat mode. Make compiled SPU modules position-independent in SPU LLVM. Optimizations include but not limited to: * Compiling SPU functions as native functions when eligible * Avoiding register context write-out * Aligned stack assumption (CWD alike instruction)	2019-05-11 02:13:19 +03:00
Megamouse	fce9d6a7b8	Qt/input: add LED color picker to pad settings dialog	2019-05-09 22:02:00 +02:00
eladash	7ead021aa7	rsx: Fix 3d swizzled texture to linear conversation	2019-05-08 23:48:39 +03:00
eladash	13d8e33d9a	Return ESRCH if ppu thread ID was not found in sys_cond_signal_to	2019-05-07 08:58:07 +03:00
eladash	4e2650af91	Fix sys_rwlock_wlock timedout event If the rwlock is currently acquired by a writer signaling readers is wrong and will lead to EPERM for wunlock! Only signal blocked readers if the rwlock is currently acquired by readers	2019-05-07 08:58:07 +03:00
eladash	ca08418dc1	Fix sys_rwlock_runlock on waiting readers readers can wait on the sleep queue if a writer lock has been blocked before it, in this case after runlock: writer should acquire the lock but the r's sleep queue is still not empty!	2019-05-07 08:58:07 +03:00
Megamouse	5141590729	overlays: add separate timestamp for the start of the d-pad interval	2019-05-06 22:00:40 +02:00
Malcolm Jestadt	fd2bc95a7b	overlays: Double dpad repeat rate	2019-05-06 22:00:40 +02:00
Megamouse	c1e245ae73	Emu: msg_dialog_frame fixup: don't reject on Close to prevent Emu.Stop()	2019-05-05 16:29:50 +02:00
Megamouse	b639584acc	Emu/Qt: Fix Boot Recent when using BootGame(add_only=true)	2019-05-05 16:29:50 +02:00
Megamouse	b0a24665e5	Emu: msg_dialog_frame fixes	2019-05-05 16:29:50 +02:00
kd-11	9c346c92f3	gl: undo an accidental deletion	2019-05-05 13:37:55 +03:00
kd-11	2bec304cca	vk: Allow some drivers to bypass window polling if not needed	2019-05-05 13:37:55 +03:00
Nekotekina	a703460fc6	SPU ASMJIT: skip some unused analyser steps May improve performance	2019-05-04 19:35:13 +03:00
Nekotekina	ba1ec1d5d6	SPU analyser: remove use_ra from HBR Since this is a hint instruction, we don't really use reg value here.	2019-05-04 18:33:58 +03:00
Nekotekina	45ce8db6cb	SPU Analyser: fix reg origin regression Propagate phi instead of claiming new values	2019-05-04 18:29:47 +03:00
Nekotekina	4bd022f778	SPU analyser: minor logic fix and cleanup Don't fill any chunk info for now (design mistake).	2019-05-03 14:18:22 +03:00
Nekotekina	6c34d7104e	SPU analyser: fix excessive workload list size Typo grade; regression	2019-05-02 23:29:02 +03:00
Nekotekina	54dc617f39	SPU analyser: internal spu_itype optimization Use only 1 byte for instruction type. Flags are transformed into range comparisons.	2019-05-02 19:32:09 +03:00
Nekotekina	15bd3b8724	SPU: fix minor UB in STQD/LQD instructions	2019-05-02 18:00:49 +03:00
Nekotekina	2b4da18709	SPU LLVM: fix xfloat regression It was an old bug with possible hidden use of deleted instructions.	2019-05-02 13:39:43 +03:00
Nekotekina	d48dc29e55	SPU LLVM: fix perf regression Bug in the analyser was created recently in #5882.	2019-05-02 13:39:43 +03:00
Nekotekina	69d2ea35b9	SPU: minor analyser cleanup	2019-05-02 13:39:43 +03:00
Nekotekina	a4c4ee9cb2	SPU: fix excessive cache size regression	2019-05-02 13:39:43 +03:00
kd-11	6b7cd458e3	rsx: Silence some diagnostics unless compiled with debugging options	2019-05-01 15:36:21 +03:00
kd-11	1d5c52f476	rsx: Ignore stencil clear flag if the stencil write mask is disabled	2019-05-01 15:36:21 +03:00
kd-11	48cb265c2c	rsx: Bounds check on local resource for atlas merge. - Local resources can also have padded pitch dimensions and false-positives on range overlap tests	2019-05-01 15:36:21 +03:00
kd-11	63f9b8e0c6	gl/vk: Minor cleanup	2019-05-01 15:36:21 +03:00
kd-11	ec9aa74008	rsx: Fix section base offset calculation for blit_dst targets which affects confirmed memory range - Fixes flushes only writing partially to target memory	2019-05-01 15:36:21 +03:00
kd-11	4e3ec162e2	rsx: Fix broken texture cache search when flipping	2019-05-01 15:36:21 +03:00
kd-11	6feffe6ff6	rsx: Ignore transfer offsets when wrapping behaviour is expected	2019-05-01 15:36:21 +03:00
kd-11	f56a6548b0	gl: Remove workaround for AMD driver bug fixed in driver 19.4.3	2019-05-01 15:36:21 +03:00
Nekotekina	1bc5e27507	SPU LLVM: move reg origin search to analyser Refactor SPU analyser (block_info struct). Fill register use info (currently unused).	2019-05-01 00:37:15 +03:00
Nekotekina	1294e0d189	SPU LLVM: improve codegen in loops Use a trick in check_state to improve LICM pass.	2019-05-01 00:37:15 +03:00
Nekotekina	e09c6ea4b4	SPU analyser: add spu_iflag Register information about register accesses.	2019-04-30 14:33:27 +03:00
Nekotekina	716737ecf2	LLVM DSL: expression matching (alpha) Implement remaining instructions. Implement match_expr method. Implement helper methods.	2019-04-30 14:33:27 +03:00
eladash	3bd29b8bac	Fix Unregistered HLE function access	2019-04-29 23:04:16 +03:00
eladash	ea1c9a2e17	Fix PPU Breakpoints and ppu_check_toc	2019-04-29 23:04:16 +03:00
kd-11	243df38360	rsx: Fix VP writes to CC with a MOV instruction - When moving to CC, the operation has VEC flag disabled and also temp regs disabled. Looks to be the catch-all ELSE in the selection logic.	2019-04-25 16:23:05 +03:00
kd-11	3cbccdd760	rsx: Fragment shader decompiler cleanup TODO: Investigate the _s input modifier behaviour further, in case it can avoid generating zeroes from a MAD instruction. x = MAD(+ve, -ve, -ve) with _s input modifier in BFBC expects result to be Non-zero	2019-04-25 16:23:05 +03:00
kd-11	4cd1c25729	"rsx: Ignore argument sign for SQRT operations"	2019-04-25 16:23:05 +03:00
kd-11	32396ba366	rsx: Simplify use of some mixed input functions using OPFLAGS to avoid implicit conversions	2019-04-25 16:23:05 +03:00
kd-11	f12bd8068c	rsx: Fragment decompiler fixups - Properly test for NaN and Inf when clamping down to fp16 - Optimize divsq a bit; mix(vec, vec, bvec) emits OpSelect which is what we want here, instead of component-wise selection which is much slower.	2019-04-25 16:23:05 +03:00
kd-11	abe7188acf	rsx: Proper workaround for broken DIVSQ instruction on realhw - While mul(0, nan) = nan and 0 / 0 = nan, 0 / sqrt(0) = 0 because of hw gremlins. normalize(0) is also nan so this behaviour does not work around that particular case either which makes it even more baffling.	2019-04-25 16:23:05 +03:00
kd-11	60f3059d22	rsx: Compensate for nvidia's low precision attribute interpolation - The hw generates inaccurate values when doing perspective-correct interpolation of vertex output attributes and makes the comparison (a == b) fail even when they are a fixed constant value. - Increase equality tolerance when doing comparisons in fragment shaders for NV cards only to work around this issue. - Teepo fix	2019-04-25 16:23:05 +03:00
kd-11	463b1b220d	rsx: Improve accuracy of shadow compare Ops when non-integer depth formats are used - The fixed-point D24S8 format does special Z clamping during compare which matches PS3 behaviour - D32S8 is a floating point format and comparison with Dref > 1 always fails causing black edges/borders	2019-04-25 16:23:05 +03:00
kd-11	7ad1646c2c	vk: Skip feature check if extension is not supported	2019-04-25 16:23:05 +03:00
kd-11	06a85f00d1	rsx: Shader decompiler cleanup and improvements - Improve support for float16_t by minimizing mixed inputs to functions (ambiguous overloads) - Minimize amount of downcasts in code by using opcode flags - Re-enable float16_t support for vulkan	2019-04-25 16:23:05 +03:00
kd-11	a668560c68	rsx: Use native half float types if available - Emulating f16 with f32 is not ideal and requires a lot of value clamping - Using native data type can significantly improve performance and accuracy - With openGL, check for the compatible extensions NV_gpu_shader5 and AMD_gpu_shader_half_float - With Vulkan, enable this functionality in the deviceFeatures if applicable. (VK_KHR_shader_float16_int8 extension) - Temporarily disable hw fp16 for vulkan	2019-04-25 16:23:05 +03:00
kd-11	ee319f7c13	rsx: Implement strict clamp16 operation needed for NVIDIA cards	2019-04-25 16:23:05 +03:00
Nekotekina	2ade3c594c	LLVM DSL: expression matching (preview 2) Implement more instructions.	2019-04-25 03:33:18 +03:00
Nekotekina	aca61fdcf9	LLVM DSL: implement expression matching (preview) Only literal match for binary ops implemented.	2019-04-24 23:55:41 +03:00
Nekotekina	8754bbd444	SPU LLVM: add match_vr<> template Returns reg value only if type is compatible, avoiding bitcast.	2019-04-24 23:55:41 +03:00
Nekotekina	dd9bd1338b	SPU LLVM: add get_vrs<> template	2019-04-24 23:55:41 +03:00
Nekotekina	3e0b45719d	LLVM DSL: rewrite zshuffle, shuffle2, build Add llvm_const_vector template.	2019-04-24 23:55:41 +03:00
Nekotekina	b02503963e	LLVM DSL: rewrite splat, fsplat, vsplat Add llvm_const_float and llvm_splat templates.	2019-04-24 23:55:41 +03:00
Nekotekina	c83e65f29e	LLVM DSL: rewrite extract and insert	2019-04-24 23:55:41 +03:00
Nekotekina	b7b93eae13	SPU LLVM: minor bitcast cleanup Remove redundant explicit constand propagation in some instructions.	2019-04-24 23:55:41 +03:00
Nekotekina	2eac59f59a	LLVM DSL: rewrite add_sat and sub_sat Simplify constant folding logic	2019-04-24 23:55:41 +03:00
Nekotekina	ac473eb400	Rewrite cpu_translator::rol, add fshl and fshr Use new funnel shift intrinsics	2019-04-24 23:55:41 +03:00
Nekotekina	42448cf3e5	Remove cpu_translator::scarry, cpu_translator::merge	2019-04-24 23:55:41 +03:00
Nekotekina	524aac75ed	LLVM DSL: rewrite bitcast, zext, sext, trunc, select, min, max ops Are made composable in expressions similar to arithmetic ops. Implement noncast in addition to bitcast (no-op case). Implement bitcast constant folding. Fixed some misuse of sext<>.	2019-04-24 23:55:41 +03:00
Nekotekina	3925cb59ac	LLVM DSL: fix pointer type traits Clear and match 'void' type	2019-04-24 23:55:41 +03:00
Nekotekina	dc9118ef50	LLVM DSL refactoring Properly forward value categories in expression structs. Simplify SFINAE tests (is_llvm_expr, llvm_common_t) in global operators. Add llvm_const_int and remove llvm_add_const, llvm_sub_const, etc. Add llvm_ord and llvm_uno for FP comparison via >=< operators. Replace cpu_translator::fcmp with fcmp_ord and fcmp_uno.	2019-04-24 23:55:41 +03:00
Ani	a24ede4f40	cellPad: Update vendor and product IDs - Used IDs were not from the Guitar Hero instruments but in fact from the Rock Band ones. Sets the correct Guitar Hero IDs and adds the Rock Band ones on comments. TODO: Allow selecting the specific devices on the PAD Settings. - Adds DJ Hero Turntable VID/PID. - Adds Dance Dance Revolution Mat VID/PID.	2019-04-20 23:17:13 +01:00
eladash	3a5f4ed757	Print SPU Group ID on the debugger	2019-04-20 20:43:58 +01:00
eladash	450e2c9a0e	cellSaveData: Add missing SDK version check for setParam->reserved2 check	2019-04-20 20:43:58 +01:00
eladash	ae5a4b697e	Fix cellSaveDataListAutoSave/Load unk flags	2019-04-20 01:04:41 +03:00
eladash	1e9d3346d1	Reschedule in cellMsgDialogOpen2	2019-04-20 01:04:41 +03:00
eladash	9446bd2d3f	Handle a few more cellSaveData errors * Check directory existence if setParam is NULL (dont create directory) * Fix mask for reCreateMode * Check a few setParam fields including reserved buffers. * Fix sizeKb when the dir is empty except from PARAM.SFO * Fix error checking when CELL_SAVEDATA_RECREATE_YES is specified but setParam is NULL (Doesnt do anything, simply errors)	2019-04-20 01:04:41 +03:00
eladash	6f76e34104	rsx: Fix race on clearing native_ui vs emu_requested flag	2019-04-20 01:04:41 +03:00
eladash	67f098627a	Fix sys_spu group ID	2019-04-20 01:04:41 +03:00
eladash	ff11d9a3bd	Improve scheduler control for cellSaveData TODO: There are probably more spots where we should yield. A little more at the start because PacketRead is called twice. Dont use sys_timer_usleep because it will just call this_thread::yield() repeatedly.	2019-04-20 01:04:41 +03:00
eladash	9497270da5	Implement initial arguments error checking for cellSaveData	2019-04-20 01:04:41 +03:00
eladash	2b4bc588dc	Put missing check_state() in some places Fixes a few verification failures while closing the emulator with HLE liblv2	2019-04-20 01:04:41 +03:00
eladash	888cb9d673	Remove reader_lock executed in every instruction by RSX Use optimistic double check instead, use one load instruction for the check to be atomic + Read emu status once every FIFO iteration	2019-04-20 01:04:41 +03:00
eladash	f25587d24c	rsx: Write vblank semahpre, minor semaphore acquire optimization	2019-04-20 01:04:41 +03:00
eladash	777a99d01b	misc: Lower default perf overlay detail Because RSX Guest utilization confuses people and is only meaningful for debugging.	2019-04-18 22:23:05 +03:00
Nekotekina	7865982208	Fix static_hle log channel definition	2019-04-16 23:49:18 +03:00
Nekotekina	52c589ed3d	Revert disabling AVX path in SPU verification. It was experimental and builds for tests are available in history.	2019-04-16 23:49:18 +03:00
Nekotekina	9060177dbd	SPU transactions: add SSE path if AVX is not available This handles hypothetical situation when AVX is disabled system-wise. Also refactored register use, to match Windows path with Linux path. This reduces read set a little at the cost of stack use.	2019-04-16 23:49:18 +03:00
Megamouse	b929c13c45	implement get_firmware_version add firmware version to the first line in the log	2019-04-16 22:13:28 +02:00
msuih	baf42430d6	Decrease severity of sys_net_bnet_close	2019-04-16 18:39:57 +03:00
Nekotekina	136fc8cfe3	SPU ASMJIT: avoid AVX in verification (experimental)	2019-04-14 18:03:45 +03:00
Nekotekina	c1edae73c5	SPU ASMJIT: move vzeroupper a bit	2019-04-14 15:10:54 +03:00
Nekotekina	8deb20e928	SPU: write cache before compiling	2019-04-13 22:56:11 +03:00
kd-11	df3b46a611	rsx: Improve texture sourcing and clipping when reverse scanning is enabled - When reverse scanning, offsets are inverted and offset value of 0 is logically equivalent to an offset of -1 - Add an explicit message if clipping happens to avoid silent errors/bugs	2019-04-12 15:36:21 +03:00
Nekotekina	0d415407c7	sys_fs_unlink: add CELL_EISDIR check	2019-04-12 12:24:36 +03:00
Nekotekina	f40320bcae	Fix cellVdecOpen Use pseudo-address in sys_ppu_thread_create calls	2019-04-11 21:20:22 +03:00
TGEnigma	38cc92ec45	Add _sys_ppu_thread_create and sys_ppu_thread_rename error checks	2019-04-11 18:14:05 +03:00
eladash	8da78c098c	SPU LLVM: Fix branch to self at start of block state check	2019-04-11 17:47:52 +03:00
eladash	eba8e2284b	SPU LLVM: Fix CFLTU Clamp properly result from both sides! TODO: Figure out whats different CreateFPToUi has from CFLTU and why it fails here.	2019-04-11 17:47:52 +03:00
eladash	969af86eba	SPU: Implement BISLED DFCMGT instruction removed, it was wrong to add to begin with ASMJIT: Fix compilation of double compare instructions, move exception to runtime instead of compiletime! Jarves confirmed that he implemented this instruction because of that bug with asmjit only, affected God Of War 3	2019-04-11 17:47:52 +03:00
eladash	b307aff9eb	Prefetch byteswapped opcodes in ppu interpreter	2019-04-11 17:47:52 +03:00
eladash	1c462abc37	Make sure to update cia when calling in unknown hle table func access	2019-04-11 17:47:52 +03:00
eladash	3304e3b0b7	PPU LLVM: Fix STSWI and LSWI	2019-04-11 17:47:52 +03:00
eladash	f028737db8	Implement fallback for PPU LLVM This matches with interpreter implementation, fixing unregistered functions in lost cases	2019-04-11 17:47:52 +03:00
eladash	a9014a8cac	ppu Fast/Precise: Fix SIMD instructions VSUM2SWS, VPKSWSS, VPKSHUS, VPKSHSS Also rewrite VPKSHUS for speed.	2019-04-11 17:47:52 +03:00
eladash	e21504d52d	ppu interpreter: Improve FPCC field handling	2019-04-11 17:47:52 +03:00
eladash	aa44ef1f44	Fix default PPU nj status TODO: Support it...	2019-04-11 17:47:52 +03:00
eladash	d555eeb0f4	Check start status in sys_prx_start/stop_module	2019-04-11 17:47:52 +03:00
Inviuz	52a12185a0	Initial sys_overlay	2019-04-10 23:25:09 +03:00
scribam	1d947daa81	hle: Add some more functions	2019-04-10 22:15:35 +03:00
Nekotekina	40142420c1	Implement vfs::host::unlink Emulate POSIX behaviour in sys_fs_unlink. This should allow to delete opened files transparently on Windows.	2019-04-10 13:58:12 +03:00
Nekotekina	9736773c04	Implement vfs::host::rename With spurious access error workaround	2019-04-10 13:58:12 +03:00
Nekotekina	3354f068fc	PPU/SPU transactions: ease cache line interference (TSX path) Touch memory on the same memory page, but different cache lines.	2019-04-10 13:58:12 +03:00
kd-11	12dc3c1872	vk: Dynamic heap management to potentially fix ring buffer overflows - Allows checking one heap type at a time, on demand - Should avoid OOM situations unless inside an uninterruptible block	2019-04-09 13:40:54 +03:00
kd-11	a4495c35b7	rsx: Fixups for swizzled texture scanning - Revert to using block metrics, but with optional per-channel decode stage for the final transfer. Much cleaner than hacking in the width to be in channels instead of blocks.	2019-04-09 13:40:54 +03:00
kd-11	a5ed30a8c0	rsx: Fixups for data cast operations via typeless transfer	2019-04-09 13:40:54 +03:00
kd-11	f04a0a2bb6	rsx: Remove some old restrictions affecting memory persistence	2019-04-09 13:40:54 +03:00
kd-11	0a604e39f1	rsx: Implement RGB655 decode	2019-04-09 13:40:54 +03:00
kd-11	cc3809fbfe	gl: Register a few more missing formats for conversion	2019-04-09 13:40:54 +03:00
kd-11	e4e86455f2	rsx: Fix temporary subresource caching behaviour - Do not cache if a gathered subresource contains a bound RTT - Change op to dynamic copy if parent is still bound	2019-04-09 13:40:54 +03:00
kd-11	3249000511	rsx: Improvements to texture scanning - Removes CPU-only transforms that broke GPU-side code. -- Channels in GPU compute are laid out in cell-order, but CPU was uploading in favorable order and compensating with swizzles. -- This leads to 2 different layouts depending on the location of the data (CPU vs GPU) - Implement R8G8_R8B8 interleaved format decode - General improvements	2019-04-09 13:40:54 +03:00
kd-11	0f7af391d7	vk: Implement copy-to-buffer and copy-from-buffer for depth_stencil formats - Allows D24S8 and D32S8 transport via typeless channels - Allows uploading and downloading D24S8 data easily - TODO: Implement optional byteswapping to fix flushed readbacks with the same method	2019-04-09 13:40:54 +03:00
kd-11	366e4c2422	rsx: Preliminary support for format conversions using typeless resolve	2019-04-09 13:40:54 +03:00
kd-11	b7470cfc1a	rsx: Tighten format checks in cache hit tests	2019-04-09 13:40:54 +03:00
kd-11	443fde760f	rsx: Blit engine clipping fixes - Do not round up sub-pixel offsets, round down instead - Do not allow incomplete sources for hw blit transfer - Reimplement src clipping (slice_h) - Check 'area' of incoming texels and correct for them before RTT lookup/transfer - Filter out incomplete targets when performing RTT lookup (1 texel or less contribution)	2019-04-09 13:40:54 +03:00
scribam	f30af3ccd2	hle: Add more missing functions	2019-04-07 23:31:15 +03:00
scribam	8dbf2638e2	hle: Add some missing functions 0xBA50BC23 => cellCelpEncOpenExt 0x1AC58D11 => cellHttpFlushCache 0xA39FE9DC => cellHttpEndCache 0xB4FA3111 => cellHttpInitCache 0x4A18A89E => sceNpMatchingSetRoomInfoNoLimit 0xB020684E => sceNpMatchingGetRoomInfoNoLimit	2019-04-05 17:28:10 +01:00
Alex James	d7ad991b7e	Fix macOS compilation This is needed for GL/glew.h to be found.	2019-04-02 03:16:55 +03:00
eladash	182054b8af	Implement sys_vm_append/return_memory	2019-03-31 14:57:21 +03:00
eladash	3c0564c9b7	Fix timer state after event queue was destroyed * Hw tests show state is unaffected by external destruction of the event queue * Minor race regarding state check fixed (can result in an undestroyable state)	2019-03-31 14:57:21 +03:00
eladash	90490f775d	Fix sys_timer_usleep specifically with 0 sleep time Remove context switch, replace it with host yield() for giving some cpu time for SPUs ans RSX	2019-03-31 14:57:21 +03:00
eladash	8185ef7610	rsx: Improve vblank accuracy	2019-03-31 14:57:21 +03:00
eladash	801e6114b6	rsx: Use relaxed store on fifo ctrl registers	2019-03-31 14:57:21 +03:00
eladash	a3f65084df	Fix sys_process_exit2 when SPUs are at av handler	2019-03-31 14:57:21 +03:00
eladash	1ed2055ec1	Fix cellVdecGetPicItem element popping behaviour	2019-03-31 14:57:21 +03:00
eladash	f2bbae9db4	Remove handle in cellVdecClose	2019-03-31 14:57:21 +03:00
eladash	8eb59271a5	Improve error checking of cellVdecOpen Those are the initial argument checks done by the firmware	2019-03-31 14:57:21 +03:00
eladash	d6025c6764	Fix cellPadGetInfo port status returned ASSIGN_CHANGES flag is not returned in this func.	2019-03-31 14:57:21 +03:00
eladash	47ca1b1dda	Minor optimizations in cellPad - Dont bother with shared_ptr since all pad_t management is going under the pad mutex. - Change m_pads type into std::array since its size is known	2019-03-31 14:57:21 +03:00
eladash	6502d933df	Fix stack memory view on the debugger the debugger uses super ptr which was unmapped for stack.	2019-03-31 14:57:21 +03:00
scribam	f15eb88f59	hle: Fix cellSysutilAvcExt module And add cellSysutilAvcExtSetChatMode and cellSysutilAvcExtSetChatGroup functions	2019-03-31 00:55:55 +03:00
scribam	1916cc1691	hle: Add cellSysmoduleUnloadModuleInternal and cellSysmoduleLoadModuleInternal functions	2019-03-30 23:52:56 +03:00
scribam	f369aeab7a	hle: Add FT_Get_X11_Font_Format function	2019-03-30 23:52:56 +03:00
scribam	0e9313d2df	hle: Add cellFontInitLibraryFreeType function	2019-03-30 23:52:56 +03:00
Nekotekina	d873802b9c	Use LLVM 9 Use new add/sub with saturation intrinsics	2019-03-30 01:36:48 +03:00
Nekotekina	7e0b941e9f	PPU LLVM: implement get_vrs<>() adaptor Make use of structured bindings	2019-03-30 01:36:48 +03:00
Nekotekina	d77fed6105	SPU LLVM: remove wrong dead code	2019-03-29 17:00:53 +03:00
scribam	a254a203bb	hle: Add libad_async and libad_core modules	2019-03-27 21:41:44 +00:00
Nekotekina	71b88cdc82	New SPU interpreter (SPU fast) Use LLVM to build SPU interpreter. Simplify interpreter loop.	2019-03-27 20:33:44 +03:00
scribam	a9eb321814	hle: Add sceNpEulaAbort function	2019-03-26 23:19:01 +03:00
scribam	956d039415	hle: Add cellVideoPlayerUtility module	2019-03-24 19:16:49 +03:00
scribam	581b205f73	hle: Add cellPesmUtility module	2019-03-24 19:16:49 +03:00
scribam	6c40b9f3e0	hle: Add cellDtcpIpUtility module	2019-03-24 19:16:49 +03:00
scribam	d6bf18eabc	hle: Add some sceNpMatchingInt functions	2019-03-24 17:29:18 +03:00
scribam	32ae7e466c	hle: Add cellNetAoi module	2019-03-24 17:29:18 +03:00
Nekotekina	7ea04d5d76	Minor optimization in SPU analyser Reduce vector copy/allocation	2019-03-23 02:43:41 +03:00
Nekotekina	4b381fbbb1	Implement spu_runtime::reset To handle JIT: Out Of Memory error.	2019-03-23 02:43:41 +03:00
Nekotekina	1880a17f79	SPU recs: implement spu_runtime::find Use this function to link to existing functions from branch patchpoints. Don't compile from branch patchpoints.	2019-03-23 02:43:41 +03:00
Nekotekina	31304f4234	SPU rec: refactor some trampoline generation Move branch/dispatch trampoline generation at startup.	2019-03-23 02:43:41 +03:00
Nekotekina	3794f65bb6	Add cpu_flag::jit_return	2019-03-23 02:43:41 +03:00
Nekotekina	849411693a	PPU LLVM: add MemoryManager3 For temporary allocations. Add flags in jit_compiler constructor.	2019-03-23 02:43:41 +03:00
Nekotekina	466d58ccef	SPU LLVM: fix branch patchpoints Forgot to passthrough 3rd arg (rip)	2019-03-23 02:43:41 +03:00
kd-11	41b87cf577	rsx: Blit engine fixes - If a transfer writes to a RTT and depth mismatch happens, create a local target and the upload function will likely resolve between the two - If a surface is rejected, reset the target region!	2019-03-22 21:27:15 +03:00
kd-11	86ad204636	rsx: Rebase output region when using upload-fallback path	2019-03-22 21:27:15 +03:00
kd-11	dbc8e70ddd	rsx: Silence some compiler noise	2019-03-22 21:27:15 +03:00
kd-11	3a4e3fa53a	rsx: Fix use-after-modify condition when inserting a draw command out of order - Fixes barrier->range rebase after the insert	2019-03-22 21:27:15 +03:00
kd-11	d731c07ade	vk: Fix typeless resource management - Fixes bugs that appear with high resolution scaling	2019-03-22 21:27:15 +03:00
kd-11	adc59f9810	rsx: Fix blit transfers when texel sizes mismatch - Also refactors some bpp handling code - Simplify texture intersection test to use a normalized/uniform coordinate space - Fix broken bounds checking as well	2019-03-22 21:27:15 +03:00
kd-11	b879b32271	rsx: Fix bpp calculation taking resolution scaling into account - Do not rely on image->width(), use surface_width() instead for unscaled values - Refactor/clean GL rendertarget class a bit	2019-03-20 10:05:54 +03:00
kd-11	03fca73cf4	rsx: Fix blit intersection falling outside the available texture - Just becaue we have a hit inside the tile of interest does not guarantee that it sits inside the texture!	2019-03-20 10:05:54 +03:00
RipleyTom	63bbe459ea	DS3 pad handler	2019-03-18 19:05:02 +03:00
kd-11	3ef16bee47	rsx: Fix texture lookups and avoid out-of-bounds copies/transfers	2019-03-17 21:50:11 +03:00
kd-11	bb65e45614	rsx: Implement GPU acceleration for rotated images	2019-03-17 21:50:11 +03:00
kd-11	5260f4b47d	rsx: Improvements to memory flush mechanism - Batch dma transfers whenever possible and do them in one go - vk: Always ensure that queued dma transfers are visible to the GPU before they are needed by the host Requires a little refactoring to allow proper communication of the commandbuffer state - vk: Code cleanup, the simplified mechanism makes it so that its not necessary to pass tons of args to methods - vk: Fixup - do not forcefully do dma transfers on sections in an invalidation zone! They may have been speculated correctly already	2019-03-17 21:50:11 +03:00
kd-11	385485204b	vk/gl: Omit unlocked data when grabbing flip sources from texture cache	2019-03-17 21:50:11 +03:00
kd-11	74eeacd091	vk/gl: Improve memory tag sync and test - Properly pass parameters such as rsx-pitch to the surface store - Do not crash if a surface fails verification in flip, use fall-back instead	2019-03-17 21:50:11 +03:00
kd-11	1a44446250	rsx: Fix dst upload block region - The section needed starts at image origin, not transfer origin!	2019-03-17 21:50:11 +03:00
kd-11	a49a0f2a86	vk/gl: Synchronization improvements - Properly wait for the buffer transfer operation to finish before map/readback! - Change vkFence to vkEvent which works more like a GL fence which is what is needed. - Implement supporting methods and functions - Do not destroy fence by immediately waiting after copying to dma buffer	2019-03-17 21:50:11 +03:00
kd-11	85cb703633	rsx/cache: Debugging bugs introduced by the atlas coverage check - Figured out why it breaks things, ofc can't actually check for coverage when there is no proper fbo data persistence	2019-03-17 21:50:11 +03:00
kd-11	3a4083263e	rsx: Fix texture transfer when pitch does not match exactly	2019-03-17 21:50:11 +03:00
kd-11	612160a8ff	rsx: Fix zero-pitch textures - Assumption here is that only texel (0, 0) is accessible. Inline with other pitch 0 operations. - TODO: Verify pitch 0 does not advance in Y either	2019-03-17 21:50:11 +03:00
kd-11	17c49d21a5	rsx/blit: Remove workarounds/hacks added for master. Start implementation/stubs for blit engine rotations in GPU	2019-03-17 21:50:11 +03:00
kd-11	745f8f9627	rsx: Remove pointless assert	2019-03-17 21:50:11 +03:00
Nekotekina	e9b6beadfc	SPU LLVM: implement static branch weights May help branch prediction in some cases	2019-03-13 21:14:55 +03:00
Nekotekina	388d49db80	SPU LLVM: fix SPU MMIO in TSX mode	2019-03-13 21:14:55 +03:00
Nekotekina	688aabc6c9	Add _sys_lwmutex_unlock2 syscall name	2019-03-12 23:55:13 +03:00
eladash	4a28319edf	Implement SPU page faults notifications * Implement both RawSPU and threaded SPU page fault recovery * Guard page_fault_notification_entries access with a mutex * Add missing lock in sys_ppu_thread_recover_page_fault/get_page_fault_context * Fix EINVAL check in sys_ppu_thread_recover_page_fault, previously when the event was not found begin() was erased and CELL_OK was returned. * Fixed page fault recovery waiting logic: - Do not rely on a single thread_ctrl notification (unsafe) - Avoided a race where ::awake(ppu) can be called before ::sleep(ppu) therefore nop-ing out the notification * Avoid inconsistencies with vm flags on page fault cause detection * Fix sys_mmapper_enable_page_fault_notification EBUSY check from RE it's allowed to register the same queue twice (on a different area) but not to enable page fault notifications twice	2019-03-12 13:28:31 +03:00
kd-11	1875dc3f18	gl: Fix buffer size calculations	2019-03-10 16:09:05 +03:00
kd-11	358558aaa7	cleanup and fixups	2019-03-10 16:09:05 +03:00
kd-11	04dda44225	rsx: Properly generate render target data with all parameters provided - Build-up to variable-sized framebuffers and AA implementation - Also allows accurate range calculation for our hit testing	2019-03-10 16:09:05 +03:00
kd-11	21bc6c7a87	rsx: Properly resolve data for upload when needed. - Avoids blindly reusing blit dst sections as they may contain garbage. If a section was unlocked for a flush, just discard it as its reuse introduces potential data corruption. Since the data needs to be reuploaded anyway (for now), its better to start afresh - In case of format mismatch, reset the calculated dst block - Add a bounds check to determine if data contained in an atlas is good enough for sampling the cache. If not enough data is provided, fall back to full upload	2019-03-10 16:09:05 +03:00
kd-11	9d4d3d9443	rsx: Reimplement render target intersection tests when using hw accelerated blit engine - Properly collapse memory tree when scanning in case of overlaps!	2019-03-10 16:09:05 +03:00
kd-11	f4ebcb0029	rsx: Properly decode packed renders from the type flag - Seems to occupy bits [8-9]	2019-03-10 16:09:05 +03:00
kd-11	7c379432dd	rsx: Implement proper pitch compatibility lookup - When a single row is required or is all that is available, pitch has no meaning as the coordinate space changed to 1D	2019-03-10 16:09:05 +03:00
kd-11	dccb4a4888	rsx/texture_cache: fixes to commit_framebuffer_memory	2019-03-10 16:09:05 +03:00
kd-11	b9e7b085fe	rsx/texture_cache: Fixups for local resource hit and fast-path added	2019-03-10 16:09:05 +03:00
kd-11	a80f1a6ed4	gl: Fix memory tag sampling - Also fixes a bad arg passed to glClearBuffer	2019-03-10 16:09:05 +03:00
kd-11	0395fb9955	rsx/tecture_cache: Addendum - fix data cast with scaling conversion (AA emulation) - Blit operations do format conversion automatically which is NOT what we want! - Scale onto temp buffer with similar format before performing data cast.	2019-03-10 16:09:05 +03:00
kd-11	10dc3dadee	rsx/texture_cache: Improve framebuffer memory locking when WCB/WDB is not enabled - Adds a new mode that removes non-framebuffer stuff inside framebuffer range	2019-03-10 16:09:05 +03:00
kd-11	563e205a72	rsx/texture_cache: Fix 'AA' scaling hack and restore collection template selection	2019-03-10 16:09:05 +03:00
kd-11	fa628f0ac4	rsx/surface_store: More aggressive tag sampling - Use a 5-point tap with an X pattern across the target's memory space to reduce chances of false positives - TODO: Potential false positives identified, requires some minor restructuring of surface_store	2019-03-10 16:09:05 +03:00
kd-11	3a071a9c07	rsx: Texture search rewrite - Perform a full search across all resource types as needed without taking too many shortcuts/hacks	2019-03-10 16:09:05 +03:00
kd-11	6ef9dcd62e	rsx: Handle mismatched/invalidated framebuffer sections when WCB is enabled	2019-03-10 16:09:05 +03:00
kd-11	ef071ebb6b	rsx: Synchronize surface cache and texture cache data - TODO: The whole upload_texture thing is a big hack, fix it properly	2019-03-10 16:09:05 +03:00
eladash	a43e7c172c	Fix shared memory page flags TODO: From hw testing, it seems like sys_memory_get_page_attribute and sys_rsx_context_iomap check page size a little differently get_page_attribute() always go by area flags, sys_rsx_context_iomap checks page by the page granularity This means that if the area page size 64k, but shared memory is mapped with SYS_MEMORY_GRANULARITY_1M It can be mapped for rsxio, but the page attribute will indicate 64k page size :thonk: rsxio memory is verified to need 1m pages.	2019-03-08 23:44:46 +03:00
eladash	7470388e5a	Use error_code in sys_rsx	2019-03-08 23:44:46 +03:00
eladash	26bcd0a4de	Small improvements to sys_event_flag - From RE, only protocols SYS_SYNC_FIFO and SYS_SYNC_PRIORITY are valid - Use conditional atomic op store in a few places - Properly revert changes in sys_event_flag_set when aomic op fails	2019-03-08 23:44:46 +03:00
elad	bd259c8ae4	vulkan zcull: Fix deadlock in zcull flush waiting Block adding additional flush requests until the first ones are treated (by adding missing lock)	2019-03-08 23:44:46 +03:00
elad	fc253165e2	Correctness fix for RSXIOMem - Make RSXIOMem volatile. - Hint the compiler to check only once the address returned.	2019-03-08 23:44:46 +03:00
elad	b7da3ea5cd	Release ppu thread before ShowSaveDataDialog, Fixes #4031	2019-03-08 23:44:46 +03:00
elad	ce8c92262d	Treat X8R8G8B8 format as A8R8G8B8 in image_in, Fixes #5510	2019-03-08 23:44:46 +03:00
elad	f272a5f779	sys_lwmutex fixup after #5680 sys_lwcond_wait unlocks always with the 'usual' unlocking flags	2019-03-08 23:44:46 +03:00
RipleyTom	61bd2ea799	Adds VID/PID for Guitar Hero guitar & drum	2019-03-08 17:52:48 +00:00
elad	3c9f03968c	Yield before flushing io buffers in fsync (sys_fs) (#5506 )	2019-03-08 16:07:14 +00:00
Nekotekina	4ea76def7c	Update sys_lwmutex_lock and sys_lwmutex_unlock (liblv2 HLE) Implement missing SYS_SYNC_RETRY logic Following #5680	2019-03-06 15:09:50 +03:00
Nekotekina	986c750fdc	VFS: fix sys_fs_opendir on root	2019-03-05 22:12:01 +03:00
eladash	e38b7aee5a	check address in sys_rsx_context_iomap * Fix 0 vm page flags to behave like 1m flags, follows c8a681e60 * check if address exists and valid for rsx io allcations (must be allocated on 1m pages)	2019-03-05 21:23:24 +03:00
eladash	d82362fa1d	Use sys_memory_allocate on rsx replayer to fix it	2019-03-05 21:23:24 +03:00
Nekotekina	fb64b28886	SPU LLVM: reintroduce branch patchpoints Previously only used on SPU ASMJIT, may improve perf in some cases. Now refactored to spu_runtime::make_branch_patchpoint.	2019-03-01 00:08:20 +03:00
Nekotekina	7f6a410770	Add dummy __has_builtin macro, use rotate builtins if possible	2019-03-01 00:08:19 +03:00
Nekotekina	765d15f23f	Optimize SPU trampolines Load values in EAX and reuse it if possible	2019-03-01 00:08:19 +03:00
Nekotekina	f143035af1	Fix sys_spu_thread_group_join wait condition After waiting, thread group cannot be safely accessed Following #5643	2019-03-01 00:08:19 +03:00
RipleyTom	de5379a69f	Static hle implementation	2019-02-27 22:54:59 +03:00
eladash	a22297f205	exception throwing fix in sys_lwmutex_create arg6 doesnt exist, if arg4 is not negative name is discarded and treated as 0.	2019-02-27 22:16:08 +03:00
eladash	d4459af4b3	Implement _sys_lwmutex_unlock (SYS_SYNC_RETRY mode)	2019-02-27 22:16:08 +03:00
RipleyTom	ad6b0ee122	Adds class type to controller options	2019-02-27 18:13:19 +00:00
Megamouse	b107718869	sysPrxForUser: improve crash dump functions this might fix some crashes that could appear in the todo logging itself	2019-02-26 21:53:59 +00:00
German	4c72f7c1de	Fix clear string container in CgBinaryFragmentProgram.cpp	2019-02-18 16:34:16 +03:00
German	3b9f9dd4c5	Fix true clear string container in GameInfo.h	2019-02-18 16:34:16 +03:00
German	45c31a99a3	Fix true clear string container in PPUModule.cpp	2019-02-18 16:34:16 +03:00
elad	63a9421634	Fix race in sys_lwcond_wait on error code	2019-02-16 21:41:59 +03:00
Megamouse	d4888a4973	cellSysCacheMount: don't return RET_OK_RELAYED on empty cacheId The system cache is supposed to be cleared but I don't think we wanna do that	2019-02-13 01:55:07 +03:00
Megamouse	4a1499e0be	cellMsgDialog: optionally make dialogs blocking and fix exit condition and apply review fixes	2019-02-12 21:06:10 +03:00
Megamouse	fe79e541dd	cellGame: improve exit functions	2019-02-12 21:06:10 +03:00
Megamouse	17a5e0bc98	cellGame: add error_code	2019-02-12 21:06:10 +03:00
eladash	d6995f40c7	Fixup for sys_lwcond_signal_x error checking	2019-02-11 01:13:29 +03:00
eladash	fa647bc121	Fix race condion in sys_spu_thread_group_join	2019-02-10 18:20:24 +03:00
eladash	84d42ecb65	Add EFAULT checks to spu_thread_group_join, ppu_thread_join Order of checks is based on firmware	2019-02-10 00:16:57 +03:00
eladash	0861226271	Make more use of the new atomic_t<>::release	2019-02-10 00:16:57 +03:00
eladash	e3ee481f01	Make sys_spu_thread_group_join return once per termination	2019-02-10 00:16:57 +03:00
kd-11	19ff95da70	vk: Fix usage of VK_IMAGE_LAYOUT_GENERAL - Properly synchronize when transitioning to/from GENERAL layout. - General layout requires full pipeline dependency since its used in a 'general' sense. As such, its use is to be largely avoided.	2019-02-07 11:40:17 +03:00
kd-11	38887bc03e	gl/vk: Improvements to overlay rendering - gl: Properly initialize and manage sampler states - gl/vk: Snap overlay elements to pixel grid by aligning to pixel centers - overlays: Disable grid snapping in stb since its now handled in the backend	2019-02-05 12:15:12 +03:00
kd-11	4c593959fd	overlays/save_dialog: Layout improvements - Make detail a separate text entity as it often contains a lot of noise - Properly pad the entry if needed to avoid text sitting too close to the edge	2019-02-03 22:26:46 +03:00
kd-11	67cdec577f	overlays/util: Add support for glyph set lowering when mapping utf8 to ascii8 - Lower fullwidth glyphs to halfwidth counterparts - Lower CJK punctuation glyphs - Lower general punctuation glyphs	2019-02-03 22:26:46 +03:00
kd-11	a36d3af3b4	vk: Minor frame management improvements	2019-02-02 11:54:01 +03:00
kd-11	27af05da1a	osk: Fixup attempt for hang in close callback where a sysutil_callback fails to fire.	2019-02-02 11:54:01 +03:00
kd-11	b36cb66129	overlays: Allow use of extended ascii8 - Use custom string conversion to ensure overlay deals with extended ascii whenever possible - Improves language compatibility greatly and avoids empty spaces for unknown glyphs	2019-02-02 11:54:01 +03:00
kd-11	12990f3ca3	overlays/util: Strip extended codes from utf-16 encoded strings	2019-02-02 11:54:01 +03:00
kd-11	9e39e2d2c4	gl/vk: Fix clip region scaling for overlay elements	2019-02-02 11:54:01 +03:00
kd-11	3653c2eb0d	overlays/osk: Add support for edit text control and disabled cells - Allows to disable cells from being selectable. - Edit text control adds proper support for multiline and a functioning caret	2019-02-02 11:54:01 +03:00
kd-11	faf5221b0d	overlays: Implement edit_text control	2019-02-02 11:54:01 +03:00
kd-11	c434e0ce27	overlays/osk: Add more buttons to native dialog and other improvements - Adds all the major buttons to native dialog input options - Adds more button options for the native osk - Brighten osk cell backgrounds a bit to improve visibility	2019-02-02 11:54:01 +03:00
kd-11	9ed9d7e947	overlays/osk: Implement native osk interface	2019-02-02 11:54:01 +03:00
kd-11	9d4b19b97a	vk: Increase number of draw calls per frame for overlays to 1024 - Allows for more complex interface design	2019-02-02 11:54:01 +03:00
kd-11	f47d3a761b	vk: Hotfix for fullscreen not working on non-windows platforms	2019-02-01 00:22:11 +03:00
Megamouse	27f6f497a2	use "config/custom_configs/" for custom configs (backwards compatible)	2019-01-31 20:14:52 +00:00
kd-11	09a8f7ae53	vk: Use FIFO mode for vsync - Avoids tearing and also hides some driver bugs causing fullscreen bugs with mailbox mode	2019-01-31 21:53:02 +03:00
kd-11	3bfa564ef8	vk/windows: Try to keep msq thread from ever stopping - NVIDIA drivers hook into the msq before our nativeEvent handler. This means NV is aware of events before rpcs3 is aware of them and sometimes stops until a new event is triggered. If rpcs3 is inside a driver call at this time, the system will deadlock since the driver waits for msq which waits for the renderer which waits for the driver. - Use explicit hook management to control window events - Add fence timeout to attempt detection of surface loss events	2019-01-31 21:53:02 +03:00
eladash	d4a24433e8	Fix DECR mode allocations (sys_memory)	2019-01-31 16:03:38 +03:00
Nekotekina	400718dfd9	cellSaveData: try to handle occasional failures Retry moving directory on FILE_ACCESS_ERROR	2019-01-31 01:08:30 +03:00
eladash	6f770c8e35	Fix potential crash in begin_occlusion_query() while closing the Emu	2019-01-30 18:44:29 +03:00
Nekotekina	58358e85dd	spu_runtime::add minor optimization Use preallocated vectors in trampoline generation subroutine	2019-01-29 03:32:16 +03:00
Nekotekina	2b66abaf10	Implement atomic_t<>::release More relaxed store with release memory order	2019-01-29 03:32:16 +03:00
Nekotekina	50922faac9	Remove SPUThread::jit_dispatcher Use global array - save memory Move the array to JIT memory	2019-01-29 03:32:16 +03:00
Nekotekina	4292997a01	Added jit_runtime class Is a memory manager for ASMJIT, replaces asmjit::JitRuntime Unified memory manager for ASMJIT and LLVM Unified SPU trampoline generation Remove previous workarounds	2019-01-29 03:32:16 +03:00
eladash	587fe421ee	Make ppu main_thread unjoinable	2019-01-25 18:04:33 +03:00
eladash	56b7581ade	Return error code in sys_ppu_thread_get_join_state	2019-01-25 18:04:33 +03:00
kd-11	660bfeabae	gl: Fixup - inline arrays	2019-01-25 14:34:22 +03:00
kd-11	fa9b448686	vk: Spec fixups - Disable DEPTH<->RGBA typeless transfers for now as they require a lot more work to work for all vendors - Do not allow switching layouts to UNDEFINED/PREINITIALIZED formats	2019-01-25 14:34:22 +03:00
kd-11	2163a59649	rsx: Typo fix	2019-01-25 14:34:22 +03:00
kd-11	521969bcc3	gl: Remove GL_R 'format'. There is no GL_R format, it part of the S-T-Q-R enums for texture coordinate space	2019-01-25 14:34:22 +03:00
kd-11	5a4bea8c4f	gl: Blit fixup - Typo fix. I meant to disable scissor test, not stencil test - Also clean up and simplify/optimize the core logic	2019-01-25 14:34:22 +03:00
kd-11	7e33cdcb08	rsx: simple_array<T> improvements - Implement move and copy ctors	2019-01-25 14:34:22 +03:00
kd-11	fb778e4821	rsx: Reimplement attrib divisor	2019-01-25 14:34:22 +03:00
kd-11	736415fcd9	rsx/fp: Detect broken/NOP shaders automatically - Do not compile body if the shader is of no consequence, leave as a passthrough shader	2019-01-25 14:34:22 +03:00
kd-11	6fdc0fd7f0	rsx: Reimplement MSAA transparency - Apply dither to edges that almost fail the straight-up alpha test - Significantly improves alpha tested geometry far from the camera - Also removes blend factor overrides/hacks as they give incorrect results due to background bleeding	2019-01-25 14:34:22 +03:00
kd-11	10a17feda2	rsx: Avoid potential deadlock in FIFO_ctrl	2019-01-25 14:34:22 +03:00
kd-11	7eec702c6d	gl: Fix silly regression with blit dst resource readback	2019-01-25 14:34:22 +03:00
kd-11	8093c9b573	rsx: Disable rtt side-effects when async compilation is ongoing. Only real renders should promote buffer state from underined to drawn, otherwise keep previous contents intact.	2019-01-25 14:34:22 +03:00
kd-11	417a2e6731	rsx: Refactor index buffers - Index offset is ignored anyway and only used to calculate vertex attribute divisor index - Specialized optimization for untouched xfer without primitive restart	2019-01-25 14:34:22 +03:00
elad	afeacc171f	Fix spurious abort in sys_rwlock_tryrlock and sys_semaphore_trywait (#5579 ) Use full cmpxchg loop to prevent occasional return of CELL_EBUSY	2019-01-22 23:10:17 +03:00
Nekotekina	4f152ad126	SPU: multithread compilation Allow parallel compilation of SPU code, both at startup and runtime Remove 'SPU Shared Runtime' option (it became obsolete) Refactor spu_runtime class (now is common for ASMJIT and LLVM) Implement SPU ubertrampoline generation in raw assembly (LLVM) Minor improvement of balanced_wait_until<> and balanced_awaken<> Make JIT MemoryManager2 shared (global) Fix wrong assertion in cond_variable	2019-01-22 22:02:02 +03:00
Megamouse	8d5d44141e	rsx/Qt: fix some undefined behavior in progress_dialog CallAfters	2019-01-22 12:04:01 +03:00
eladash	688d5a9919	rsx: Fix unknown vertex base types Clamp vertex type field into 3-bits instead of 4-bit value Case 0 is UB256	2019-01-21 22:28:20 +03:00
Nekotekina	d4591b1508	ALSA: disable recovery (experimental)	2019-01-18 16:49:17 +03:00
Nekotekina	59e0296281	cellMsgDialog: fix error spam on CELL_OK	2019-01-18 16:49:17 +03:00
Nekotekina	cc430769c6	Rollback audio backend priority	2019-01-18 16:49:17 +03:00
eladash	a11d76249d	Patch ppu main thread prio	2019-01-17 21:58:09 +03:00
elad	fc92ae4085	SPU/PPU atomics performance and LR event fixes (#5435 ) * Fix SPU LR event setting in atomic commands according to hw test * MFC: increment timestamp for PUT cmd in non-tsx path * MFC: fix reservation lost test on non-tsx path in regard to the lock bit * Reservation notification moved out of writer_lock scope to reduce its lifetime * Use passive_lock/unlock in ppu atomic inctrustions to reduce redundancy * Lock only once for dma transfers (non-TSX) * Don't use RDTSC in reservation update logic * Remove MFC cmd args passing to process_mfc_cmd * Reorder check_state cpu_flag::memory check for faster unlocking * Specialization for 128-byte data copy in SPU dma transfers * Implement memory range locks and isolate PPU and SPU passive lock logic	2019-01-15 18:31:21 +03:00
eladash	f19fd23227	spu: Fix support for multiple lists when one is stalled	2019-01-15 02:33:22 +03:00
Megamouse	58a22d1461	cellSaveData: add error_code	2019-01-14 21:12:13 +01:00
Nekotekina	c5026f7109	cellVdec: fix minor race	2019-01-14 01:24:05 +03:00
Nekotekina	a419e98acb	Move PPU and shader cache New hash-based location (already used for SPU) Bump PPU cache version, improve naming and decrease size Remove fs::get_data_dir Disable boot.elf cache	2019-01-14 01:24:05 +03:00
Nekotekina	aefee04c4a	SPU analyser: fix branch to self Fixed not filling the predeccessor list on BR-to-self on entry point Version bumped (v1-tane) Closes #5353	2019-01-14 00:01:27 +03:00
Nekotekina	74d684b57e	cellAudio: fix template arg style Add constexpr if	2019-01-14 00:01:27 +03:00
Nekotekina	435f60d503	lf_queue: add iterator support Allow range-for loop over an object returned by `pop_all()`	2019-01-13 14:45:36 +03:00
Nekotekina	cfdf50dcff	SPU: ensure sys_spu_thread_group_join receives correct exit status Following #5334	2019-01-13 14:45:36 +03:00
Nekotekina	453344c232	cellSaveData: workaround possible issues with symlinks Don't use ../ location for temporary directories	2019-01-13 14:45:36 +03:00
Nekotekina	bd9131ae1c	Implement fs::get_cache_dir Win32: equal to config dir for now Linux: respect XDG_CACHE_HOME if specified OSX: possibly incomplete	2019-01-13 14:45:36 +03:00
eladash	bc27f5f75c	Implement invalid NV4097_NOTIFY context handling	2019-01-13 12:59:00 +03:00
Megamouse	022550a43b	cellOskDialog: use atomic_op for state operations	2019-01-12 23:39:01 +01:00
Megamouse	d7cc97433d	cellOskDialogUnloadAsync: guarantee 0 terminated return string	2019-01-12 23:39:01 +01:00
Megamouse	fce9f352a9	cellOskDialog: fix cellOskDialogAbort	2019-01-12 23:39:01 +01:00
Megamouse	f9c1b15bf4	cellOskDialog: fix cellOskDialogUnloadAsync return string fixes Class of Heroes 2G	2019-01-12 23:39:01 +01:00
Rui Pinheiro	3406acd8c9	Fixups for audio PR	2019-01-12 22:22:03 +03:00
Rui Pinheiro	49fbf9bf0f	Tweaks to buffering algorithm Increase untouched buffer timeout when some of the buffers have been touched. Might improve audio quality on games that suffered from miniscule popping even when buffering was enabled (such as DeS). In addition, made time stretching algorithm slightly more aggressive. Includes some other tiny tweaks as well.	2019-01-12 21:29:56 +03:00
Rui Pinheiro	1e4513e2e3	Fixups in audio backend Removes 's_' prefix from variables that are no longer static and thread_local. Removes superfluous comments left behind due to copy-paste mistakes.	2019-01-12 21:29:56 +03:00
Rui Pinheiro	fe9062671e	Change audio tooltips, audio backend order	2019-01-12 21:29:56 +03:00
Rui Pinheiro	48db0430d4	Misc. Tweaks	2019-01-12 21:29:56 +03:00
Rui Pinheiro	650bc0c1f2	Fix game pausing/unpausing	2019-01-12 21:29:56 +03:00
Rui Pinheiro	f17f984721	Add timeout for untouched buffers	2019-01-12 21:29:56 +03:00
Rui Pinheiro	8f6043b568	Change cellAudio diagnostic messages to Trace	2019-01-12 21:29:56 +03:00
Rui Pinheiro	67f9397746	Various fixes In addition, linux builds (and ALSA/PA) now work again	2019-01-12 21:29:56 +03:00
Rui Pinheiro	4f39457858	Rewrite OpenAL backend to support new features	2019-01-12 21:29:56 +03:00
Rui Pinheiro	892deb1552	Implement basic time stretching + Tweaks	2019-01-12 21:29:56 +03:00
Rui Pinheiro	5159d3559e	Implement Audio Backend Capabilities querying Also renames "AudioThread" to "AudioBackend". The new name is more descriptive of what the class really is responsible for, since the backends are not responsible for managing the audio thread. NOTE: Right now only XAudio2 is supported	2019-01-12 21:29:56 +03:00
Rui Pinheiro	2addbe6be2	Implement basic cellAudio buffering	2019-01-12 21:29:56 +03:00
Rui Pinheiro	56962aa707	Disable OpenAL backend temporarily	2019-01-12 21:29:56 +03:00
Megamouse	e18e9909af	cellOsk fixup	2019-01-12 08:26:04 +01:00
RipleyTom	fad80ed443	revert part of #5529	2019-01-12 06:59:07 +01:00
Megamouse	e3ea29599d	cellGame: fix some installation issues fixes HAWX2 (at least until it crashes again)	2019-01-11 03:36:22 +03:00
Megamouse	d9d5f45e9e	rsx/input: fix rsx replay	2019-01-10 13:05:48 +01:00
Megamouse	e5aede7aa7	cellOskDialog: initial code for cellOskDialogSetSeparateWindowOption	2019-01-10 13:05:48 +01:00
Megamouse	17058113df	cellOskDialog: add multi-line option and handle more permutations (WIP)	2019-01-10 13:05:48 +01:00
Megamouse	4a8b30c625	cellOskDialog: cellOskDialogExtRegisterConfirmWordFilterCallback	2019-01-10 13:05:48 +01:00
Megamouse	e0ac244fed	split MsgDialogBase	2019-01-10 13:05:48 +01:00
Megamouse	d5303b0b64	add error_code to cellOskDialog and cellMsgDialog	2019-01-10 13:05:48 +01:00
Megamouse	7cc4239cc2	cellOskDialog: add message	2019-01-10 13:05:48 +01:00
Megamouse	cc30b4e5be	cellOskDialog: don't send input signals without seperate screen enabled	2019-01-10 13:05:48 +01:00
Megamouse	16f2975792	cellOskDialog: properly handle dialog states to improve param checks	2019-01-10 13:05:48 +01:00
kd-11	52ac0a901a	rsx: improve memory coherency - Avoid tagging and rely on read/write barriers and the dirty flag mechanism. Testing is done with a weak 8-byte memory test - Introducing new data when tagging breaks applications with race conditions where tags can overwrite flushed data	2019-01-06 10:44:40 +03:00
kd-11	89c9c54743	rsx: Minor hot-fix - Pitch 0 makes sense if width == 1 and height == 1	2019-01-06 10:44:40 +03:00
kd-11	95245bdd83	rsx: Improve ARGB8->D24S8 casting - Set up partial transfers - Force clear of target before starting the transfer	2019-01-06 10:44:40 +03:00
kd-11	475cc99117	rsx: Fix dirty flag reset after a partial attachment initialization - D24S8 targets have 2 aspects that are dealt with separately; Forcefully initialize the remaining data if a partial init is done. Its 'free' anyway - It seems that the stencil mask matters when clearing unlike the depth mask and color mask	2019-01-06 10:44:40 +03:00
kd-11	c80c7f06bb	rsx: Typo fix - This silly typo broke the flip improvements in the GT fixes PR	2019-01-06 10:44:40 +03:00
kd-11	2a62fa892b	rsx: Texture cache refactor - gl: Include an execution state wrapper to ensure state changes are consistent. Also removes a lot of required 'cleanup' for helper methods - texture_cache: Make execition context a mandatory field as it is required for all operations. Also removes a lot of situations where duplicate argument is added in for both fixed and vararg fields - Explicit read/write barrier for framebuffer resources depending on usage. Allows for operations like optional memory initialization before reading	2019-01-06 10:44:40 +03:00
kd-11	0f64583c7a	rsx: Reimplement pitch lookup - Remove the required_xxx_pitch constraint as it makes no sense. The pitch controls what can be written per line. - It is possible to have a huge surface width but only render to a small region at the beginning and have a smaller pitch than can fit the surface (NFS carbon)	2019-01-06 10:44:40 +03:00
kd-11	1ffadbe086	rsx: Reorganize write barrier implementation (either clear or memory barrier)	2019-01-06 10:44:40 +03:00
kd-11	9c45ce6d37	vk: Reimplement typeless memory allocation to handle resolution upscaling	2019-01-06 10:44:40 +03:00

... 5 6 7 8 9 ...

5968 commits