rpcsx

mirror of https://github.com/RPCSX/rpcsx.git synced 2025-12-06 07:12:14 +01:00

Author	SHA1	Message	Date
kd-11	9a2d4fe46b	rsx: Relocatable transform constants	2022-03-26 16:10:18 +03:00
RipleyTom	a4d715e25d	Warning Fixes	2022-03-23 19:35:10 +01:00
Nekotekina	0db9850a73	Add loop building utilities for ASMJIT Refactor copy_data_swap_u32 a bit	2022-01-25 03:16:37 +03:00
Nekotekina	12c83b340d	Remove built_function With today's branch prediction techniques, it's hardly useful.	2022-01-24 22:21:41 +03:00
Nekotekina	4704367382	Remove unnecessary asmjit::imm_ptr	2022-01-18 00:10:32 +03:00
Nekotekina	14cca55b50	PPU: refactor vector rounding instructions Fix: nearbyint -> roundeven	2022-01-18 00:10:32 +03:00
Nekotekina	580bd2b25e	Initial Linux Aarch64 support * Update asmjit dependency (aarch64 branch) * Disable USE_DISCORD_RPC by default * Dump some JIT objects in rpcs3 cache dir * Add SIGILL handler for all platforms * Fix resetting zeroing denormals in thread pool * Refactor most v128:: utils into global gv_** functions * Refactor PPU interpreter (incomplete), remove "precise" * - Instruction specializations with multiple accuracy flags * - Adjust calling convention for speed * - Removed precise/fast setting, replaced with static * - Started refactoring interpreters for building at runtime JIT * (I got tired of poor compiler optimizations) * - Expose some accuracy settings (SAT, NJ, VNAN, FPCC) * - Add exec_bytes PPU thread variable (akin to cycle count) * PPU LLVM: fix VCTUXS+VCTSXS instruction NaN results * SPU interpreter: remove "precise" for now (extremely non-portable) * - As with PPU, settings changed to static/dynamic for interpreters. * - Precise options will be implemented later * Fix termination after fatal error dialog	2022-01-15 06:48:04 +03:00
Nekotekina	cb2748ae08	Update ASMJIT (new upstream API)	2021-12-29 02:45:00 +03:00
Nekotekina	d836033212	LLVM: enable some JIT events (Intel, Perf) Made some related adjustments. Currently incomplete.	2021-12-26 16:41:37 +03:00
Nekotekina	8b4b6ba946	copy_data_swap_u32: build AVX-512 path	2021-12-26 14:40:21 +03:00
Nekotekina	599e00d6da	BufferUtils: remove dead code (vertex streaming) RIP. It won't be useful.	2021-12-26 14:40:21 +03:00
Nekotekina	3cd8891ab8	Re-refactor copy_data_swap_u32 again Drop AVX2 path for now, since it usually operates on small data. Rely on automatic SSE vectorization on recent compilers. Side refactoring on JIT.h to workaround weird conflict issue.	2021-12-26 14:40:21 +03:00
Nekotekina	262ff01619	Use aligned stores in write_index_array_data_to_buffer Ensure that target buffer is cache line aligned. Improve stx::make_single to support alignment.	2021-12-21 23:28:09 +03:00
Nekotekina	76ccaf5e6f	BufferUtils: refactoring Optimize CPU capability tests for arch-tuned builds. Separate streaming and non-streaming utilities. Rewritten copy_data_swap_u32(_cmp) with AVX2 path.	2021-12-21 23:28:09 +03:00
Nekotekina	a1608b636f	span: implement as_span workarounds as utils::bless Minor cleanup.	2021-05-31 15:46:34 +03:00
Ani	a49446c9e9	Replace gsl::span for std::span (c++20) (#7531 ) * Replace gsl::span for std::span (c++20) * Replace gsl::byte with std::byte Co-authored-by: Bevan Weiss <bevan.weiss@gmail.com>	2021-05-30 17:10:46 +03:00
Nekotekina	c22e1e71f0	Continue fixing strict aliasing warnings	2021-03-13 18:02:37 +03:00
Nekotekina	9cbe77904d	Revert changes in BufferUtils.cpp Should fix #9933	2021-03-09 19:19:24 +03:00
Nekotekina	a4fdbf0a88	Enable -Wstrict-aliasing=1 (GCC) Fixed partially.	2021-03-09 03:10:15 +03:00
Nekotekina	8e6e57de86	Enable -Wunused-function warning	2021-02-15 14:39:53 +03:00
Nekotekina	0af452720e	RSX: Fix possible bug in memory streaming utils	2021-01-12 15:06:31 +03:00
Nekotekina	db8e6fe7a7	Enable -Wunused-variable	2021-01-12 14:34:14 +03:00
Nekotekina	bd269bccaf	types.hpp: remove intrinsic includes Replace v128 with u128 in some places. Removed some unused files.	2020-12-21 21:11:25 +03:00
Nekotekina	db9b7db531	Cleanup and move sysinfo.h -> util/sysinfo.hpp	2020-12-18 12:55:54 +03:00
Nekotekina	e321765c54	Split BEType.h to util/v128.hpp and util/to_endian.hpp	2020-12-13 16:34:45 +03:00
Nekotekina	6e05dcadb6	Reduce std::numeric_limits dependency Please, stop pretending... You need these templates for generic code. In other words, in another templates. Stop increasing compilation time for no reason.	2020-12-12 12:35:18 +03:00
Nekotekina	36c8654fb8	Remove HERE macro Some cleanup. Add location to some functions.	2020-12-10 12:30:22 +03:00
Nekotekina	5d934c8759	Improve narrow() and size32() with src_loc detection	2020-12-09 16:26:20 +03:00
Nekotekina	e055d16b2c	Replace verify() with ensure() with auto src location. Expression ensure(x) returns x. Using comma operator removed.	2020-12-09 15:43:38 +03:00
RipleyTom	af8c661a64	Remove BOM markers	2020-12-06 15:30:12 +03:00
Eladash	36fd1d0f0d	rsx: Optimize transform constants load methods (#7992 )	2020-04-09 15:53:43 +03:00
Eladash	d97e9f7b4a	rsx: Batch vertex program load methods	2020-04-02 20:42:12 +03:00
Nekotekina Aux1	250736ece5	Fix warnings in emucore	2020-03-04 21:23:34 +03:00
Nekotekina	5e75a0c497	Disable cotire on travis Make some workarounds for clang because it poorly supports -Wold-style-cast	2020-02-21 17:03:54 +03:00
Nekotekina	c0f80cfe7a	Use attributes for LIKELY/UNLIKELY Remove LIKELY/UNLIKELY macro.	2020-02-05 10:42:34 +03:00
Nekotekina	185c067d5b	C-style cast cleanup V	2019-12-03 17:23:00 +03:00
kd-11	8ca53f9c84	rsx: Remember to min-max the anchor indices of a polygon or triangle fan	2019-11-24 19:01:57 +03:00
Emmanuel Gil Peyrot	f76720ceb0	Remove extraneous ::narrow<int>() calls GSL’s gsl::span didn’t use the correct type for its index_type, which is why they were needed.	2019-11-09 19:30:06 +01:00
Emmanuel Gil Peyrot	ef368c5171	rsx: Replace gsl::byte with C++17’s std::byte	2019-11-09 19:30:05 +01:00
Nekotekina	e3e7051ed3	Minor optimization in BufferUtils.cpp Don't use PSHUFB for horizontal operations. Utilize PHMINPOSUW to compute max as well: + sse41_hmin_epu16 + sse41_hmax_epu16	2019-10-30 18:52:34 +03:00
Nekotekina	b1968769b7	Minor cleanup in BufferUtils.cpp Replace inline asm with intrinsic using target attribute trick.	2019-10-30 17:53:51 +03:00
linkmauve	cfd5cf6bdb	Optimise primitive_restart::upload_untouched() (#6881 ) * rsx: Optimise primitive_restart::upload_untouched() with SSE4.1 This optimisation is only applied when skip_restart is false. I’ve only tested the u16 codepath, as it is the one used in NieR. In some very unscientific profiling, this function used to take 2.76% of the total frame time at the save point of the port town, it now takes about 0.40%. * rsx: Mark all SSE4.1 functions with attributes on gcc and clang This assures the compiler we will take care of only calling these functions after having checked that the CPU does support these instructions. * rsx: Add an AVX2 implementation of primitive restart ibo upload * rsx: Remove redefinition of SSE4.1 instructions Now that clang is aware that our functions are compiled with SSE4.1, it lets us generate this code using its intrinsics. * rsx: Optimise vector to scalar conversion This is done using minpos and srli intrinsics and generate less code than before. Thanks Nekotekina for the suggestion!	2019-10-30 16:42:44 +03:00
kd-11	c59cb1bdd3	rsx: Allow only sse4.1 capable CPUs to take the accelerated index path - Older sets lack the required min/max functionality	2019-09-13 12:28:52 +03:00
kd-11	60845daf45	rsx: Improve use of CPU vector extensions - Allow use of intrinsics when SSSE3 and SSSE4.1 are not available in the build target environment - Properly separate SSE4.1 code from SSSE3 code for some older proceessors without SSE4.1	2019-09-12 14:08:21 +03:00
kd-11	652f18ebaa	rsx/buffers: Warnings cleanup	2019-09-01 18:59:50 +03:00
JohnHolmesII	ebb1ae6408	Properly ignore SIMD macros to avoid warning	2019-06-28 01:40:52 +03:00
kd-11	358169507c	rsx: Use SSE to accelerate index buffer uploads	2019-06-25 20:50:54 +03:00
Lassi Hämäläinen	c963c51a60	Remove unnecessary header includes - Manually removed lot of unneeded #includes to clean code and reduce compilation time - Reordered some of the #includes to be in more logical order	2019-06-25 17:11:10 +03:00
scribam	db926ee671	rsx: Apply Clang-Tidy fix "performance-unnecessary-value-param"	2019-06-12 15:11:52 +03:00
Nekotekina	dfd50d0185	Implement std::bit_cast<> Partial implementation of std::bit_cast from C++20. Also fix most strict-aliasing rule break warnings (gcc).	2019-06-02 23:22:16 +03:00

1 2 3

119 commits