dynarmic

mirror of https://github.com/azahar-emu/dynarmic synced 2025-11-13 02:20:00 +01:00

Author	SHA1	Message	Date
merry	64adc91ca2	emit_x64_memory: Move EmitFastmemVAddr to common file	2022-03-26 16:49:14 +00:00
merry	18f02e2088	emit_x64_memory: Move EmitVAddrLookup to common file	2022-03-26 16:46:06 +00:00
merry	3d657c450a	emit_x64_memory: Share EmitDetectMisalignedVAddr	2022-03-26 16:09:56 +00:00
merry	fb586604b4	emit_x64_memory: Share constants	2022-03-26 16:05:03 +00:00
merry	5cf2d59913	A32: Add AccType information and propagate to IR-level	2022-03-26 15:38:10 +00:00
merry	614ecb7020	A64: Propagate AccType information to IR-level	2022-03-26 15:38:10 +00:00
merry	879f211686	ir/value: Add AccType to Value	2022-03-26 15:38:10 +00:00
Alexandre Bouvier	9d369436d8	cmake: Fix unicorn and llvm	2022-03-22 20:27:01 +00:00
merry	c78b82dd2c	vfp: VLDM is UNPREDICTABLE when n is R15 in thumb mode	2022-03-20 20:52:11 +00:00
Sergi Granell	0ec4a23710	thumb32: Implement LDA and STL Note that those are ARMv8 additions to the Thumb instruction set.	2022-03-20 20:16:27 +00:00
merry	e1a266b929	A32: Implement SHA256SU1	2022-03-20 13:59:18 +00:00
merry	ab4c6cfefb	A32: Implement SHA256SU0	2022-03-20 13:59:18 +00:00
merry	c022a778d6	A32: Implement SHA256H, SHA256H2	2022-03-20 13:59:18 +00:00
merry	bb713194a0	backend/x64: Implement SHA256 polyfills	2022-03-20 13:59:18 +00:00
merry	98cff8dd0d	IR: Implement SHA256MessageSchedule{0,1}	2022-03-20 13:59:18 +00:00
merry	f0a4bf1f6a	IR: Implement SHA256Hash	2022-03-20 13:59:18 +00:00
merry	a4daad6336	block_of_code: Add HostFeature SHA	2022-03-20 00:13:03 +00:00
Merry	bcfe377aaa	x64/reg_alloc: More zero extension paranoia	2022-03-06 12:24:50 +00:00
Merry	316b95bb3f	{a32,a64}_emit_x64_memory: Zero extension paranoia	2022-03-06 12:10:40 +00:00
Merry	0fd32c5fa4	a64_emit_x64_memory: Fix bug in 128 bit exclusive write fallback	2022-02-28 19:53:43 +00:00
merry	5ea2b49ef0	backend/x64: Inline exclusive memory access operations (#664) * a64_emit_x64_memory: Add Unsafe_IgnoreGlobalMonitor optimization * a32_emit_x64_memory: Add Unsafe_IgnoreGlobalMonitor optimization * a32_emit_x64_memory: Remove dead code * {a32,a64}_emit_x64_memory: Also verify vaddr in Exclusive{Read,Write}MemoryInlineUnsafe * a64_emit_x64_memory: Full fallback for ExclusiveWriteMemoryInlineUnsafe * a64_emit_x64_memory: Inline full locking * a64_emit_x64_memory: Allow inlined locking to be optionally removed * spin_lock: Use xbyak instead of inline asm * a64_emit_x64_memory: Recompile on exclusive fastmem failure * Avoid variable shadowing * a32_emit_x64_memory: Implement recompilation * Fix recompilation * spin_lock: Clang format fix * fix fallback function calls	2022-02-28 08:13:10 +00:00
merry	0a11e79b55	backend/x64: Ensure all HostCalls are appropriately zero-extended	2022-02-27 20:04:44 +00:00
merry	6c4fa780e0	{a32,a64}_emit_x64_memory: Ensure return value of fastmem callback are zero-extended	2022-02-27 19:58:23 +00:00
merry	593de127d2	a64_emit_x64: Clear fastmem patch information on ClearCache	2022-02-27 19:50:05 +00:00
Merry	c90173151e	backend/x64: Split off memory emitters	2022-02-26 21:25:09 +00:00
Merry	19a423034e	block_of_code: Fix inaccurate size reporting in SpaceRemaining Typo: getCode should be getCurr: Instead of comparing against the current pointer, we were incorrectly comparing against the start of memory	2022-02-26 16:09:11 +00:00
Merry	ea08a389b4	emit_x64_floating_point: EmitFPToFixed: No need to round if rounding_mode == TowardsZero cvttsd2si truncates during operation	2022-02-23 20:44:02 +00:00
merry	b34214f953	emit_x64_floating_point: Improve EmitFPToFixed codegen	2022-02-23 19:42:15 +00:00
merry	5fe274f510	emit_x64_floating_point: Deinterlace 64-bit FPToFixed signed/unsigned codepaths	2022-02-23 19:14:41 +00:00
merry	b8dd1c7510	emit_x64_floating_point: Correct dead-code warning in MSVC 2019	2022-02-12 22:07:26 +00:00
merry	95a1ebfb97	backend/x64: Bugfix: A32 frontend also uses FPSCR.QC	2022-02-12 21:46:45 +00:00
Fernando Sahmkow	a8cbfd9af4	X86_Backend: set fences correctly for memory barriers and synchronization.	2022-02-01 14:27:54 +00:00
liushuyu	40afbe1927	disassembler_thumb: fix formatting issues with fmt 8.1.x fmt 8.1.0 added more formatting checks and Cond can't be formatted directly now	2022-01-05 21:49:51 -07:00
Wunkolo	ad5465d6ce	constant_pool: Use `tsl::robin_map` rather than `unordered_map` Finding a much more drastic improvement with `robin_map`. `map`: ``` [master] % hyperfine -r 100 "./dynarmic_tests --durations yes" Benchmark 1: ./dynarmic_tests --durations yes Time (mean ± σ): 567.0 ms ± 6.9 ms [User: 513.1 ms, System: 53.2 ms] Range (min … max): 554.4 ms … 588.1 ms 100 runs ``` `unordered_map`: ``` [opt_const_pool] % hyperfine -r 100 "./dynarmic_tests --durations yes" Benchmark 1: ./dynarmic_tests --durations yes Time (mean ± σ): 561.1 ms ± 4.5 ms [User: 508.1 ms, System: 52.3 ms] Range (min … max): 552.6 ms … 574.2 ms 100 runs ``` `tsl::robin_map`: ``` [opt_const_pool] % hyperfine -r 100 "./dynarmic_tests --durations yes" Benchmark 1: ./dynarmic_tests --durations yes Time (mean ± σ): 553.5 ms ± 5.6 ms [User: 500.7 ms, System: 52.1 ms] Range (min … max): 545.7 ms … 569.3 ms 100 runs ```	2022-01-01 12:13:13 +00:00
Wunkolo	e57bb0569a	constant_pool: Convert hashtype from `tuple` to `pair`	2022-01-01 12:13:13 +00:00
Wunkolo	befc22a61e	constant_pool: Use `unordered_map` rather than `map` `map` is an ordered structure with O(log n) time searches. `unordered_map` uses O(1) average-time searches and O(n) in the worst case, where a bucket holds colliding hashes and has to start chaining. The unordered version should speed up our general case when looking up constants. I've added a trivial order-dependent (_(0,1) and (1,0) will return a different hash_) hash to combine a 128-bit constant into a 64-bit hash that generally will not collide, using a bit-rotate to preserve entropy.	2022-01-01 12:13:13 +00:00
Morph	28714ee75a	general: Rename files with duplicate names In MSVC, having files with identical filenames results in massive slowdowns when compiling. The approach I have taken to resolve this is renaming the identically named files in frontend/(A32, A64) to (a32, a64)_filename.cpp/h	2021-12-23 11:38:58 +00:00
Andrea Pappacoda	4dcebc1822	build(cmake): add install target This makes dynarmic installable, and also adds a CMake package config file, that allows projects to use `find_package(dynarmic)` to import the library. I know #636 adds the same thing, but while experimenting with the different install options in https://github.com/merryhime/dynarmic/pull/636#discussion_r725656034 I ended up with a working patch, so I'm proposing this as well. This implements solution 2.	2021-10-30 19:03:23 +01:00
Andrea Pappacoda	b87a889d98	build(cmake): add version and soversion to the library This adds versioning information to the built library. When building the shared library on Linux systems, a new object will be created: libdynarmic.so.5 This is really useful when talking about ABI compatibility. The variables dynarmic_VERSION and dynarmic_VERSION_MAJOR are implicitly created when calling project(dynarmic VERSION x.y.z)	2021-10-11 06:53:05 +01:00
Fernando S	e4146ec3a1	x64 Interface: Allow for asynchronous invalidation (#647) * x64 Interface: Make Invalidation asynchronous. * Apply suggestions from code review	2021-10-05 15:06:41 +01:00
Wunkolo	5e7d2afe0f	IR: Introduce `VectorReduceAdd{8,16,32,64}` opcode Adds all elements of vector and puts the result into the lowest element. Accelerates the `addv` instruction into a vectorized implementation rather than a serial one.	2021-09-27 19:54:11 +01:00
Marshall Mohror	0b8fd755d8	Fix `signal_stack_size` for glibc 2.34 `SIGSTKSZ` is now defined as `sysconf(_SC_SIGSTKSZ)` which is not constexpr, and returns a long which throws off the `std::max` template deduction.	2021-09-22 20:38:11 +01:00
Ben	6ce8bfaf32	Add API function to retrieve disassembly as vector of strings (#644) Co-authored-by: ben <Avuxo@users.noreply.github.com>	2021-09-16 16:45:20 -04:00
Merry	517e35f845	decoder_detail: Avoid MSVC ICE MSVC has an internal compiler error when assume is present in this constexpr function	2021-08-15 19:32:05 +01:00
Merry	2e4f99ae3d	CMakeLists: Expose DYNARMIC_IGNORE_ASSERTS option	2021-08-15 16:09:37 +01:00
Merry	4988d9fab3	disassembler_arm: Fix format strings for vfp_VMOV_from_i{8,16}	2021-08-15 15:16:53 +01:00
Merry	615ce8c7c5	IR: Remove A32 IR instructions Get{N,Z,V}Flag	2021-08-12 13:06:15 +01:00
Wunkolo	1e94acff66	ir: Add VectorBroadcastElement{Lower} IR instruction The lane-splatting variant of `FMUL` and `FMLA` is very common in instruction streams when implementing things like matrix multiplication. When used, they are used very densely. https://community.arm.com/developer/ip-products/processors/b/processors-ip-blog/posts/coding-for-neon---part-3-matrix-multiplication The way this is currently implemented is by grabbing the particular lane into a general purpose register and then broadcasting it into a simd register through `VectorGetElement` and `VectorBroadcast`. ```cpp const IR::U128 operand2 = v.ir.VectorBroadcast(esize, v.ir.VectorGetElement(esize, v.V(idxdsize, Vm), index)); ``` What could be done instead is to keep it within the vector-register and use a permute/shuffle to "splat" the particular lane across all other lanes, removing the GPR-round-trip. This is implemented as the new IR instruction `VectorBroadcastElement`: ```cpp const IR::U128 operand2 = v.ir.VectorBroadcastElement(esize, v.V(idxdsize, Vm), index); ```	2021-08-07 23:03:57 +01:00
Wunkolo	46b8cfabc0	bit_util: Protect Replicate from automatic up-casting Recursive calls to `Replicate` beyond the first call might cause an unintentional up-casting to an `int` type due to `|` and `<<` operations on types such as `uint8_t` and `uint16_t`. This makes sure calls such as `Recursive<u8>` stay as the `u8` type throughout.	2021-08-07 23:03:57 +01:00
Merry	d41bc492fe	{a32,a64}_jitstate: Remove unnecessary headers	2021-08-07 19:35:33 +01:00
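
The order-dependent hash described in commit befc22a61e can be illustrated with a minimal sketch. This is not the code from that commit: the names `ConstantKey`, `ConstantHash`, and `RotateLeft64`, and the rotate amount of 1, are assumptions chosen for illustration. It only shows how rotating one 64-bit half before combining makes (0,1) and (1,0) hash differently.

```cpp
#include <cstddef>
#include <cstdint>
#include <utility>

// Hypothetical key type: a 128-bit constant stored as (lower, upper) halves.
using ConstantKey = std::pair<std::uint64_t, std::uint64_t>;

// Rotate left by `amount` bits; the amount is masked so the shifts stay defined.
constexpr std::uint64_t RotateLeft64(std::uint64_t value, unsigned amount) {
    amount &= 63;
    return amount == 0 ? value : (value << amount) | (value >> (64 - amount));
}

// Hypothetical hasher: XOR alone would be symmetric in the two halves,
// so the second half is rotated first to make the result order-dependent.
struct ConstantHash {
    constexpr std::size_t operator()(const ConstantKey& key) const noexcept {
        return static_cast<std::size_t>(key.first ^ RotateLeft64(key.second, 1));
    }
};

// Swapping the halves changes the hash.
static_assert(ConstantHash{}(ConstantKey{0, 1}) != ConstantHash{}(ConstantKey{1, 0}),
              "hash must depend on the order of the halves");
```

A hasher of this shape could be passed as the third template argument of `std::unordered_map`, for example `std::unordered_map<ConstantKey, int, ConstantHash>`.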

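The up-casting issue named in commit 46b8cfabc0 comes from C++ integer promotion: `|` and `<<` on `uint8_t` or `uint16_t` operands produce an `int`, so a recursive template call that deduces its type from such an expression silently switches to `int`. The sketch below is a hypothetical helper (`ReplicateBits` is an assumed name, not dynarmic's actual `Replicate`) showing one way to pin the type through the recursion.

```cpp
#include <cstddef>
#include <cstdint>
#include <type_traits>

// Integer promotion: combining two uint8_t values with | yields an int.
static_assert(std::is_same<decltype(std::uint8_t{1} | std::uint8_t{2}), int>::value,
              "operands narrower than int are promoted");

// Hypothetical helper: repeats the low `element_size` bits of `value`
// until the full width of T is filled.
template<typename T>
constexpr T ReplicateBits(T value, std::size_t element_size) {
    static_assert(std::is_unsigned<T>::value, "T must be an unsigned integer type");
    constexpr std::size_t bit_size = sizeof(T) * 8;
    if (element_size >= bit_size) {
        return value;
    }
    // The expression below has type int for u8/u16 operands, so it is cast
    // back to T and the recursive call names T explicitly instead of deducing it.
    return ReplicateBits<T>(static_cast<T>(value | (value << element_size)), element_size * 2);
}

// The two-bit pattern 01 fills a byte as 01010101.
static_assert(ReplicateBits<std::uint8_t>(0x1, 2) == 0x55, "pattern fills the byte");
```

Without the explicit `<T>` and the `static_cast`, the recursive call would deduce `int` from the promoted expression and keep replicating up to the width of `int` rather than stopping at the width of `T`.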

2607 Commits