Sundaram Ramaswamy
|
d6b93f052e
|
Add nlerp for quaternions
Normalized linear interpolation for quaterions are a cheaper
alternative to slerp. This PR adds nlerp operaiton for quaternions.
|
2021-05-06 23:38:26 +05:30 |
|
Recep Aslantas
|
28705be5a3
|
simd, sse: reduce some computation at glm_mul_rot_sse2()
|
2021-05-01 23:16:03 +03:00 |
|
Recep Aslantas
|
e1b142bce7
|
add todo to quat.h
|
2021-05-01 23:03:41 +03:00 |
|
Recep Aslantas
|
0f96eaad20
|
sse2: optimize glm_mat3_mul_sse2() with sse2
* reduce memory access for dest[2][2]
* the speed is increased ;)
|
2021-05-01 22:55:19 +03:00 |
|
winter091
|
13269f4af8
|
fix typo: vec3 -> vec4
|
2021-05-01 16:48:31 +03:00 |
|
Recep Aslantas
|
faf6186c29
|
sse: optimize glm_mat2_mul_sse2 with sse
|
2021-05-01 03:44:04 +03:00 |
|
Recep Aslantas
|
2be6ac949b
|
sse: optimize glm_quat_mul with sse
|
2021-05-01 03:18:26 +03:00 |
|
Recep Aslantas
|
5b7bc522ac
|
sse: optimize affine with sse
* re-oder instructions for ILP
|
2021-05-01 02:58:14 +03:00 |
|
Recep Aslantas
|
376cf31ee7
|
armi neon: optimize affine with neon
|
2021-05-01 02:46:14 +03:00 |
|
Recep Aslantas
|
d28b381dd6
|
armi neon: optimize mat4 mul with neon
|
2021-05-01 02:45:15 +03:00 |
|
Recep Aslantas
|
3673622cc3
|
simd, sse: optimize mat4 mul-v with sse
* re-oder instructions for ILP
|
2021-05-01 02:17:34 +03:00 |
|
Recep Aslantas
|
a90f706e12
|
simd, sse: optimize mat4 mul with sse
* re-oder instructions for ILP
|
2021-05-01 02:17:08 +03:00 |
|
Recep Aslantas
|
c065d71a2f
|
simd, sse: optimize mat4 inv with sse
* reduce a few shufflees
* re-oder instructions for ILP
|
2021-04-30 21:12:17 +03:00 |
|
Recep Aslantas
|
ba634d6c83
|
simd: optimize glm_mat4_zero() with simd
|
2021-04-30 04:04:27 +03:00 |
|
Recep Aslantas
|
f35badd436
|
now working on v0.8.3
|
2021-04-30 01:00:42 +03:00 |
|
Recep Aslantas
|
04eaf9c535
|
arm, neon: neon/fma support for glm_quat_mul()
|
2021-04-29 01:12:00 +03:00 |
|
Recep Aslantas
|
4e4bff418d
|
arm, neon: neon/fma support for glm_mat2_mul()
|
2021-04-28 22:06:46 +03:00 |
|
Recep Aslantas
|
55ebbdbe40
|
arm, neon: neon/fma support for glm_inv_tr()
|
2021-04-28 14:46:14 +03:00 |
|
Recep Aslantas
|
155eb109a8
|
arm, neon: neon/fma support for glm_mul_rot()
|
2021-04-25 03:49:35 +03:00 |
|
Recep Aslantas
|
2903813765
|
arm, neon: neon/fma support for glm_mul()
|
2021-04-25 03:45:00 +03:00 |
|
Recep Aslantas
|
0ab50f7208
|
arm, neon: update mat4_mul to use FMA
|
2021-04-25 03:41:39 +03:00 |
|
Recep Aslantas
|
701e015bfd
|
avoid loading vec3 by glmm_load()
* use glmm_set1() for each for now
|
2021-04-25 02:36:06 +03:00 |
|
Recep Aslantas
|
1fb941a41b
|
drop swizzling helpers fro now for simplicity
|
2021-04-25 02:35:55 +03:00 |
|
Recep Aslantas
|
92151c6328
|
arm, neon: use div instead of mul by 1 / det for mat4_inv
|
2021-04-24 18:02:47 +03:00 |
|
Recep Aslantas
|
afac887850
|
arm, neon: implement mat4 inv with neon
|
2021-04-24 17:54:01 +03:00 |
|
Recep Aslantas
|
a111693b6b
|
arm, neon: implement mat4 determinant with neon
|
2021-04-24 15:45:36 +03:00 |
|
Recep Aslantas
|
ce9e5f5575
|
arm: update glmm swizzling func names
|
2021-04-24 01:38:04 +03:00 |
|
Recep Aslantas
|
d13842e7de
|
arm: optimize vec4 div with NEON
|
2021-04-24 00:51:09 +03:00 |
|
Recep Aslantas
|
059e5010e6
|
arm: define CGLM_ARM64 for identify arm64
|
2021-04-24 00:44:07 +03:00 |
|
Recep Aslantas
|
65292a94a6
|
swizzling functions for NEON
|
2021-04-24 00:00:00 +03:00 |
|
Recep Aslantas
|
f303984aad
|
use unified glmm api for vec4
|
2021-04-23 23:34:36 +03:00 |
|
Recep Aslantas
|
7d5c4da7cf
|
optimize translate functions
|
2021-04-23 23:32:48 +03:00 |
|
Recep Aslantas
|
63988a515c
|
glmm: new defines for splat
|
2021-04-23 23:32:21 +03:00 |
|
Recep Aslantas
|
9725b60d46
|
rename glmm_shuff1x() to glmm_splat()
* mark glmm_shuff1x() as DEPRECATED
|
2021-04-23 22:12:57 +03:00 |
|
Recep Aslantas
|
f14ca0c3f2
|
now working on v0.8.2
|
2021-04-23 17:03:39 +03:00 |
|
Recep Aslantas
|
2de1133012
|
fix enabling FMA macro for MSVC
|
2021-04-19 06:04:22 +03:00 |
|
Recep Aslantas
|
ebba4eea8e
|
win, msvc: enable FMA macro for MSVC
|
2021-04-19 04:14:14 +03:00 |
|
Recep Aslantas
|
aa2fa89e6c
|
arm: fma msub and nmsub
|
2021-04-19 00:35:19 +03:00 |
|
Recep Aslantas
|
7b0eee497e
|
arm: fix fmadd parameter order
|
2021-04-19 00:28:07 +03:00 |
|
Recep Aslantas
|
04008d9c3f
|
arm: fix fma for glm_vec4_muladds
|
2021-04-19 00:21:04 +03:00 |
|
Recep Aslantas
|
11b1588105
|
glmm: missing FMA funcs for SSE and AVX
|
2021-04-19 00:20:47 +03:00 |
|
Recep Aslantas
|
7c81482248
|
avx: implement scale matrix using AVX
|
2021-04-19 00:11:43 +03:00 |
|
Recep Aslantas
|
f3f29bd383
|
vec4: optimize muladd and muladds with fma
|
2021-04-18 16:24:29 +03:00 |
|
Recep Aslantas
|
0d0d22f96c
|
opitimize affine matrix operations with fma
|
2021-04-18 13:51:22 +03:00 |
|
Recep Aslantas
|
7df5aa2e26
|
opitimize mat2 operations with fma
|
2021-04-18 13:51:09 +03:00 |
|
Recep Aslantas
|
7cc4c37afb
|
opitimize mat3 operations with fma
|
2021-04-18 13:51:03 +03:00 |
|
Recep Aslantas
|
abe29a788a
|
opitimize mat4 operations with fma
|
2021-04-18 13:50:51 +03:00 |
|
Recep Aslantas
|
c5655bbd2e
|
glmm: define fma functions
|
2021-04-18 13:49:50 +03:00 |
|
Recep Aslantas
|
47e0045015
|
glmm, x86: define hadd function
|
2021-04-18 13:49:36 +03:00 |
|
Recep Aslantas
|
405cda6ee9
|
now working on v0.8.1
|
2021-03-02 23:36:33 +03:00 |
|