Commit Graph

395 Commits

Author SHA1 Message Date
Magnus Ulimoen ff9a477b67 Remove non-global Threadpool 2021-03-22 19:28:01 +01:00
Magnus Ulimoen 7aadda3de9 Move integrate to separate crate 2021-03-22 17:49:35 +01:00
Magnus Ulimoen be1330ec02 Add constrmatrix as separate crate 2021-03-22 16:24:32 +01:00
Magnus Ulimoen 502679c9a1 Move Float to separate crate 2021-03-22 16:17:27 +01:00
Magnus Ulimoen be984fbdac Bump sprs to 0.10 2021-03-18 23:27:03 +01:00
Magnus Ulimoen 550b43b4cd Bump ndarray 2021-03-16 19:03:35 +01:00
Magnus Ulimoen f098981d3e Update email 2021-03-16 19:00:24 +01:00
Magnus Ulimoen 8383517ba3 ensure slice can be cast to Matrix 2021-03-15 20:18:19 +01:00
Magnus Ulimoen 17ab18e953 zero-pad diffxi kernel 2021-03-15 20:07:41 +01:00
Magnus Ulimoen e43e71a4d8 Make flip_XX impl on Matrix 2021-03-15 19:31:41 +01:00
Magnus Ulimoen 6fc045ae17 Replace transmute with cast 2021-02-12 19:02:13 +01:00
Magnus Ulimoen 5d59a51bbe Add option to output information in JSON 2021-02-12 18:28:30 +01:00
Magnus Ulimoen 743cca00e2 Revert hdf5 patch (TODO: add back info here) 2021-02-12 18:09:15 +01:00
Magnus Ulimoen 78f0c9ed2c Update json5 crate to include location errors 2021-02-12 18:08:10 +01:00
Magnus Ulimoen d11abff1c3 Add checkbox for gridlines 2021-02-12 17:16:36 +01:00
Magnus Ulimoen dc94aa8e6b Merge branch 'feature/const_matrix'
Using constr generics improves performance of diffxi by about
15% without fast-float and to 30% with the fast-float flag
2021-02-10 21:25:08 +01:00
Magnus Ulimoen a02c7daafc remove iterator inhibiting optimisation 2021-02-10 21:17:05 +01:00
Magnus Ulimoen 02175d1734 use some unsafe... 2021-02-10 19:29:26 +01:00
Magnus Ulimoen 8a6dc60edf remove some unsafe from simd 2021-02-10 19:02:48 +01:00
Magnus Ulimoen 87c055f81e Add back simd column algo 2021-02-09 21:44:35 +01:00
Magnus Ulimoen cf4d8f1e9b add Zero for constmatrix 2021-02-03 08:41:11 +01:00
Magnus Ulimoen 7ec426b5a8 add more fast intrinsics 2021-02-03 08:31:33 +01:00
Magnus Ulimoen 64a4e92dd2 specialise on contigous ny 2021-02-02 00:33:37 +01:00
Magnus Ulimoen c709cf465e remove ndarray transmute 2021-02-02 00:12:03 +01:00
Magnus Ulimoen 74d99a4a18 try ndarray transmute 2021-02-02 00:12:03 +01:00
Magnus Ulimoen b15ea57e6d inline for perf 2021-02-02 00:12:03 +01:00
Magnus Ulimoen 299b4f8083 ensure FastFloat flag works 2021-02-02 00:12:03 +01:00
Magnus Ulimoen 6f7268bf33 use matrices everywhere 2021-02-02 00:12:02 +01:00
Magnus Ulimoen 31ac46e386 move data structs into separate files 2021-02-02 00:12:02 +01:00
Magnus Ulimoen f7f8a7ffff make flip const functions 2021-02-02 00:12:02 +01:00
Magnus Ulimoen 1f3aa2c116 make core-intrinsics cfg'ed 2021-02-02 00:12:02 +01:00
Magnus Ulimoen c73c6e7407 diff_op_col_naive_matrix 2021-02-02 00:12:02 +01:00
Magnus Ulimoen c660354c3f Matrix for Upwind4 2021-02-02 00:12:02 +01:00
Magnus Ulimoen 45e4d51513 remove a closure 2021-02-02 00:12:02 +01:00
Magnus Ulimoen b0e1ec62f8 change order in matmul_into 2021-02-02 00:12:02 +01:00
Magnus Ulimoen 481f2d607e remove a lot of unsafe, lost perf 2021-02-02 00:12:02 +01:00
Magnus Ulimoen 00f3ba6a01 use split_at_mut 2021-02-02 00:12:02 +01:00
Magnus Ulimoen db94caf2b2 change repr of Matrix 2021-02-02 00:12:02 +01:00
Magnus Ulimoen c133557459 use Matrix in SBP diff 2021-02-02 00:12:02 +01:00
Magnus Ulimoen 3c7cc4605a simplify traits 2021-02-02 00:12:02 +01:00
Magnus Ulimoen f7c238f6a7 add blockend SBP8 2021-02-02 00:12:02 +01:00
Magnus Ulimoen a7660281c8 add inline to remove magic fix 2021-02-02 00:12:02 +01:00
Magnus Ulimoen 14fefe97ab add Matrix approach for SBP8 2021-02-02 00:12:02 +01:00
Magnus Ulimoen 3d34f2e7a0 add fast-float feature 2021-02-02 00:12:02 +01:00
Magnus Ulimoen 4c2daf5933 minor thingys 2021-02-02 00:12:02 +01:00
Magnus Ulimoen db552af4ff add blockend with weird caveat 2021-02-02 00:12:02 +01:00
Magnus Ulimoen bcda26a512 15% reductions in instr count 2021-02-02 00:12:02 +01:00
Magnus Ulimoen 36293e75e6 10% instr reduction with fast_ intr 2021-02-02 00:12:02 +01:00
Magnus Ulimoen 30c563c19d working d1 SBP4 2021-02-02 00:12:02 +01:00
Magnus Ulimoen 94e8fb5b7c checkpoint 2021-02-02 00:12:02 +01:00