v0.1.3
What's Changed
- Add torch compile to benchmark by @oulgen in #545
- Fix issues with wrong dtypes in generated code by @jansel in #542
- Limit concurrent precompile jobs while autotuning by @jansel in #543
- Create basic helion benchmark runner by @oulgen in #544
- Add multi selection radio buttons by @oulgen in #547
- Fix benchmark condition by @oulgen in #548
- Move to dispatcher model for benchmarking by @oulgen in #549
- Give permissions by @oulgen in #550
- Do not downgrade torch/triton by @oulgen in #551
- Use uv for pip freeze by @oulgen in #552
- Add jagged hstu attention example (i.e. ragged_attention) by @xuanzhang816 in #554
- Install quack/torchbench with no deps by @oulgen in #553
- Update test-reports dir by @oulgen in #556
- torch.rand_like and torch.randn_like support by @yf225 in #530
- [Benchmark] add addmm example and test by @Sibylau in #555
- Kick off benchmarks at midnight by @oulgen in #559
- Use profiler instead of inductor_benchmarker by @oulgen in #560
- Shard kernels by @oulgen in #561
- Add layer_norm and softmax by @oulgen in #562
- [Fix CI] Convert tiles to sizes for all torch.* functions by @yf225 in #563
Full Changelog: v0.1.2...v0.1.3