Starting January we should probably move to start up-streaming the performance-beneficial changes. Division is a good place to start. - [x] Use the `rcp.approx.ftz.f64` (or f32) as the first guess in the `newton_division` - [x] Try to replace some reciprocals in operators (e.g. \delata) as well. - [x] study performance impact - [ ] see if there are problems with regression tests as a result
Starting January we should probably move to start up-streaming the performance-beneficial changes. Division is a good place to start.
rcp.approx.ftz.f64(or f32) as the first guess in thenewton_division