Takafumi has built up some amazing infrastructure in the package ecosystem. His Transducers.jl [1] is really interesting and powerful, and lately he's done a lot of work on packages like FLoops.jl [2] and ThreadsX.jl [3] to bring the benefits of transducers to more familiar, 'regular' representations so that more people can use them. The basic idea behind all of it is an efficient and modular way of describing various 'looping' constructs that can be stuck together, optimized, and parallelized automatically.
It'd be quite interesting to see this stuff extended to GPUs.
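For anyone who hasn't met transducers before, here is a rough, language-agnostic sketch of the idea in Python. It is not the Transducers.jl API; the helper names (`mapping`, `filtering`, `compose`, `transduce`) are made up for illustration. Each transducer wraps a reducing step, so a filter and a map fuse into a single loop, and the same pipeline can be folded over separate chunks whose partial results are then combined, which is what makes parallelizing possible when the reduction is associative.

```python
from functools import reduce

# Hypothetical helpers for illustration only; not the Transducers.jl API.

def mapping(f):
    """Transducer: apply f to each item before handing it to the next step."""
    def transducer(step):
        return lambda acc, x: step(acc, f(x))
    return transducer

def filtering(pred):
    """Transducer: pass an item on only if pred(item) is true."""
    def transducer(step):
        return lambda acc, x: step(acc, x) if pred(x) else acc
    return transducer

def compose(*xforms):
    """Compose transducers so that data flows through them left to right."""
    def composed(step):
        for xf in reversed(xforms):
            step = xf(step)
        return step
    return composed

def transduce(xform, step, init, iterable):
    """Fold over iterable once, using the step wrapped by the transducer."""
    return reduce(xform(step), iterable, init)

def add(acc, x):
    return acc + x

# Keep the even numbers, square them, and sum: one pass, no intermediate lists.
pipeline = compose(filtering(lambda x: x % 2 == 0), mapping(lambda x: x * x))
total = transduce(pipeline, add, 0, range(10))

# The same pipeline folded over two chunks. The folds run sequentially here,
# but because addition is associative they could run on separate workers.
chunks = [range(0, 5), range(5, 10)]
partials = [transduce(pipeline, add, 0, chunk) for chunk in chunks]
combined = reduce(add, partials, 0)

print(total, combined)  # both are 120
```

The pipeline is described once, independently of how the fold is executed, so swapping the sequential fold for a chunked one needs no changes to the pipeline itself.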
Tkf has so many cool packages! Less to do with transducers or parallelism, but in terms of just cool and useful stuff, Maybe [1] and BangBang [2] definitely come to mind as well.
[1] https://github.com/JuliaFolds/Transducers.jl
[2] https://github.com/JuliaFolds/FLoops.jl
[3] https://github.com/tkf/ThreadsX.jl