GPU-accelerated Llama3.java inference in pure Java using TornadoVM github.com 41 points by pjmlp 2 days ago