DeepDive in everything of Llama3: revealing detailed insights and implementation
10 comments
·February 21, 2025kevmo314
therealoliver
Yes, these are two different learning paths. The detailed process learning is beneficial for future research, while the API-style approach is convenient and quick for getting started and using. Both are very useful!
simonw
I hadn't realized OpenAI's tiktoken Python library could work with other models outside of the OpenAI family, that's really useful: https://github.com/therealoliver/Deepdive-llama3-from-scratc...
therealoliver
I'm glad to have helped you :)
aghilmort
great need; mulling over; shows up all the time in AI paradigms
therealoliver
glad to have helped you :)
curtisszmania
[dead]
FreebasingLLMs
[flagged]
jawr
If you’ve got nothing constructive to say… don’t say anything? OP brings a lot of value in a style they like, your comment brings absolutely nothing.
therealoliver
Thank you for helping me out. Such comments are really depressing.
I like the use of the functional API here. I learned through a similar route and it was very helpful for me compared to trying to understand `torch.nn.Module`.
Here's a gist of my learning path if it's helpful to anyone: https://gist.github.com/kevmo314/294001659324429bae6749062a9...