In this video, I’ll show you few shot learning example using GPT-Neo: The open-source solution for GPT-3. GPT‑Neo is the code name for a family of transformer-based language models loosely styled around the GPT architecture. The stated goal of the project is to replicate a GPT‑3 DaVinci-sized model and open-source it to the public, for free.
GPT‑Neo is an implementation of model & data-parallel GPT‑2 and GPT‑3-like models, utilizing Mesh Tensorflow for distributed support. The codebase is optimized for TPUs, but also work on GPUs. Interestingly, Leahy had earlier attempted to replicate GPT-2 through Google’s Tensorflow Research Cloud (TFRC) program, which worked out to their advantage while working on GPT-Neo.
Join this channel to get access to perks:
If you like my work, you can support me by buying me a coffee by clicking the link below: https://www.buymeacoffee.com/bhattbhavesh91
If you do have any questions with what we covered in this video then feel free to ask in the comment section below & I’ll do my best to answer those.
If you enjoy these tutorials & would like to support them then the easiest way is to simply like the video & give it a thumbs up & also it’s a huge help to share these videos with anyone who you think would find them useful.
Please consider clicking the SUBSCRIBE button to be notified for future videos & thank you all for watching.
You can find me on:
Blog - https://bhattbhavesh91.github.io
Twitter - https://twitter.com/_bhaveshbhatt
GitHub - https://github.com/bhattbhavesh91
Medium - https://medium.com/@bhattbhavesh91
About.me - https://about.me/bhattbhavesh91
Linktree - https://linktr.ee/bhattbhavesh91
DEV Community - https://dev.to/bhattbhavesh91