New ask Hacker News story: Ask HN: Where to learn the specifics of how GPT4-quality models are created?

March 18, 2023

Ask HN: Where to learn the specifics of how GPT4-quality models are created?
3 by hn92726819 | 1 comments on Hacker News.
I have never followed AI/ML work. In my experience, all the AI/ML workers I've encountered seen to throw stuff at the wall and see what sticks. Clearly, that is not how the GPT-3/GPT-4 models were built. Is there a resource that explicitly explains at a technical level the thought process behind a full implementation of a model like these? For example, why they chose to add an X-type layer here, or what layers they may have tweaked and why to improve 3.5 to 4's quality? I understand OpenAI is closed source. But I'm sure there isn't some secret sauce that only employees there know about that makes these models happen, so where does one learn this?

Search This Blog

Call center services in india

New ask Hacker News story: Ask HN: Where to learn the specifics of how GPT4-quality models are created?

Comments

Post a Comment

Popular posts from this blog

How can Utilize Call Center Outsourcing for Increase your Business Income well?

New ask Hacker News story: EVM-UI – visual tool to interact with EVM-based smart contracts

New ask Hacker News story: Ask HN: Should I quit my startup journey for now?