ML News

LLaMA 3: What to Expect from Meta's Upcoming Language Model

Dec 26, 2023

5 min read

Over the last year, over the GPTs of the world, no architecture has truly revolutionized Natural Language Processing for the masses as LLaMA by Meta has. By giving access to millions of trained parameters rivaling that of GPT, LLaMA has jumpstarted how the public perceives and develops open-sourced Large Language Models.

We over here at NimbleBox AI have come to realize the importance of governance and transparency in AI and cannot wait for the launch of the next installment of LLaMA models in the form of LLaMA 3 in the upcoming year. Let us break down all the rumors about the model, what the general public can expect, and how ChatNBX plans to bring it to your systems in our fantastic chatbot.

The LLM Space Since LLaMA 2

Developed by Meta AI, LLaMA3 follows the Open-Approach Meta implemented for LLaMA, which aims to provide a more accessible and transparent alternative to strictly moderated language models like ChatGPT and GPT-4 by OpenAI.

LLaMA2, the freely available predecessor of LLaMA3, gained popularity due to its versatility and customization options for research and commercial use. It garnered over 100,000 requests for access to its weights and inspired countless new variants in the market. Building on this success, Meta AI has decided to capitalize on the LLaMA revolution by partnering with Microsoft. This collaboration was announced at the latest Microsoft Inspire event, making Microsoft the preferred partner for LLaMA2.

With LLaMA3, Meta AI aims to take language models to new heights by incorporating natural language processing and machine learning advancements. This successor model is expected to offer enhanced capabilities and improved performance, further expanding the possibilities for research, development, and commercial applications.

As the successor to LLaMA2, the upcoming LLaMA3 represents Meta AI’s continued commitment to democratizing access to state-of-the-art language models. By leveraging the power of collaboration and open-source principles, LLaMA3 aims to drive innovation and empower users from various domains to harness the potential of large language models.

Do we need a LLaMA 3?

Now, one looming question that everyone has with all these large language models coming every single month, which are bigger and bigger and promise some unbelievable things, is, do we need a LLaMA 3 when we have a GPT-4?

Well, let us talk about the latest launch of GPT Builder by OpenAI, which provides a fantastic platform through which enthusiasts can train their versions of GPT-4 and GPT-3 by finetuning and training on their data on OpenAI’s website. Well, what it ended up doing was that all the companies that have picked up hundreds of thousands of funding from VC Funds and Angel Investors, who didn’t have a moat stronger than their understanding of GPT models and just that, simply lost their entire business.

Companies that ended up creating apps and products surrounding OpenAI’s API and not doing their due diligence and have something revolutionary to add ended up just helping OpenAI perfect their GPT Builder.

In a situation like this, having access to an Open-Source Model like LLaMA 3 will help you stay updated and compete with the likes of GPT-4, all the while making sure that you are not just dealing with an API to a black box code but something that you can see, parameters that you can tweak, hyperparameters that you can control, and at the end of the day understand which is only possible with transparency and governance in AI.

What do we expect from the model?

Ever since the Fortune companies have been stifling again with the launch of Gemini by Google and the Integration of GPT-4 in Dall E-3, Meta has been quiet, making you wonder what they plan to do next. Much like the next iPhone or Pixel Phone, there is no question that Meta is developing the next edition of the model; however, what is somewhat surprising and exciting is the insane purchases made by Meta over the last few months.

Checking out the latest reports from Nvidia, we can see that Meta has been going toe-to-toe with Microsoft (Does anyone care about Bard anymore?) regarding orders placed for H100 GPUs. To give some context, H100 is the most powerful GPU out there for enterprises to train their models with the fantastic enterprise support provided for clusters of these GPUs in the form of Nvidia NVLink, making it capable of processing over trillion-parameters LLMs with only 256 GPUs.

With Meta’s order of over 150,000 such GPUs and the talks of the new model surpassing its predecessors and competition in terms of context and parameters is something that has been pointing towards some rather attractive models, with some rumors going as far as to say the most extensive version of the models will have over 120B Trainable Parameters.

Conclusion

In conclusion, our company is excited to have explored the possible specifications of LLaMA3, the upcoming successor to the LLaMA series. As a company working towards integrating large language models in our chatbots, we are dedicated to providing the best versions of LLaMA3 to the public. With the advancements in natural language processing and machine learning, LLaMA3 is poised to revolutionize the capabilities of language models. We are committed to leveraging these advancements to create chatbots that enhance user communication and understanding.

By harnessing the power of LLaMA3, we aim to push the boundaries of conversational AI and deliver chatbot experiences that are more intelligent, responsive, and personalized. Our team is dedicated to fine-tuning and optimizing LLaMA3 to meet our users’ diverse needs and use cases. We understand the importance of providing cutting-edge technology that addresses the challenges and demands of today's digital landscape. As we work towards integrating LLaMA3 into our chatbot offerings, we will prioritize user satisfaction, accessibility, and transparency.

Written By

Aryan Kargwal

Data Evangelist