Image by Author
LlaMA 2 is a family of state-of-the-art open-source large language models released by Meta AI. You can use it for commercial use, and it comes with the code, pre-trained models, and fine-tuned models. All of the resources are available at HuggingFace, and you can even experience the model performance by trying it out on HuggingChat. By making Llama 2 openly available, Meta AI is enabling researchers and developers to build innovative applications powered by advanced language capabilities.
Image from HuggingChat
Claude 2 is the latest iteration of Anthropic’s conversational AI assistant. It has improved performance, longer responses, and can be accessed via API as well as a new public-facing beta website, claude.ai. The developers at Anthropic have focused on enhancing its abilities in areas like coding, math, and logical reasoning compared to previous Claude versions. For example, Claude2 recently scored 76.5% on the multiple-choice section of the Bar exam, a significant jump up from 73.0% for Claude 1.3.
You can access all types of Claude models on Poe and experience the performance yourself.
Image from Poe
Google AI PaLM 2 is Google’s latest large language model that excels at advanced reasoning tasks, including code, math, classification, question answering, translation, multilingual proficiency, and natural language generation. It outperforms previous state-of-the-art large language models like the original PaLM across all these capabilities due to its optimized compute-scaling approach, enhanced dataset mixture, and architectural improvements.
You can access it for free using Bard.
There is an enchantment, but it is still far away from GPT-4 quality and performance.
Image from Bard
Vicuna-33b-v1.3 was fine-tuned from LLaMA with supervised instruction fine-tuning on 125K conversations collected from ShareGPT.com. It is one of many top performing models on Open LLM Leaderboard. You can access the model for free on HuggingFace or try the official demo on lmsys.org.
Image from lmsys.org
MPT-30B-Chat is a chatbot that was fine tuned to generate the dialogues. It was created by fine tuning the MPT 30B on multiple dialogue datasets ( ShareGPT-Vicuna, Camel-AI, GPTeacher, Guanaco, Baize and some generated datasets). MPT-30B-Chat is one of the top model on Open LLM leaderboard and you can experience it for free on a Hugging Face Space by mosaicml.
Image from MPT-30B-Chat
While GPT-4 remains closed and inaccessible, exciting open-source large language models are emerging as alternatives that anyone can use. Models like Anthropic’s Claude2, Meta’s LLaMA2, and MPT-30B show remarkable progress in conversational ability, reasoning, and multilingual versatility. Although not as massive in scale as GPT-4, these freely available models demonstrate that state-of-the-art language AI continues to advance rapidly. Their strengths in areas like math, coding, and logic make them capable replacements for many applications.
After the launch of LlaMA2 models, there has been a boom of high-performing models that are fine-tuned on various datasets. You can check all of them on the Open LLM Leaderboard.
Abid Ali Awan (@1abidaliawan) is a certified data scientist professional who loves building machine learning models. Currently, he is focusing on content creation and writing technical blogs on machine learning and data science technologies. Abid holds a Master’s degree in Technology Management and a bachelor’s degree in Telecommunication Engineering. His vision is to build an AI product using a graph neural network for students struggling with mental illness.