BTLM-3B-8K: 7B Performance in a 3B Model
Large Language Models report from Cerebras Systems with 9 connected researchers in the LLMpeople atlas.
Connected researchers
Nhan Tran
Cerebras Systems
Researcher at Cerebras Systems and coauthor of Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Maximilian Welt
Cerebras Systems
Researcher at Cerebras Systems and coauthor of BTLM-3B-8K: 7B Performance in a 3B Model.
Alexander Lavin
Cerebras Systems
Researcher at Cerebras Systems and coauthor of BTLM-3B-8K: 7B Performance in a 3B Model.
Nicolas Rajbhandari
Cerebras Systems
Researcher at Cerebras Systems and coauthor of BTLM-3B-8K: 7B Performance in a 3B Model.
Harsha Chilakapati
Cerebras Systems
Researcher at Cerebras Systems and coauthor of BTLM-3B-8K: 7B Performance in a 3B Model.
Hrant Khachatrian
Cerebras Systems
Hrant Khachatrian is a machine learning researcher and engineer at FAST and a co-founder of YerevaNN, with work spanning natural language processing, computer vision, and efficient language models.
Arun Jindal
Cerebras Systems
Researcher at Cerebras Systems and coauthor of BTLM-3B-8K: 7B Performance in a 3B Model.
Yejin Choi
Cerebras Systems
Professor at Stanford University and senior director at the Allen Institute for AI. Her research spans natural language processing, commonsense reasoning, and multimodal AI.
Carlo Strapparava
Cerebras Systems
Researcher at Cerebras Systems and coauthor of BTLM-3B-8K: 7B Performance in a 3B Model.