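AMiner - AI-Powered Scientific Intelligence Mining - Academic Search - Paper Retrieval - Papers and Patents - Literature Tracking - Scholar Profiles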

Kimi proposed a new attention mechanism, MoBA, which combines the principles of MoE and improves the efficiency of LLMs in long-text scenarios without sacrificing performance.

No More Adam: Learning Rate Scaling at Initialization is All You Need

Minghao Xu, Lichuan Xiang, Xu Cai, Hongkai Wen

CoRR (2024)

Cited: 2 | Views: 2012

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Benjamin Warner, Antoine Chaffin, Benjamin Clavié, Orion Weller, Oskar Hallström, Said Taghadouini, Alexis Gallagher, Raja Biswas, Faisal Ladhak, Tom Aarsen, Nathan Cooper, Griffin Adams, et al.

CoRR (2024)

Cited: 69 | Views: 1426

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Frank F. Xu, Yufan Song, Boxuan Li, Yuxuan Tang, Kritanjali Jain, Mengxue Bao, Zora Z. Wang, Xuhui Zhou, Zhitong Guo, Murong Cao, Mingyang Yang, Hao Yang Lu, et al.

Computing Research Repository (2024)

Cited: 22 | Views: 1201
