diff --git a/README.md b/README.md index c6de432..363b85f 100644 --- a/README.md +++ b/README.md @@ -14,10 +14,8 @@ It has the implementation of fundamental research to improve modeling generality - Efficiency - [**X-MoE**](https://arxiv.org/abs/2204.09179): scalable & finetunable sparse Mixture-of-Experts (MoE) ### Revolutionizing Transformers for (M)LLMs and AI - -> [**RetNet**](https://arxiv.org/abs/2307.08621): Retentive Network: A Successor to Transformer for Large Language Models - -> [**LongNet**](https://arxiv.org/abs/2307.02486): Scaling Transformers to 1,000,000,000 Tokens +- [**RetNet**](https://arxiv.org/abs/2307.08621): Retentive Network: A Successor to Transformer for Large Language Models +- [**LongNet**](https://arxiv.org/abs/2307.02486): Scaling Transformers to 1,000,000,000 Tokens ## News