Open source Mamba 3 arrives to surpass Transformer architecture with nearly 4% improved language modeling, reduced latency

via openai.com

Short excerpt below. Read at the original source.

The generative AI era began for most people with the launch of OpenAI’s ChatGPT in late 2022, but the underlying technology — the “Transformer” neural network architecture that allows AI models to weigh the importance of different words in a sentence (or pixels in an image) differently and train on information in parallel — dates […]

Read at Source