Neural Magic (@neuralmagic) Twitter Tweets • TwiCopy

Reposts: 2

vLLM

@vllm_project

3 weeks ago

We are doubling our committer base for vLLM to ensure it is best-in-class and a truly community effort. This is just a start. Let's welcome Kaichao You, Philipp Moritz, Nick Hill, Roger Wang, Cade Daniel 🇺🇸, Robert Shaw as committers and thank you for your great work! 👏

Likes: 31

Reposts: 4

ridgelinevc

@ridgelinevc

1 month ago

Generative AI is a treasure trove of opportunity. Has @NeuralMagic struck gold?

Dive into a Q&A with CEO Brian Stevens in TechBullion covering 👉

1️⃣ Neural Magic's technology story
2️⃣ Future generative AI capabilities
3️⃣ New strategic partnerships

twitter.com/TechBullion/st…

Likes: 3

Neural Magic

@neuralmagic

1 month ago

What a busy 2024 for our product and engineering teams! They summarize it all in our Q1 product release blog: neuralmagic.com/blog/neural-ma…

From substantial advancements in AI model training, optimization, and deployment for LLMs on CPUs... to the launch of nm-vllm, which enables GPU…

Likes: 6

Reposts: 0

Neural Magic

@neuralmagic

1 month ago

Had a great AI Day with Cerebras! Our CTOs, Mark Kurtz and Sean Lie, with their teams, released expertly-optimized Llama 2 7B models that have been sparsified for superior performance and memory: huggingface.co/collections/ne…

🙏 James Wang (in NYC) and Julie Choi for including us!

Likes: 14

Derrick Mwiti

@_mwitiderrick

1 month ago

Have you had a chance to check out Marlin?

It's a state-of-the-art 4-bit quantized inference kernel that can deliver close to 4x speedups up to batch sizes of 16-32 tokens.

It is well-suited for larger-scale serving, speculative decoding, or advanced multi-inference schemes.

Likes: 3

TechBullion

@TechBullion

1 month ago

Democratize AI Using Optimized CPUs As The Onramp To Generative AI: Interview with Neural Magic CEO Brian Stevens techbullion.com/democratize-ai… #AI #GenerativeAI #software

Likes: 1

Robert Blumofe

@RobertBlumofe

1 month ago

Doing things smarter with math, algorithms, and software... That's what Akamai Technologies is all about. Akamai and Neural Magic team up to accelerate AI workloads on edge CPU servers siliconangle.com/2024/03/12/aka… via SiliconANGLE

Likes: 11