Journalist

HugoAffaticati

Author of "Microsoft's Azure Sets Record With 1M-Token-Per-Second AI Inference" in Microsoft Tech Community

Mentions

Articles

Outlets

Topics Most Covered

Companies Covered

Writing Patterns

How this journalist typically writes

Article Types

announcement1

Preferred Angles

technical1

Narrative Framing

breakthrough1

Associated AI Models

Llama2 70B1

Writes About

Mark GitauEngineer

1 article

Hugo AffaticatiEngineer

1 article

Signal65Researcher

1 article

Articles

Most recent first

Articles Written

HugoAffaticati as author

Microsoft's Azure Sets Record With 1M-Token-Per-Second AI Inference

Microsoft Tech CommunityannouncementpositiveNov 19, 2025

Microsoft Azure ND GB300 v6 virtual machines achieve a record 1.1 million tokens per second on Llama2 70B inference, a 27% improvement over the previous ND GB200 v6 record and 5× higher throughput per GPU than previous-generation H100 systems.

“Author of "Microsoft's Azure Sets Record With 1M-Token-Per-Second AI Inference" in Microsoft Tech Community”

AI InfrastructureFoundation ModelsAI Chips/HardwareLLMs