

Source: Hacker News Frontpage

GateGPT Achieves 56,000 Tokens Per Second on FPGA for Transformer Models

A new system named GateGPT has reportedly reached 56,000 tokens per second. It is a Transformer with a Key-Value (KV) cache, implemented on a Field-Programmable Gate Array (FPGA) clocked at 80 MHz.

By Fainaron · Jun 16, 2026


GateGPT, a recently developed system, has reportedly demonstrated a processing speed of 56,000 tokens per second. Its architecture is based on the Transformer model and incorporates a Key-Value (KV) cache, which stores the keys and values of earlier tokens so they are not recomputed at every generation step.
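To make the KV cache idea concrete, here is a minimal software sketch of single-head attention with a cache. It is not GateGPT's design, which the report does not describe; the names `KVCache`, `attend`, and `decode_step` are hypothetical and chosen only for illustration.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


class KVCache:
    """Hypothetical cache holding the key and value vector of every token seen so far."""

    def __init__(self):
        self.keys = []
        self.values = []

    def append(self, key, value):
        self.keys.append(key)
        self.values.append(value)


def attend(query, cache):
    """Single-head scaled dot-product attention over all cached keys and values."""
    d = len(query)
    scores = [
        sum(q * k for q, k in zip(query, key)) / math.sqrt(d)
        for key in cache.keys
    ]
    weights = softmax(scores)
    dim = len(cache.values[0])
    return [
        sum(w * value[i] for w, value in zip(weights, cache.values))
        for i in range(dim)
    ]


def decode_step(query, key, value, cache):
    """Process one new token: store its key/value once, then attend over the cache.

    Earlier tokens' keys and values are reused from the cache instead of
    being recomputed, which is the point of a KV cache.
    """
    cache.append(key, value)
    return attend(query, cache)
```

Each call to `decode_step` adds exactly one key/value pair, so the work per new token grows with the number of cached tokens rather than requiring a full pass over the whole sequence.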

The system is implemented on a Field-Programmable Gate Array (FPGA) clocked at 80 MHz. Pairing the model design with custom hardware is intended to speed up the execution of Transformer models.
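The two reported figures imply a per-token cycle budget, which the short calculation below works out. It is a back-of-the-envelope sketch that assumes a single 80 MHz clock domain and that the 56,000 tokens per second is a sustained rate; the report confirms neither, and the function name is hypothetical.

```python
def cycles_per_token(clock_hz, tokens_per_second):
    """Average number of clock cycles available per token at a given throughput."""
    return clock_hz / tokens_per_second


CLOCK_HZ = 80_000_000        # 80 MHz, as reported
TOKENS_PER_SECOND = 56_000   # reported throughput

budget = cycles_per_token(CLOCK_HZ, TOKENS_PER_SECOND)
latency_us = 1_000_000 / TOKENS_PER_SECOND

print(f"{budget:.1f} cycles per token")        # prints "1428.6 cycles per token"
print(f"{latency_us:.2f} microseconds per token")  # prints "17.86 microseconds per token"
```

Under those assumptions, each token has roughly 1,400 clock cycles to complete, which gives a sense of how much of the model's work has to happen in parallel on the FPGA fabric.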

If the reported figure holds, it points to FPGA-based designs as one possible way to accelerate AI inference, particularly for large language models built on Transformer architectures.


#gategpt #transformer #fpga #ai acceleration #kv cache #performance

Source attribution: This article was AI-curated and rewritten by Fainaron from a piece originally published by Hacker News Frontpage.

