Researcher at Unknown
Co-author of the paper on Scalable In-context Ranking with Generative Models
How media typically covers Sanjiv Kumar
Sanjiv Kumar as author
BlockRank enables scalable in-context ranking with generative models by reducing attention complexity from quadratic to linear, achieving 4.7x inference speedup while handling ~500 documents in-context within a second.
“Co-author of the paper on Scalable In-context Ranking with Generative Models”