Notes on Microsoft's FastContext, and a small SWE-QA experiment with retrieval hints
The author analyzes Microsoft's FastContext paper and presents an alternative approach using offline semantic search to reduce token usage in coding agents. By indexing repositories beforehand and providing file-range hints to Claude Code, the method achieved a 43.8% drop in total tokens while maintaining equivalent solution quality on SWE-QA.