Offline GPU Build Picker Estimates Local Model Fit and Speed
A developer has released an offline, single-file HTML tool that estimates which local large language models will fit on a given GPU configuration and predicts their token generation speed. The tool addresses a common question, namely whether a custom PC build can run the desired models effectively, and it requires neither a backend nor a user account.
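The article does not describe how the tool computes its estimates. As a rough illustration of the kind of arithmetic such an estimator might perform, the sketch below uses a common back-of-envelope heuristic: weight size is parameter count times bytes per weight, and decode speed is bounded by how fast the GPU can read those weights from memory. The names `Gpu`, `model_size_gb`, `fits`, and `est_tokens_per_sec`, along with the default `overhead_gb` and `efficiency` values, are hypothetical and chosen for illustration only; they are not taken from the tool.

```python
from dataclasses import dataclass


@dataclass
class Gpu:
    vram_gb: float         # usable VRAM in GB
    bandwidth_gbps: float  # memory bandwidth in GB/s


def model_size_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight size in GB: parameters times bytes per weight."""
    return params_billion * bits_per_weight / 8.0


def fits(gpu: Gpu, params_billion: float, bits_per_weight: float,
         overhead_gb: float = 1.5) -> bool:
    """True if the weights plus an assumed fixed overhead fit in VRAM.

    overhead_gb is a placeholder for KV cache and runtime buffers; the
    real figure depends on context length and the inference runtime.
    """
    needed = model_size_gb(params_billion, bits_per_weight) + overhead_gb
    return needed <= gpu.vram_gb


def est_tokens_per_sec(gpu: Gpu, params_billion: float, bits_per_weight: float,
                       efficiency: float = 0.6) -> float:
    """Bandwidth-bound decode estimate.

    Assumes each generated token reads all weights once, scaled by an
    assumed efficiency factor (0.6 here is a guess, not a measurement).
    """
    size = model_size_gb(params_billion, bits_per_weight)
    return efficiency * gpu.bandwidth_gbps / size
```

Under these assumptions, a 7-billion-parameter model at 4 bits per weight comes to about 3.5 GB of weights and fits comfortably on a hypothetical 24 GB card, while a 70-billion-parameter model at the same precision needs about 35 GB for weights alone and does not. Real results vary with quantization format, context length, and runtime, so numbers from a sketch like this are only a first approximation.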