Proposal for crowd-sourced, open-source distilled LLMs via distributed training
A Reddit user proposes a system for creating truly open-source distilled large language models by wrapping existing command-line AI services. The wrapper would record the inputs users send to applications such as coding assistants or chatbots, together with the outputs they get back, and volunteers would contribute these records to build large shared datasets for distillation.
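As a rough illustration of the wrapping idea, the sketch below runs a command-line tool, passes the user's prompt on standard input, and appends the prompt/response pair to a JSON Lines file. Everything here is an assumption made for illustration: the proposal does not specify a log format, the function names `run_and_log` and `load_pairs` are hypothetical, and a real AI CLI may take its prompt as an argument rather than on stdin.

```python
import json
import subprocess
import time
from pathlib import Path


def run_and_log(command, prompt, log_path):
    """Run `command` with `prompt` on stdin and append the pair to a JSONL log.

    `command` is an argument list for any CLI tool (hypothetical here);
    the record layout is an assumption, not part of the original proposal.
    """
    result = subprocess.run(
        command,
        input=prompt,
        capture_output=True,
        text=True,
        check=False,  # keep failed runs in the log instead of raising
    )
    record = {
        "timestamp": time.time(),
        "command": command,
        "prompt": prompt,
        "response": result.stdout,
        "returncode": result.returncode,
    }
    # One JSON object per line, so logs from many volunteers can be concatenated.
    with Path(log_path).open("a", encoding="utf-8") as f:
        f.write(json.dumps(record, ensure_ascii=False) + "\n")
    return result.stdout


def load_pairs(log_path):
    """Read the log back as (prompt, response) pairs, skipping failed or empty runs."""
    pairs = []
    with Path(log_path).open(encoding="utf-8") as f:
        for line in f:
            rec = json.loads(line)
            if rec["returncode"] == 0 and rec["response"].strip():
                pairs.append((rec["prompt"], rec["response"]))
    return pairs
```

A volunteer would call `run_and_log` in place of invoking the tool directly, and the pairs returned by `load_pairs` would be the raw material for a distillation dataset. Appending one record per line keeps the wrapper simple and lets contributed logs be merged without parsing them first.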