I’m planning to deploy a 1B model, feed it all the documents I’ve ever written, host it on a $149 mini-PC in my bedroom, and enable you to chat with it.
I’ve released similar projects before.
I’ll drop a post about my plans in the coming days and I’ll build and document it about two weeks later if there’s enough interest.
Perhaps via an RNN like in https://huggingface.co/spaces/BlinkDL/RWKV-Gradio-2
Or even just leverage huggingface gradio spaces? (most are Gradio apps that expose APIs https://www.gradio.app/guides/view-api-page)