Feature request
Currently this package assumes that the user workstation has a GPU or enough local capacity to run a LLM.
Trellis splits an LLM across multiple workstations with an encrypted channel between them, which enables large models and multiple model consumers.
Once an LLM is loaded onto a grid, the CLI exposes an Anthropic-like API which can be consumed by clients such as Nextcloud/llm2 : https://trellis.unfoldml.com/docs#api
Disclaimer 1: Trellis is a paid service with a monthly plan, and the client (where the compute runs) is currently proprietary software.
Disclaimer 2: I am the founder/author of Trellis, actively developing the project
Thank you and looking forward to your feedback !
Feature request
Currently this package assumes that the user workstation has a GPU or enough local capacity to run a LLM.
Trellis splits an LLM across multiple workstations with an encrypted channel between them, which enables large models and multiple model consumers.
Once an LLM is loaded onto a grid, the CLI exposes an Anthropic-like API which can be consumed by clients such as Nextcloud/llm2 : https://trellis.unfoldml.com/docs#api
Disclaimer 1: Trellis is a paid service with a monthly plan, and the client (where the compute runs) is currently proprietary software.
Disclaimer 2: I am the founder/author of Trellis, actively developing the project
Thank you and looking forward to your feedback !