FUTO voice-to-text works nicely and fast on my Pixel 8 Pro, only fractions of a second slower than Google's. And that's with the slower English-74 model (more data points, so slower). They have an even larger one, but the default is the smaller, faster English-39 model.
And something like this could be used as the Docker server that holds the Borg repository:
https://github.com/huncrys/docker-borg-server
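For what it's worth, here is a rough sketch of how a client might then point at a repository served over SSH by a container like that. It only builds the BorgBackup 1.x command lines (`borg init`, `borg create`) and prints them. The user name, host, port, and repository path are all made-up placeholders, not values taken from that image, so check its README for the real ones.

```python
import shlex


def borg_repo_url(user, host, port, path):
    """Build an ssh:// repository URL in the form Borg 1.x accepts.

    `path` must be absolute (start with "/").
    """
    return f"ssh://{user}@{host}:{port}{path}"


def borg_commands(repo, archive, sources):
    """Return the argument lists for initialising a repo and creating an archive."""
    # `borg init --encryption=repokey` creates an encrypted repository.
    init = ["borg", "init", "--encryption=repokey", repo]
    # `borg create REPO::ARCHIVE PATH...` backs up the given paths.
    create = ["borg", "create", "--stats", f"{repo}::{archive}", *sources]
    return init, create


# Hypothetical values: replace with whatever the container actually exposes.
repo = borg_repo_url("borg", "backup.example.lan", 2222, "/backups/pixel")

# "{now}" is a Borg placeholder that expands to the current timestamp.
init_cmd, create_cmd = borg_commands(repo, "phone-{now}", ["/data/to/backup"])

print(shlex.join(init_cmd))
print(shlex.join(create_cmd))
```

The commands are only printed rather than run, so nothing touches a real server until the placeholders are swapped for the actual setup.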