This is based on Whisper by OpenAI. It seems pretty easy to use, and it can be configured to use different models. Definitely something that could be easier than running Whisper from the command line.
You can read more about it here: https://thewh1teagle.github.io/vibe/