For me the biggest benefits are:
- Your queries don’t ever leave your computer
- You don’t have to trust a third party with your data
- You know exactly what you’re running
- You can tweak most models to your liking
- You can feed it sensitive information and not worry about it
- It works entirely offline
- You can run several models
This is all very nuanced and there isn’t a clear-cut answer. It really depends on what you’re running, how long you run it, your device specs, etc. The LLMs I mentioned in the post ran just fine and did not cause any overheating as long as they weren’t used for extended periods. You absolutely can run a SMALL LLM without frying your processor if you don’t overdo it.
Of course that is something to be mindful of, but that’s not what the person in the original comment said. It does run, but you need to be aware of the limitations and potential consequences. That goes without saying, though.
Don’t overdo it and your phone will be just fine.