How to use the AI Llava model on the computer and the benefits (for us and society)
Llava is a large multimodal model with 'visual' capabilities. This is how it works
3' min read
3' min read
Our personal image analyser with artificial intelligence, put on our computer, at our complete disposal. This is what we would get by loading Llava, a large multimodal model with 'visual' capabilities, locally, i.e. on a computer.
Because Llava on the computer
We tested it on the Lm Studio programme. It is true that multimodality has made great strides in recent months and is now available free of charge with OpenAi's new Gpt 4o, even on mobile apps. There are, however, several advantages to bringing such a system onto a computer, in addition to educational purposes, i.e. to study how these models work (they underpin a growing number of services, so practising with them is a good idea for the future).
Meanwhile, Gpt 4o is also available free of charge, but is limited in its functions for users who do not pay for a subscription. Some users also have an interest in keeping data and images they want to feed to the AI private on their PCs, instead of having them travel on the cloud.
With a programme like Lm Studio, this can be done quite easily. It is also a good way in general to try Llava, which when it came out (in October 2023) was also conveniently accessible for free via the web. The various online services offering it now, however, unlike last year, are overloaded and force long queues.
