Michele Dolfi
|
4cc6e3ea5e
|
feat: Describe pictures using vision models (#259)
* draft for picture description models
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* vlm description using AutoModelForVision2Seq
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* add generation options
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* update vlm API
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* allow only localhost traffic
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* rename model
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* do not run with vlm api
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* more renaming
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* fix examples path
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* apply CLI download login
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* fix name of cli argument
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
* use with_smolvlm in models download
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
---------
Signed-off-by: Michele Dolfi <dol@zurich.ibm.com>
|
2025-02-07 16:30:42 +01:00 |
|