By fine-tuning an image-captioning transformer, I made a simple Streamlit app that generates a one-line description of a scene from GTA.
Pushed the model to Hugging Face and updated the code to load it from there. Deployed the app on Streamlit for a public demo.
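For readers curious how an app like this might load a model from the Hugging Face Hub, here is a minimal sketch. It is not the project's actual code: the repo id `your-username/gta-captioner` is a placeholder, the helper names `load_captioner`, `one_line`, and `render_app` are made up for illustration, and it assumes the fine-tuned model works with the `transformers` `image-to-text` pipeline.

```python
# Placeholder repo id; the real model name is not stated in this devlog.
MODEL_ID = "your-username/gta-captioner"


def load_captioner(model_id: str = MODEL_ID):
    """Download (or reuse a cached copy of) the model from the Hugging Face Hub."""
    # Imported lazily so the helpers below can be used without transformers installed.
    from transformers import pipeline

    # Assumes the model is compatible with the image-to-text pipeline.
    return pipeline("image-to-text", model=model_id)


def one_line(outputs) -> str:
    """Collapse the pipeline output (a list of dicts) into a single-line caption."""
    text = outputs[0]["generated_text"] if outputs else ""
    # Squash newlines and repeated spaces so the caption stays on one line.
    return " ".join(text.split())


def render_app():
    """Draw the Streamlit page: upload a screenshot, show it, print the caption."""
    import streamlit as st
    from PIL import Image

    st.title("GTA scene captioner")
    uploaded = st.file_uploader("Upload a screenshot", type=["png", "jpg", "jpeg"])
    if uploaded is not None:
        image = Image.open(uploaded).convert("RGB")
        st.image(image)
        # cache_resource keeps the loaded model in memory across Streamlit reruns.
        captioner = st.cache_resource(load_captioner)()
        st.write(one_line(captioner(image)))
```

Saved as `app.py` with a final `render_app()` call, a script like this could be started with `streamlit run app.py`. Loading the model by repo id means the app does not need the weights checked into its own repository.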