Summer of Making

Arush Garg worked on GTA Image Captioner

2h 52m • 4 days ago

Fine-tuned the GIT-large model instead, for better performance.

Arush Garg worked on GTA Image Captioner

2h 10m • 6 days ago

Pushed the model to Hugging Face and updated the code to work with the Hugging Face model. Deployed the app to Streamlit for a public demo.

Arush Garg worked on GTA Image Captioner

2h 28m • 7 days ago

Refined the app, added weight download, and made a quick demo video.

Arush Garg worked on ScrollPilot

4h 49m • 9 days ago

Created a demo and made the code more future-proof.

Arush Garg worked on GTA Image Captioner

8h 13m • 10 days ago

Fine-tuned the first version of the model and got a basic app working.

Arush Garg created a project

10d ago

GTA Image Captioner

By fine-tuning an image captioning transformer, I made a simple Streamlit app that can give a one-line description for a scene from GTA.

4 devlogs 0 followers Shipped

Arush Garg worked on ScrollPilot

8h 20m • 20 days ago

Updated the search code to run in parallel (learned TypeScript promises in the process) and fixed bugs in the extension.
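As a rough illustration of running searches in parallel with promises, here is a minimal sketch. It is not the extension's actual code: `SearchFn`, `searchAll`, and `fakeSearch` are hypothetical names made up for this example, and the real search call is replaced by a stand-in.

```typescript
// Hypothetical search function type: takes a query, resolves to video IDs.
type SearchFn = (query: string) => Promise<string[]>;

// Start every search at once, then wait for all of them together.
// Results come back in the same order as the input queries.
// A failed search is mapped to an empty list so it does not reject the whole batch.
async function searchAll(queries: string[], search: SearchFn): Promise<string[][]> {
  const pending = queries.map((q) => search(q).catch((): string[] => []));
  return Promise.all(pending);
}

// Stand-in for a real search call (assumption, for illustration only).
const fakeSearch: SearchFn = async (query) => {
  if (query === "") throw new Error("empty query");
  return [`${query}-1`, `${query}-2`];
};

// Usage: three searches run concurrently; the empty query fails and yields [].
searchAll(["cats", "", "dogs"], fakeSearch).then((results) => console.log(results));
```

Catching each promise individually is a design choice for this sketch: plain `Promise.all` rejects as soon as any one search fails, which would throw away the results of the searches that succeeded.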

Arush Garg worked on ScrollPilot

7h 51m • 22 days ago

Created a basic extension with youtube-sr to search for videos and play them using the YouTube IFrame API.

Arush Garg created a project

23d ago

ScrollPilot

Scroll YouTube Shorts in VS Code with this extension while you vibe code!

3 devlogs 0 followers Shipped

Arush Garg worked on Conversational Therapist

46m • about 2 months ago

Fixed bugs and added a spinner for a better user experience.

Arush Garg worked on Conversational Therapist

2h 1m • about 2 months ago

Added an option to choose between text-to-speech services (ElevenLabs and gTTS).

Arush Garg worked on Conversational Therapist

3h 10m • about 2 months ago

Completed basic LLM integration and deployed to Streamlit Cloud.

Arush Garg worked on Conversational Therapist

7h 27m • about 2 months ago

Tried a few different approaches for speech detection, and finally finished the speech-to-speech part.

TODO: Refine the app, add LLM logic

Arush Garg created a project

52d ago

Conversational Therapist

Implemented a real-time speech-to-speech system to democratize access to personalized therapy sessions.

4 devlogs 0 followers Shipped

Arush Garg joined Summer of Making

57d ago

This was widely regarded as a great move by everyone.

Arush Garg
