Please sign in to access this page
pre google style search engine.
Retroz
Whoops! Looks like they don't have a project yet. Maybe ask them to start one?
aryan
Whoops! Looks like they don't have a project yet. Maybe ask them to start one?
Aly
Check their projects out: Captain Redbeard's Adventure, Aly's Personal Website
Mohit Tiwari
Check their projects out: Lunar, Dogs Summit, Mohit's AI agent (slack bot)
jimmy
Check their project out: Networking Game
cskartikey
Check their projects out: Orpheus Editor, som, Summer of Making!
Wojtek
Check their projects out: ReactorTech, Webii, Unneh Engine
Once you ship this you can't edit the description of the project, but you'll be able to add more devlogs and re-ship it as you add new features!
Fix error when vpn the maxmind cant get the city name witch caused error for the weathers so just now says the country name. also news feeds worrrrkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk.
https://github.com/yavuz/news-feed-list-of-countries/ good git repo and i have now added news sources for each country
FUN story for this one so i created a boosting algo it would boost if it has domain like if searched facebook you would get facebook.com well it wasnt working if you could see in past posts buttt i had a bad Index Desynchronization bug there is a bm25 file that listed all bm25 indexs and also line index file stored all the index for the file but it was off. why was it off line endings are two characters (\r\n) when i had one (\n) so it would of worked on linux or mac but on windows nooooooooooooooooooooo. so okay now it works :)))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))))
I did something okay so i put BM25 formula. Score = sum over terms of [inverse document frequency × (term frequency × (saturation + 1)) ÷ (term frequency + saturation × (1 − normalization + normalization × (document length ÷ average document length)))]. honsetly its so confusing. and now has some more issues so il have to work on it.
IT WORKS. okay okay has some issues like the algo needs to be better and such. like Wikipedia is higher up when you search google but yeah. another issue is that the data on lines is greater than 1mb and having issues with then loading them as its big. but i did it. will keep you updated as i go. but enjoy a little demo i made
The data can be empty and ughhhhhhhhhhhhhhh thats why i was trying to parse metatags and such but its emptyyyyyy.
okay so I got searching done. well its very basic but i hateee how slow it is so most search engines use Inverted Index. well so lets do that but it would have to go through all the code again. woop woop time to script. well so first i want to remove any failed scraping. expect alot more devlogs
Haven't shown some code in while so here's the latest and greatest python scraper. its moves fast about 60 sites a second.
The scraping taking while i been updating and working on scraper while it does its thing i worked on ui. So the news is getting though rss feeds and market through npm package to get yahoo finance. i been working on not relying on api and rate limits so i have to get location for news i using maxmind GeoLite2-City.mmdb. to get the users country and location for weather and news.
Step one of building a search engine is getting data. i have got a list of top 10 million sites. i built a simple python bot to scrape the meta data for each site.