Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I would recommend adding technical blogs. Not by hand, but if you can automate identifying some. Many are small but have good content.

Edit: also some corporate technical documentation like Mozilla, Microsoft, IBM, etc have many such developer pages.



I automate it by pulling urls out of HN, programmer Reddit, etc. Right now my only source of page content is the Common Crawl, which is why there are relatively few web pages indexed. That will change.

A next step is to index entire sites, not just individual pages, based on the positive votes their links get.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: