How can i develop a news website like yahoo news?
Question:
Answer:
The simple answer is that if you have to ask, you don't have the prerequisite skills to do it and anything we answer will not be sufficient for you to do it.
It works by scanning for keywords or picking up RSS feeds.
I agree with the the first response, if you do not have a clue on this how to begin something, you really should look somewhere else where you have more knowledge of.
10 points to answer #1.
It's just using RSS feeds. Not that big a deal actually. The hard part is that the rss feeds from the news services are not free nor even cheap.
A poor man's cheat on subscribing to the AP would be to scan the front lines of several sources such as Rueters, CNN, etc. Then in your spider code rank by keyword associations. If the weight meets the correct criteria then you can update your display by loading the link in your database or programatically changing static html. Far easier to use PHP and periodically rescan your headline table. That's just a matter or putting a refresh value in your html.
Using PHP on a Linux system you can easily use wget to scan headlines and capture links to headlines. From PHP you can then analyze the results against a database driven keyword rank association. For example if the word Yankees appears then it's probably a baseball related article. In the case of CNN the URL itself contains a catagory link. Likewise your poor man's web spider using wget can scan headlines from specific topic specific URLS into specific dirs. This will give you a catagorization clue which will be important in custom filtering.
You can also write your own web spider using PERL, Rebol or other good scripting engines. Wget is probably much easier. The more detailed you want your filtering the more complex the PHP becomes and the more useful it might be to do some pre-processing on the SQL entries.
You can if you're rich.. but then... if I'm rich... I'll just go on holiday all year round.
More Questions & Answers...