Both tools have online demos that you can test your skills with. The plan is to use these miners to grab headlines from our national papers and display them on the portal main page so that people can click and go to the appropriate site for full content.
Azkar bhai, what exactly are you looking for? Do you want to pull the headlines once a day? Every few hours?? This can be done with a simple script, formatted completely to one's specifications. Infact I think that will probably be the best solution. Even better would be to get in touch with those newspapers, and ask them to provide an rss feed specifically for you guys.
I think a Perl script can be setup with a cronjob to extract the latest news and publish it as an XML feed .. i have written couple of these perl scripts to extract data from websites.. so let me knw if you need any help :)
aah c’mon man… don’t let a few rotten experiences affect your overall proclivity. We all value your input here and I for one learn a lot from your experiential accounts.
every 4 hours. i used to use newsclipper but dont have the time and energy to write all the codes again. thats why i wanted to use something different.
the news people, ughh .. forget them.
mm.. did that for someone using python a while ago..(it was all parsing with no Form Feeds)
but problem with pakistani URDU newspapers is that they have the news in Image format..
for english ..YES i would say RSS/RDF.
You can use any language for doing that. with PHP having lots of base support. for that.
would luv to help it out here..
I actually had not written this when you made this post :p.
I just had an idea that how is this possible.. so after some struggle of a couple of hours.. here is the code…
woh open source wala cheeta kahan hai jo kuch din pehlay naray laga raha tha farangi programmes k? lol
I was waiting and watching the way programmers on this forum think of doing this.
To all those RSS lovers: RSS is not for programmers.