Stripping Code...
I have been asked to build a site for a sports handicapper and he claims there is a program that strips paragraphs from major sites like ESPN and automatically updates the html on his page. I am trying to understand how this is done and I've got a few ideas, please tell me if I'm on the right track and/or how I should make this happen.
Make a script that gathers the source code for ESPN.com. (I am assuming the paragraphs giving brief tidbits of sports information is always in the same part of the code) Grab the specific lines where the articles are written and add those to a database or some other type of dynamic data container (text file, etc.). The homepage I am building would then reference that file and write the articles.
Is that right? How would I grab the source code? How would I strip a specific article?
If you have heard of this program please tell me the name of it. Thanks.
detox posted this at 00:31 — 3rd April 2001.
They have: 571 posts
Joined: Feb 2001
there are some programs out there that take content off the web for you,
drawing a blank at the moment, will get back to you.
Brian Farkas posted this at 00:50 — 3rd April 2001.
They have: 1,015 posts
Joined: Apr 1999
Try this:
http://cgi.resourceindex.com/Programs_and_Scripts/Perl/Content_Retrieval/
Want to join the discussion? Create an account or log in if you already have one. Joining is fast, free and painless! We’ll even whisk you back here when you’ve finished.