[thelist] PHP html Feed modifying/parsing

Jay Blanchard jay.blanchard at niicommunications.com
Wed Jul 9 07:48:53 CDT 2003


[snip]
1) Have the server download the page every hour and store it locally on 
my server.
2) Search it to find the text <a href= ... </a> AFTER a certain point 
in the document <---startlist--> for example.
3) Save that information to a file that can be easily included on pages.

I assume the first part would need some kind of cron job or something 
different than php.  Not really sure how I'd implement that.  If 
absolutely necessary I could set something up to do it once a day.  The 
important part is step 2.

I'm not asking you to do this for me, but if you could give me a link 
to a tutorial which i've had trouble finding via google or at least a 
list of the functions I'll need in PHP.  I've been looking through the 
PHP.net manual and there's 10 billion str functions.
[/snip]

Have a look at the CURL functions in PHP for retrieving the HTML. Then
you can use regex to locate your start and end points, etc.

HTH!

Jay


More information about the thelist mailing list