Tycoon Talk
Become a Big fish!
The number 1 forum for online business!
Post topics, ask questions, share your knowledge.
Tycoon Talk is part of Freelancer.com - find skilled workers online at a fraction of the cost.

Coding Forum


You are currently viewing our Coding Forum as a guest. Please register to participate.
Login



Reply
Content Extracting Tool
Old 02-28-2006, 09:09 PM Content Extracting Tool
Average Talker

Posts: 20
Trades: 0
Hey guys,

I'm looking for a script or other type of software program that allows me to crawl my ecommerce website in order to generate a CSV or XML datafeed with all my products.

Unfortunately my company is using a pretty ol JSP based software and there is not exporting function for the database.

So I thought I can include certain comment-tags into my website and have a software extract all the product info such as price, description, category etc.

Does anyone know any good software tools? Or is there another way to do it?

Thanks a lot for your help!!

Kai
brakai295 is offline
Reply With Quote
View Public Profile
 
 
Register now for full access!
Old 02-28-2006, 10:20 PM Re: Content Extracting Tool
Anacrusis's Avatar
Defies a Status

Posts: 2,099
Name: Adam
Location: Colchester CT
Trades: 0
Are your products stored in a database? It might be easier to write a quick script to pull the products out of the db
Anacrusis is offline
Reply With Quote
View Public Profile
 
Old 02-28-2006, 11:26 PM Re: Content Extracting Tool
Average Talker

Posts: 20
Trades: 0
Quote:
Originally Posted by Anacrusis
Are your products stored in a database? It might be easier to write a quick script to pull the products out of the db
Yeah, i know it sounds much easier, but we are getting an encrypted datafeed from several external suppliers. i think spidering the website is less complicated :-( any suggestions?

THANKS
kai
brakai295 is offline
Reply With Quote
View Public Profile
 
Old 03-05-2006, 06:17 PM Re: Content Extracting Tool
sdcdesign.co.uk's Avatar
Extreme Talker

Posts: 198
Location: High Wycombe, Buckinghamshire, London
Trades: 0
Maybe this is longwinded...

But have a tag like this:

Code:
<! prodInfo="thiswillbeafixedlength" !>
Then in your crawling page find the line that equals

<! prodInfo="??????????????????????" !>

and your gonna have to find a way in which you can tell your script that '?' equals any character.



Maybe thats just outta me bum :P
__________________
[ Insert witty, yet highly intelligent signature here ]
sdcdesign.co.uk is offline
Reply With Quote
View Public Profile Visit sdcdesign.co.uk's homepage!
 
Old 04-26-2006, 10:45 PM Re: Content Extracting Tool
Average Talker

Posts: 20
Trades: 0
Hi,

we are still looking for someone that can help us, crawl our pages. Anyone know a good tool/software or company that can help?

THanks,

kai
brakai295 is offline
Reply With Quote
View Public Profile
 
Old 05-03-2006, 12:23 AM Re: Content Extracting Tool
Banned

Posts: 6
Trades: 0
I think spidering the website is easier
airpr23 is offline
Reply With Quote
View Public Profile
 
Old 05-03-2006, 12:44 AM Re: Content Extracting Tool
ADAM Web Design's Avatar
Canadastaninianite

Posts: 5,938
Name: Adam for web page design, not program
Location: Toronto, Ontario, Canada
Trades: 0
If your server supports ASP, you could crawl the site using ASPTear. I haven't tried the commercial component yet, but the free one's not bad (although it can run a bit heavy sometimes.)
__________________

Please login or register to view this content. Registration is FREE
|
Please login or register to view this content. Registration is FREE
(my blog)


Please login or register to view this content. Registration is FREE
(with proof)
ADAM Web Design is offline
Reply With Quote
View Public Profile Visit ADAM Web Design's homepage!
 
Reply     « Reply to Content Extracting Tool
 

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are Off
Pingbacks are Off
Refbacks are Off





   
RSS Feed  Feeds: RSS   JS   XML
RSS Feed  Feeds for this forum: RSS   JS   XML



Page generated in 0.26357 seconds with 12 queries