indexing - Create a search engine on specific sites and gather specific info -


i need create search engine crawls thru list of websites , searches there query, , website return data in various formats , structures, need collect specific info (in unique structure) these websites.

is there way can existing engine such google custom search engine?
or better of creating own? if yes, what's first step should take towards learning indexing , searching these website efficiently , without filling servers unuseful trash.

so sum up, besides searching query on each of these websites' search box, need handle results of each of them appropriately , lay on in union structure in 1 place altogether. results parsed , extracted 4-6 fields (unless, of course, there way google cse.

google cse provides interfaces standard google web search. can control user interface , search parameters, have no control on indexing, nor direct access index data.

you might more interested in google search api's available gae. these quite different: search services in provide data , control indexes.


Comments

Popular posts from this blog

PHPMotion implementation - URL based videos (Hosted on separate location) -

javascript - Using Windows Media Player as video fallback for video tag -

c# - Unity IoC Lifetime per HttpRequest for UserStore -