<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://arachnode.net/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>Search results matching tags 'robots.txt' and 'Globalization'</title><link>http://arachnode.net/search/SearchResults.aspx?o=DateDescending&amp;tag=robots.txt,Globalization&amp;orTags=0</link><description>Search results matching tags 'robots.txt' and 'Globalization'</description><dc:language>en-US</dc:language><generator>CommunityServer 2008.5 SP1 (Debug Build: 31106.3070)</generator><item><title>Help Needed</title><link>http://arachnode.net/forums/p/691/11072.aspx#11072</link><pubDate>Tue, 06 Oct 2009 22:37:34 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:11072</guid><dc:creator>vishalishere</dc:creator><description>&lt;p&gt;Hi,&lt;/p&gt;
&lt;p&gt;I would greatly&amp;nbsp;appreciate&amp;nbsp;any help, I am new to AN, i managed to get it up and running, It was actually not bad, just 4 easy steps and it&amp;#39;s up and running, now here is what i am trying to achieve,&amp;nbsp;&lt;/p&gt;
&lt;p&gt;1) I have a list of web sites approximately 200 sites (these are job sites, job aggregators, companies having job posting pages etc)&lt;/p&gt;
&lt;p&gt;2) I want to use AN to crawl each site and extract the content crawled and store it in db, because each job would have some title (hopefully) tag the result appropriately, for example a software engineer job with skillset as C#, ASP.net may be tagged as &amp;quot;Software Engineer&amp;quot;, &amp;quot;ASP.net&amp;quot;, &amp;quot;Developer&amp;quot;, &amp;quot;C#&amp;quot;, &amp;quot;May be job location&amp;quot;&lt;/p&gt;
&lt;p&gt;3) Repeat this process every 2 days and update the database&lt;/p&gt;
&lt;p&gt;4) while crawling if the job posts have email addresses, phone numbers, web addresses then store them separately but link them to crawled content&lt;/p&gt;
&lt;p&gt;5) AN to run as a service&lt;/p&gt;
&lt;p&gt;6) Afterwards I want to put a WebApp that shows the results on a web page based on user entered criteria ran over the crawled results,&amp;nbsp;&lt;/p&gt;
&lt;p&gt;I know this is quite a lot to ask, I am also trying to get it going myself but it would be much quicker if I can get a helping hand on this.&lt;/p&gt;
&lt;p&gt;With Regards,&lt;/p&gt;
&lt;p&gt;Vishal&lt;/p&gt;
&lt;p&gt;vishalishere@msn.com&lt;/p&gt;</description></item></channel></rss>