<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://arachnode.net/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>General Questions</title><link>http://arachnode.net/forums/7.aspx</link><description /><dc:language>en</dc:language><generator>CommunityServer 2008.5 SP1 (Debug Build: 31106.3070)</generator><item><title>Re: What is the effect of recrawl on WebPages_MetaData &amp; WebPage</title><link>http://arachnode.net/forums/thread/10730.aspx</link><pubDate>Mon, 17 Aug 2009 17:01:31 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:10730</guid><dc:creator>arachnode.net</dc:creator><slash:comments>0</slash:comments><comments>http://arachnode.net/forums/thread/10730.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=10730</wfw:commentRss><description>&lt;p&gt;IM when you are ready to move forward, of you already haven&amp;#39;t.&lt;/p&gt;
&lt;p&gt;-Mike&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>Re: What is the effect of recrawl on WebPages_MetaData &amp; WebPage</title><link>http://arachnode.net/forums/thread/10701.aspx</link><pubDate>Sat, 15 Aug 2009 00:28:42 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:10701</guid><dc:creator>arachnode.net</dc:creator><slash:comments>0</slash:comments><comments>http://arachnode.net/forums/thread/10701.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=10701</wfw:commentRss><description>&lt;p&gt;Those dates are Database row &amp;quot;timestamps&amp;quot;.&lt;/p&gt;
&lt;p&gt;It gets populated/updated when the WebPage source changes.&lt;/p&gt;
&lt;p&gt;-Mike&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>Re: What is the effect of recrawl on WebPages_MetaData &amp; WebPage</title><link>http://arachnode.net/forums/thread/10700.aspx</link><pubDate>Sat, 15 Aug 2009 00:27:59 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:10700</guid><dc:creator>arachnode.net</dc:creator><slash:comments>0</slash:comments><comments>http://arachnode.net/forums/thread/10700.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=10700</wfw:commentRss><description>&lt;p&gt;Yes - each WebPage and WebPage_MetaData is tied to the WebPage it crawled - and AbsoluteUri is an AbsoluteUri is an AbsoluteUri...&lt;/p&gt;
&lt;p&gt;The crawl process modifies the row, if existing.&lt;/p&gt;
&lt;p&gt;If an AbsoluteUri is in the DisallowedAbsoluteUris table it won&amp;#39;t be crawled.&lt;/p&gt;
&lt;p&gt;-Mike&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>Re: What is the effect of recrawl on WebPages_MetaData &amp; WebPage</title><link>http://arachnode.net/forums/thread/10699.aspx</link><pubDate>Fri, 14 Aug 2009 15:35:34 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:10699</guid><dc:creator>dbs2000</dc:creator><slash:comments>0</slash:comments><comments>http://arachnode.net/forums/thread/10699.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=10699</wfw:commentRss><description>&lt;p&gt;Sorry missed out one more question.&lt;/p&gt;
&lt;p&gt;What does the lastDiscovered and lastModified dates tell us here exactly (WebPage). I have noticed that the lastModifed field is mostly null. When does it get populated.&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item><item><title>What is the effect of recrawl on WebPages_MetaData &amp; WebPage</title><link>http://arachnode.net/forums/thread/10697.aspx</link><pubDate>Fri, 14 Aug 2009 15:22:00 GMT</pubDate><guid isPermaLink="false">a2478770-777f-41ab-83b8-a21ff47ebb1f:10697</guid><dc:creator>dbs2000</dc:creator><slash:comments>0</slash:comments><comments>http://arachnode.net/forums/thread/10697.aspx</comments><wfw:commentRss>http://arachnode.net/forums/commentrss.aspx?SectionID=7&amp;PostID=10697</wfw:commentRss><description>&lt;p&gt;Hi,&lt;/p&gt;
&lt;p&gt;I have a small doubt. Does the WebPages_MetaData text /
xml content change (or get updated) when I recrawl the page the next
day. Or does it create a new WebPageID (in WebPage) &amp;amp; enter a new record in
WebPages_MetaData table if it gets recrawled. Or does it check for any
modifcations in the page (or its contents) &amp;amp; if modified then
creates a record or updates the existing WebPages_MetaData record?&lt;/p&gt;
&lt;p&gt;Also
does the recrawl take into account what is there in the
disallowed table &amp;amp; bypass crawling if the url is present over
there?&lt;/p&gt;
&lt;p&gt;Please note that the recrawl that I am talking about will be in separate runs on different dates.&lt;/p&gt;
&lt;p&gt;Thanks&lt;/p&gt;
&lt;p&gt;Debasish&lt;/p&gt;&lt;div style="clear:both;"&gt;&lt;/div&gt;</description></item></channel></rss>