arachnode.net
An Open Source C# web crawler with Lucene.NET search using SQL Server 2008/2012/2014/2016/CE An Open Source C# web crawler with Lucene.NET search using MongoDB/RavenDB/Hadoop
Search the Live Index Does arachnode.net scale? | Download the latest release

Frames

rated by 0 users
Answered (Verified) This post has 1 verified answer | 3 Replies | 2 Followers

Top 25 Contributor
23 Posts
JCrawl posted on Fri, Jul 17 2015 4:46 PM

Hello 

 

I have noticed that the crawlreqests do not contain the content found in frames. Is it possible to get this? Is there something I did not set right?

 

Thank you 

 

Answered (Verified) Verified Answer

Top 10 Contributor
1,905 Posts
Verified by arachnode.net

OK, frames and iframes are different things.

http://stackoverflow.com/questions/1079128/whats-the-difference-between-iframe-and-frame

In this example, the iframe is an inline element - and if it isn't being populated it is very likely due to needing the associated JavaScript in the WebPage to do so.

Use the Renderers: http://arachnode.net/blogs/arachnode_net/archive/2015/04/01/ajax-dynamic-content.aspx

Mike

For best service when you require assistance:

  1. Check the DisallowedAbsoluteUris and Exceptions tables first.
  2. Cut and paste actual exceptions from the Exceptions table.
  3. Include screenshots.

Skype: arachnodedotnet

All Replies

Top 10 Contributor
1,905 Posts

Which page, specifically?

For best service when you require assistance:

  1. Check the DisallowedAbsoluteUris and Exceptions tables first.
  2. Cut and paste actual exceptions from the Exceptions table.
  3. Include screenshots.

Skype: arachnodedotnet

Top 25 Contributor
23 Posts
JCrawl replied on Fri, Jul 24 2015 9:06 PM

Hello 

 

Sorry been on Vacation... In my example, I cannot access the product description as the contents are in a frame

 

http://www.amazon.com/First-Baby-Annabell-Soft-Doll/dp/B00FBWB9A2

 

I can see the contents when I view source but when I look at the content dnloaded from AN it is missing 

<iframe id="product-description-iframe" class="ap_never_hide" width="100%" scrolling="no" frameborder="0" style="height: 86px;">

</iframe>

 

The contents are missing....

 

 

Top 10 Contributor
1,905 Posts
Verified by arachnode.net

OK, frames and iframes are different things.

http://stackoverflow.com/questions/1079128/whats-the-difference-between-iframe-and-frame

In this example, the iframe is an inline element - and if it isn't being populated it is very likely due to needing the associated JavaScript in the WebPage to do so.

Use the Renderers: http://arachnode.net/blogs/arachnode_net/archive/2015/04/01/ajax-dynamic-content.aspx

Mike

For best service when you require assistance:

  1. Check the DisallowedAbsoluteUris and Exceptions tables first.
  2. Cut and paste actual exceptions from the Exceptions table.
  3. Include screenshots.

Skype: arachnodedotnet

Page 1 of 1 (4 items) | RSS
An Open Source C# web crawler with Lucene.NET search using SQL 2008/2012/CE

copyright 2004-2017, arachnode.net LLC