The output iterators *might* be broken?
Reported by Wildgoose | December 15th, 2008 @ 10:40 AM
Hi, I built a quick scrapper which returned me some data where the .to_hash looks like an array of hashes, with each hash being just a single named value. In other words I have the XML being an array of s and each of these contains a single element
I was able to use the output of scrubyt by taking the .to_hash and iterating over it. However, it appeared that I should also be able to do something like:
scrubyt_instance.each {|i| puts i}
However, this lead to nothing but empty lists and hashes no matter what I tried. No could I see how to get statistics out of this instance (it's clearly a class level function and doesn't even apply to an instance?)
The barrier to debugging this further is that I don't understand how the output iterators are really supposed to work, nor how the iterators are being implemented. Can you give some information on the entry point here and some examples of how it's supposed to work?
I do think it would be quite helpful if all the examples could be turned into tests so that they both document the internet and check that it's all working. Happy to assist with that, but could you help get it started and indicate whether you will receive patches..?
Thanks again for scrubyt!
Comments and changes to this ticket
-
Peter Szinek December 16th, 2008 @ 03:49 AM
- Assigned user set to Peter Szinek
- State changed from new to open
Could you post the example? It's kind of shooting in the dark without one...
Please Sign in or create a free account to add a new ticket.
With your very own profile, you can contribute to projects, track your activity, watch tickets, receive and update tickets through your email and much more.
Create your profile
Help contribute to this project by taking a few moments to create your personal profile. Create your profile ยป
A simple to learn and use, yet powerful web scraping toolkit written in Ruby.