I know that serializing an object is (to my knowledge) the only way to effectively deep-copy an object (as long as it isn’t stateful like IO and whatnot), but is one way particularly more efficient than another?
For example, since I’m using Rails, I could always use ActiveSupport::JSON, to_xml – and from what I can tell marshalling the object is one of the most accepted ways to do this. I’d expect that marshalling is probably the most efficient of these since it’s a Ruby internal, but am I missing anything?
Edit: note that its implementation is something I already have covered – I don’t want to replace existing shallow copy methods (like dup and clone), so I’ll just end up likely adding Object::deep_copy, the result of which being whichever of the above methods (or any suggestions you have 🙂 that has the least overhead.
I was wondering the same thing, so I benchmarked a few different techniques against each other. I was primarily concerned with Arrays and Hashes – I didn’t test any complex objects. Perhaps unsurprisingly, a custom deep-clone implementation proved to be the fastest. If you are looking for quick and easy implementation, Marshal appears to be the way to go.
I also benchmarked an XML solution with Rails 3.0.7, not shown below. It was much, much slower, ~10 seconds for only 1000 iterations (the solutions below all ran 10,000 times for the benchmark).
Two notes regarding my JSON solution. First, I used the C variant, version 1.4.3. Second, it doesn’t actually work 100%, as symbols will be converted to Strings.
This was all run with ruby 1.9.2p180.
results in: