I have a matching pair of static functions in a utility class that I use to convert between binary data (unsigned characters) and it’s string representation (a-f and 0-9). They seemed to work correctly but recently I tried to compile my code under Visual C++ (2010 Express) and to my dismay, they cause nothing but heap corruption errors. What am I doing wrong?
void Utility::string_to_binary(const std::string source, unsigned char* destination, unsigned int length)
{
unsigned int effective_length = min(length, (unsigned int) source.length() / 2);
for(unsigned int b = 0; b < effective_length; b++)
{
sscanf(source.data() + (b * 2), "%02x", (unsigned int*) &destination[b]);
}
}
void Utility::binary_to_string(const unsigned char* source, unsigned int length, std::string& destination)
{
destination.clear();
for(unsigned int i = 0; i < length; i++)
{
char digit[3];
sprintf(digit, "%02x", source[i]);
destination.append(digit);
}
}
Edit: Here’s a complete program that illustrates the problem.
#include <iostream>
#include <hdcs/Utility.h>
using namespace std;
int main(int argc, char* argv[])
{
//Generate some data
unsigned int size = 1024;
unsigned char* data = new unsigned char[size];
//Convert it to it's string representation
string hex;
Utility::binary_to_string(data, size, hex);
//Output it to the screen
cout << hex << endl;
//Clear the data buffer
memset(data, 0, sizeof(unsigned char) * size);
//Convert the hex string back to binary
Utility::string_to_binary(hex, data, size);
//Cleanup
delete[] data;
}
The error occurs on delete[] data.
In this code,
you seem to be writing an
unsigned intat locationsdestination,destination+1,destination+2, &c. As you approach the final bytes of yourdestinationbuffer, you will write beyond its limit.For the sake of example, let us assume that destination is a four-byte buffer, and that
sizeof (unsigned int)is 4 in your environment. Then eachsscanfis writing four bytes.The first iteration writes bytes 0, 1, 2, 3
The second iteratino writes bytes 1, 2, 3, 4
The third iteration writes bytes 2, 3, 4, 5
The final iteration writes bytes 3, 4, 5, 6
Since the buffer was only four bytes to start with, you have written beyond the end of your buffer. Boom.
EDIT
The minimum change required to avoid this particular bug follows: