I asked another question about doing large queries in GAE, to which the answer was pretty much not possible.
What I want to do is this: from an iOS device, I get all the user’s contacts phone numbers. So now I have a list of say 250 phone numbers. I want to send these phone numbers back to the server and check to see which of these phone numbers belong to a User account.
So I need to do a query: query = User.query(User.phone.IN(phones_list))
However, with GAE, this is quite an expensive query. It will cost 250 reads for just this one query, and I expect to do this type of query often.
So I came up with a crazy idea. Why don’t I host the phone numbers on another host, on another database, where this type of query is cheaper. Then I can have GAE send a HTTP request to my other server to get the desired info.
So I have two questions:
- Are there any databases more streamlined to handle these kinds of
queries, and which it would be more cheaper to do? Or will it all be
the same as GAE? - Is this overkill? Is it a good idea? Should I suck it up and pay the cost?
GAE’s datastore should be good enough for your service. Since your application looks like could be parallelized very well.
1. use phone number as
key_nameofUser.As you set number as key_name of User, the following code will increase the query speed and reduce the read operation.
2. store multi number in one datastore.
the operation cost of GAE not directly related to the entity’s size. therefore a large entity store multi data would be another way to save the operation cost.
for example, store several phone number which have the same number_prefix together.
this method could further imporve with memcache.