I am looking to start developing a relatively simple web application that will pull data from various sources and normalizing it. A user can also enter the data directly into the site. I anticipate hitting scale, if successful. Is it worth putting in the time now to use scalable or distributed technologies or just start with a LAMP stack? Framework or not? Any thoughts, suggestions, or comments would help.
Disregard my vague description of the idea, I’d love to share once I get further along.
A LAMP stack is scalable. Apache provides many, many alternatives.
Always use the highest-powered framework you can find. Write as little code as possible. Get something in front of people as soon as you can.
Focus on what’s important: Get something to work.
If you don’t have something that works, scalability doesn’t matter, does it?
Then read up on optimization. http://c2.com/cgi/wiki?RulesOfOptimization is very helpful.
Rule 1. Don’t.
Rule 2. Don’t yet.
Rule 3. Profile before Optimizing.
Until you have a working application, you don’t know what — specific — thing limits your scalability.
Don’t assume. Measure.
That means build something that people actually use. Scale comes later.