I have quite a big network in a CSV file. It containt 450k nodes and 45 000 000 relationships. As I’ve read in neo4j documentation this type of database can handle such a big network.
I’ve also read that I can use embeded server as well as stand alone one.
My question is what is the difference between them? I would like to have a server which holds its database state.
Second question is that I can use REST API to perform operations on database, an Java API to do that.
What is the difference in performance ? I would like for example to have as an output all nodes levels.
Is it possible to load graph from CSV?
What is the best solution for my problem?
The embedded database sits in the same process as your application, meaning that there’s no network overhead (so embedded is much faster). Of course both keep the data, that’s why you have a database to begin with 🙂
You can even use the embedded mode and standalone server at the same time, see: http://docs.neo4j.org/chunked/snapshot/server-embedded.html
For loading lots of data in one go, the BatchInserter should be used, see: http://docs.neo4j.org/chunked/milestone/indexing-batchinsert.html