Optimize primitive loading time
Each individual primitive seems to take a long time to load. #74 (closed) addresses this by implementing lazy loading, but we should also investigate how to optimize the loading of each individual primitive. Some of this was already discussed as part of #74 (closed).
A first step would be to profile what runs when a primitive is loaded, how often each piece runs, and how long it takes.
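As a starting point, here is a minimal sketch of such a profile, assuming that loading a primitive mostly comes down to importing its Python module. The helper name `profile_import` is hypothetical, and `colorsys` is only a stand-in for a real primitive's module path. It uses just the standard library (`cProfile`, `pstats`, `importlib`).

```python
import cProfile
import importlib
import io
import pstats
import sys


def profile_import(module_name, top=10):
    """Profile a fresh import of `module_name`; return (module, report text).

    Hypothetical helper for illustration; not part of any existing API.
    """
    # Drop any cached copy so the import really executes the module's code.
    # Only the top-level module is evicted; already-imported dependencies
    # stay cached, so this measures a "warm" import, not a cold start.
    sys.modules.pop(module_name, None)

    profiler = cProfile.Profile()
    profiler.enable()
    try:
        module = importlib.import_module(module_name)
    finally:
        profiler.disable()

    # Render the `top` most expensive entries by cumulative time as text.
    buffer = io.StringIO()
    stats = pstats.Stats(profiler, stream=buffer)
    stats.sort_stats("cumulative").print_stats(top)
    return module, buffer.getvalue()


if __name__ == "__main__":
    # Stand-in module; replace with the module of the primitive to inspect.
    _, report = profile_import("colorsys")
    print(report)
```

The report shows call counts and cumulative time per function, which covers the "how often" and "how long" questions. For a cold-start view of which imports dominate, running the interpreter with `python -X importtime` (Python 3.7+) is a complementary option.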