Improve the performance of the NL writer - several suggestions have been proposed:
- profile and see if there is low hanging fruit
- look at implementing a binary NL format
- look at implementing a C-based writer
Still need to gather data on these options to see which has the most promise.