How to parse large files using flatpack

811 Views Asked by user1723105 At 17 August 2025 at 14:10

I need to parse files that may be quite large, possibly hundreds of megabytes and millions of lines. I have been trying to do this using FlatPack. I would think the way to do this is to use the buffered parsers and the new stream methods. However, although dataset.next() returns true for the correct number of records, the Optional returned by dataset.getRecord() never contains a value.

I have looked at this example/test, but it only counts the number of records and does not actually do anything with their content.


There are 2 best solutions below

diogopontual On 14 December 2015 at 18:48

You can use the class BuffReaderParseFactory instead of DefaultParserFactory.

It reads a record from the input file only when you call next(), one record per call.
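To make the "one record per next() call" idea concrete, here is a minimal sketch in plain Java using only the JDK. It is not FlatPack code and not how FlatPack is implemented. The class name LazyDelimitedReader and its methods are hypothetical, chosen to mirror the next()/getString() style of access. It assumes a comma-delimited file with a header row and no quoted fields.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration only: this is not FlatPack's implementation.
// It reads one comma-delimited record per next() call, so only the
// current record is held in memory, however many lines the input has.
class LazyDelimitedReader implements AutoCloseable {
    private final BufferedReader reader;
    private final Map<String, Integer> columnIndex = new HashMap<>();
    private String[] current; // fields of the most recently read record

    LazyDelimitedReader(Reader source) throws IOException {
        this.reader = new BufferedReader(source);
        // Assumption: the first line is a header row naming the columns.
        String header = reader.readLine();
        if (header != null) {
            String[] names = header.split(",", -1);
            for (int i = 0; i < names.length; i++) {
                columnIndex.put(names[i].trim(), i);
            }
        }
    }

    // Advances to the next record; returns false at end of input.
    boolean next() throws IOException {
        String line = reader.readLine();
        if (line == null) {
            current = null;
            return false;
        }
        // Naive split: quoted fields containing commas are not handled.
        current = line.split(",", -1);
        return true;
    }

    // Returns the named field of the current record.
    String getString(String column) {
        if (current == null) {
            throw new IllegalStateException("no current record; call next() first");
        }
        Integer index = columnIndex.get(column);
        if (index == null || index >= current.length) {
            throw new IllegalArgumentException("unknown column: " + column);
        }
        return current[index];
    }

    @Override
    public void close() throws IOException {
        reader.close();
    }
}
```

With a reader like this, the calling code loops with while (rows.next()) and reads fields through rows.getString("name") inside the loop. Each record has to be processed before the next call to next(), because the previous record is discarded at that point.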

Kara On 07 April 2016 at 06:57

The documentation for DefaultParserFactory and BuffReaderParseFactory is not exactly helpful. Both factories are said to return a PZParser (from newDelimitedParser), but only one of them returns an actual value from a record. Based on the examples I've seen, I think BuffReaderParseFactory is just for checking performance (and so should be faster), while DefaultParserFactory holds all of the records.
