I have a program that reads in output from an external command and process line by line. However, when the external command outputs infinitely (prints some string in an infinite loop for instance), the program seems to be blocked and delayed - it buffers output and prints everything altogether when terminated. What's the best way to process infinite data input from another process, so that the program can process data "same time" as the external one is writing? Is there any high level API that works with this case, where I don't need to use monitor or other concurrency stuff? Thanks! I code in Scala, so both Scala and Java library would help. Here's Scala code segment.
val pb = new ProcessBuilder("./streamTest.py")
val p = pb.start()
val reader = new BufferedReader(new InputStreamReader(p.getInputStream()))
var line = reader.readLine()
while(line != null) {
println(line)
line = reader.readLine()
}
reader.close()
Same code segment in Java:
ProcessBuilder pb = new ProcessBuilder("./streamTest.py");
Process p = pb.start();
BufferedReader reader = new BufferedReader(new InputStreamReader(p.getInputStream()));
String line = reader.readLine();
while(line != null) {
System.out.println(line);
line = reader.readLine();
}
reader.close();
Here's an example of external script streamTest.py that doesn't work. If I change the command to simpler ones that terminate such as "ls -l", that is, let pb be new ProcessBuilder("ls -l"), the program works fine.
#! /usr/bin/python
import time
while True:
time.sleep(1)
print("Hello World!")
You need to tell Python not to buffer the output.
Following solutions are possible.
manual flush
change buffering mode of stdout
execute the script in unbuffered mode
or
or
Other solutions possible.