Given any arbitrary text file full of printable characters, how can this be converted to HTML that would be rendered exactly the same (with the following requirements)?
- Does not rely on any but the default HTML whitespace rules
- No
<pre>
tag - No CSS
white-space
rules
- No
<p>
tags are fine, but not required (<br />
s and/or<div>
s are fine)Whitespace is maintained exactly.
Given the following lines of input (ignore erroneous auto syntax highlighting):
Line one Line two, indented four spaces
A browser should render the output exactly the same, maintaining the indentation of the second line and the gap between "indented" and "spaces". Of course, I am not actually looking for monospaced output, and the font is orthogonal to the algorithm/markup.
Given the two lines as a complete input file, example correct output would be:
Line one<br /> Line two, indented four spaces
Soft wrapping in the browser is desirable. That is, the resulting HTML should not force the user to scroll, even when input lines are wider than their viewport (assuming individual words are still narrowing than said viewport).
I’m looking for fully defined algorithm. Bonus points for implementation in python or javascript.
(Please do not just answer that I should be using <pre>
tags or a CSS white-space
rule, as my requirements render those options untenable. Please also don’t post untested and/or naïve suggestions such as “replace all spaces with
.” After all, I’m positive a solution is technically possible — it’s an interesting problem, don’t you think?)
The solution to do that while still allowing the browser to wrap long lines is to replace each sequence of two spaces with a space and a non break space.
The browser will correctly render all spaces (normal and non break ones), while still wrapping long lines (due to normal spaces).
Javascript: