I'm using pandas read_html to read an html file and I'm running into an issue with nonbreaking spaces. I have data in a column of resulting data frame that should contains a string like "ABCDEF G" (three spaces between F and G). Instead I'm getting "ABCDEF G" (one space between F and G). When I inspect the html file it shows "ABCDEF G" so for some reason these three nonbreaking spaces are being changed to one space only. All single nonbreaking spaces in the html are working fine. Is there a way to get around this so it retains the three spaces between F and G?
Pandas read_html issue with  
357 Views Asked by ejyoung At
1
There are 1 best solutions below
Related Questions in PYTHON
- new thread blocks main thread
- Extracting viewCount & SubscriberCount from YouTube API V3 for a given channel, where channelID does not equal userID
- Display images on Django Template Site
- Difference between list() and dict() with generators
- How can I serialize a numpy array while preserving matrix dimensions?
- Protractor did not run properly when using browser.wait, msg: "Wait timed out after XXXms"
- Why is my program adding int as string (4+7 = 47)?
- store numpy array in mysql
- how to omit the less frequent words from a dictionary in python?
- Update a text file with ( new words+ \n ) after the words is appended into a list
- python how to write list of lists to file
- Removing URL features from tokens in NLTK
- Optimizing for Social Leaderboards
- Python : Get size of string in bytes
- What is the code of the sorted function?
Related Questions in HTML
- Delay in loading Html Page(WebView) from assets folder in real android device
- Why does a function show up as not defined
- CSS Class is not applying to element (border width,color,and style attributes)
- How to sort these using Javascript or Jquery Most effectively
- how to fill out the table with next values in array with one button
- Automatically closing tags in form input?
- Positioning child at bottom of parent with scroll
- Remove added set of rows
- Website zoomed out on Android default browser
- Twitter Bootstrap horizontal form elements on a line
- http://sigmajs.org/ les mis tutorial - why are my canvases 0 height?
- My navbar is not expanding after collapse
- when a checkbox is checked how to display a different hidden element using javascript
- Gaps Vertically Using Dividers
- Svg containers not positioning properly
Related Questions in PANDAS
- object of type 'float' has no len() when using to_stata
- Pandas date ranges and averaging the counts
- Using Pandas how do I deduplicate a file being read in chunks?
- How to count distance to the previous zero in pandas series?
- Succint way of handling missing observations in numpy.cov?
- Pandas and GeoPandas indexing and slicing
- convert kenneth French data to daily datetime format in python
- keep timezone "CET" from convert into "CEST" in python
- Calculating the difference in dates in a Pandas GroupBy object
- python.exe crashes down while interpreting 'read_csv' command of pandas library
- Column is not appended to pandas DataFrame
- reshaping and rearranging a pandas table
- csv parsing and manipulation using python
- Using StringIO with pandas.read_csv keyword arguments
- Pandas is installed but import pandas throws error
Related Questions in NON-BREAKING-CHARACTERS
- Powershell Get-Content file not found after Get-ChildItem
- in Python 3.6 - getting text using an XPath expression
- Non breaking space after an href link is being included in the displayed hyperlink
- Non-breaking spaces for use in HTML and HTML-formatted email
- How can I make testing-library's getByText() match a string including a non-breaking space ( )?
- remove non breaking space in power query
- jQuery html() function and
- NON-BREAKING HYPHEN in Word interop C# Range
- Cannot get text as in GUI
- power automate flow - html to text - odd new line
- How to unify the space width in different browsers
- Encoding issue causing difference in strings
- Is there any way to set the width of ` ` or `&nnbsp;` to zero?
- Replacing characters in R string based on raw hex values
- How can I find the exact value using xpath in selenium webdriver for text that contains ?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
It's not elegant but for now I'm doing
Then coming back and replacing the underscores with three spaces. Still looking for a better way to do this but it should work for now.