I have a .ods file, which looks pretty similar to a usual excel file. This file contains a column with hyperlinks in it. To parse the full document, I've been using the pandas read_excel() method and works perfectly well for the raw data, but the hyperlinks are lost.
Do anyone know of a solution to parse hyperlinks from an .ods file?
Looking on the web, users propose to use openpyxl to load the workbook and extract the hyperlinks, but openpyxl do not support .ods format.
You can try reading the
.odsfile as an archive and parse its content with beautifulsoup :An alternative, with
odfpy:Output :
Input used (
file.ods) :