Data extraction from confluence page

484 Views Asked by At

I want to extract few information from a confluence page. Data is stored in table format inside the confluence.

Is there any way to get this done? Im using python for this scrape.

1

There are 1 best solutions below

1
On

Apparently ... you can connect directly to the Confluence backend database; see

However:

  • I am not aware of any public documentation of the Confluence SQL schemas.
  • You can probably infer the schemas from the Confluence Hibernate (ORM) data model; see https://confluence.atlassian.com/doc/confluence-data-model-127369837.html , but you will need to understand how Hibernate mappings work.
  • I am not aware of any documentation or examples in Python. (Try Googling ...).
  • My recommendation would be to learn Java + Hibernate, or hire a Java dev to do this project for you.
  • I expect this would only work for a self-hosted Confluence installation. In the web-hosted case, you should be using the APIs.
  • Using the Confluence APIs should be simpler than database access. The APIs should be your preferred approach, unless you have real performance requirements1 that can only be met by direct database access.

1 - I am talking about real pragmatic requirements ... not just "I want it to be as efficient as possible 'cos I hate writing inefficient code" or "The boss / client said make it as fast as possible.".