XMLSyntaxError: Start tag expected, '<' not found, line 1, column 1

8.4k Views Asked by At

I am trying to build a script to query the UK National Grid API. Here is my code so far:

 import requests
 import pandas as pd
 from lxml import etree

def getXML(toDate, fromDate, dayType):

  url = "http://marketinformation.natgrid.co.uk/MIPIws-public/public/publicwebservice.asmx"
  headers = {'content type': 'application/soap+xml; charset=utf-8'}

  body="""
  <soap:Envelope xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xsd="http://www.w3.org/2001/XMLSchema" xmlns:soap="http://schemas.xmlsoap.org/soap/envelope/">
    <soap:Body>
      <GetPublicationDataWM xmlns="http://www.NationalGrid.com/MIPI/">
        <reqObject>
          <LatestFlag>N</LatestFlag>
          <ApplicableForFlag>Y</ApplicableForFlag>
          <ToDate>%s</ToDate>
          <FromDate>%s</FromDate>
          <DateType>%s</DateType>
          <PublicationObjectNameList>
            <string>LNG Stock Level</string>
          </PublicationObjectNameList>
        </reqObject>
      </GetPublicationDataWM>
    </soap:Body>
  </soap:Envelope>
  """ % (toDate,fromDate,dayType)

  response=requests.post(url,data=body,headers=headers)

  return response.content


df = pd.DataFrame(columns=("applicable_at","applicable for","name","value","generated","quality indicator","substituted","created date"))

for pd_date in pd.date_range('2019-01-01','2019-01-03'):
  day = pd_date.strftime('%Y-%m-%d')

  root = etree.fromstring(getXML(day,day,"normal"))

But I am receiving the below error:

  File "<string>", line 1
XMLSyntaxError: Start tag expected, '<' not found, line 1, column 1

I have tried following the advice provided by the National Grid (link below), but keep returning the same error.

http://marketinformation.natgrid.co.uk/MIPIws-public/public/publicwebservice.asmx?op=GetPublicationDataWM

1

There are 1 best solutions below

0
On

You have white space before the first opening "<". This is not allowed.