b82f2e7e4fdd155128a44bfbe270ea6d64704edf,kraken/lib/xml.py,,parse_page,#,31

Before Change


        doc = etree.parse(fp)
        image = doc.find(".//{*}Page")
        if image is None or image.get("imageFilename") is None:
            raise KrakenInputException("No valid filename found in PageXML file")
        lines = doc.findall(".//{*}TextLine")
        data = {"image": os.path.join(base_dir, image.get("imageFilename")), "lines": []}
        for line in lines:
            pol = line.find("./{*}Coords")

After Change


            raise KrakenInputException("Parsing {} failed: {}".format(filename, e))
        image = doc.find(".//{*}Page")
        if image is None or image.get("imageFilename") is None:
            raise KrakenInputException("No valid image filename found in PageXML file {}".format(filename))
        lines = doc.findall(".//{*}TextLine")
        data = {"image": os.path.join(base_dir, image.get("imageFilename")), "lines": []}
        for line in lines:
Italian Trulli
In pattern: SUPERPATTERN

Frequency: 3

Non-data size: 3

Instances


Project Name: mittagessen/kraken
Commit Name: b82f2e7e4fdd155128a44bfbe270ea6d64704edf
Time: 2019-10-30
Author: mittagessen@l.unchti.me
File Name: kraken/lib/xml.py
Class Name:
Method Name: parse_page


Project Name: mittagessen/kraken
Commit Name: fd62429f555169ccbdcf98e1c3197452371210a6
Time: 2019-09-05
Author: mittagessen@l.unchti.me
File Name: kraken/lib/xml.py
Class Name:
Method Name: parse_page


Project Name: mittagessen/kraken
Commit Name: e1f05f64e4618a8d76fcf8550af0e23734ef06f5
Time: 2020-10-05
Author: mittagessen@l.unchti.me
File Name: kraken/lib/xml.py
Class Name:
Method Name: parse_alto