Size: 920
Comment:
|
Size: 1041
Comment:
|
Deletions are marked like this. | Additions are marked like this. |
Line 42: | Line 42: |
---- == See also === [[https://docs.python.org/3/library/xml.sax.html|Python xml.sax module documentation]] |
Python XML SAX
xml.sax is a module for parsing XML with a SAX (Simple API for XML) parser.
Contents
Usage
from xml.sax import handler, make_parser class MyHandler(handler.ContentHandler): def __init__(self): handler.ContentHandler.__init__(self) self.in_page = False self.character_buffer = "" def startElement(self, name, attrs): if name == "page": self.in_page = True def endElement(self, name): if name == "page": self.in_page = False print(self.character_buffer) self.character_buffer = "" def characters(self, data): self.character_buffer += data def parse(filename): parser = make_parser() handler = MyHandler() parser.setContentHandler(handler) parser.parse(filename)
== See also ===
Python xml.sax module documentation