Revision 6 as of 2023-04-11 14:35:15
Python XML SAX
xml.sax is a module for parsing XML.
This parser uses the SAX (Simple API for XML) API.
Usage
from xml.sax import handler, make_parser


class MyHandler(handler.ContentHandler):
    def __init__(self):
        handler.ContentHandler.__init__(self)
        self.in_page = False
        self.character_buffer = ""

    def startElement(self, name, attrs):
        if name == "page":
            self.in_page = True

    def endElement(self, name):
        if name == "page":
            self.in_page = False
            # Print the text collected for this <page>, then reset the buffer.
            print(self.character_buffer)
            self.character_buffer = ""

    def characters(self, data):
        # May be called several times per text node; collect only inside <page>.
        if self.in_page:
            self.character_buffer += data


def parse(filename):
    parser = make_parser()
    # Named content_handler to avoid shadowing the imported handler module.
    content_handler = MyHandler()
    parser.setContentHandler(content_handler)
    parser.parse(filename)
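The same event-driven pattern can be tried without a file on disk. The following is a minimal sketch, not a definitive implementation: it assumes the input is available as bytes and uses `xml.sax.parseString` from the standard library. The names `PageCollector`, `collect_pages`, and `SAMPLE`, as well as the sample document itself, are hypothetical and exist only for illustration.

```python
import xml.sax
from xml.sax import handler


class PageCollector(handler.ContentHandler):
    """Collects the text of every <page> element (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.in_page = False
        self.buffer = []   # text chunks of the <page> currently being read
        self.pages = []    # finished page texts, in document order

    def startElement(self, name, attrs):
        if name == "page":
            self.in_page = True
            self.buffer = []

    def endElement(self, name):
        if name == "page":
            self.in_page = False
            self.pages.append("".join(self.buffer))

    def characters(self, data):
        # The parser may deliver one text node in several chunks,
        # so chunks are accumulated and joined at the end tag.
        if self.in_page:
            self.buffer.append(data)


def collect_pages(xml_bytes):
    """Parse an in-memory XML document and return the text of each <page>."""
    collector = PageCollector()
    xml.sax.parseString(xml_bytes, collector)
    return collector.pages


# Made-up sample document for illustration only.
SAMPLE = b"<wiki><title>ignored</title><page>first</page><page>second</page></wiki>"

print(collect_pages(SAMPLE))  # prints ['first', 'second']
```

Returning the collected text as a list, rather than printing it inside `endElement`, keeps the handler reusable and easy to check.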
See also
Python xml.sax module documentation: https://docs.python.org/3/library/xml.sax.html