blob: 0b1a5e1dd91fce2c47ed5ca55692f417b9176e39 (
plain)
1
2
3
4
5
6
7
8
|
2009-07-30
After chatting with Bruce D'Arcus <bdarcus@gmail.com> on zotero-dev@googlegroups.com, we came up with an idea for how to format scraping metadata.
(1) scraper metadata file format for mapping xpaths to particular attributes in particular classes
(2) messy data cleanup utils: for cleaning up the results of scraping, common parsing utilities.
- perl/python-style regular expressions
|