Skip to content

Commit 0b8bcfa

Browse files
committed
Fix a series of issues found during testing. This is the patch provide
to Yaco on 2011-03-19, and committed on branch/yaco/idsubmit as [2896]. * Extraction of Title which don't have the draft name on a separate page fails. See for instance this example: http://www.ietf.org/staging/draft-ma-cdni-publisher-use-cases-00.txt The regex should maybe be updated to permit but not require a newline before the draft filename: '(?:\n\s*\n\s*)((.+\n){1,2}(.+\n?))(\s+<?draft-\S+\s*\n)\s*\n' * If there are blank lines before the start of the author list on the first page, the author extraction will fail. This sometimes happens when there's junk at the start of a draft, see for instance http://www.ietf.org/id/draft-ietf-mpls-tp-process-00.txt . * Sometimes the Authors' Addresses section lists authors with the same workplace address on the same line: "Sam Spade and Joe Smith". This needs a fix in the author extraction code. * Sometimes the order of first name, surname is different on the first page and in the author list, and sometimes the surname is uppercase in one place, but not in the other. This also needs a fix in the author extraction code. * The header stripping code had a bug, where multiple blank lines could be replaced by a single blank line in the stripped text, which could mess up title extraction. * Title space normalization should be done also for titles from the 'unusual title format' code branch of the title extraction code. * Company names on the first page are sometimes rendered with different case than in the Authors' Addresses section. * Some drafts list the draft filename _before_ the title, rather than after the title. Permit this too. Covered in the patch. * Spanish names can be shown as either <given_name> <fathers_first_surname> <mothers_first_surname> or less formally as <given_name> <fathers_first_surname> If the first form is used in the Authors' Addresses section, but the second form (with the given name possibly abbreviated to its first letter) the author extraction will fail. * Drafts containing tabs will be caught by idnits during I-D submission, but in case the drafts.py module is used independently from idnits, convert tabs to spaces in order for the author extraction and other methods to work as expected. Example: recently submitted draft draft-bergeron-payload-rtpfec-rs-00.txt. * Found a draft with a previously unhandled header/footer format: draft-fang-mpls-tp-oam-toolset-01.txt. Tweak needed for header/footer stripping. - Legacy-Id: 2919 Note: SVN reference [2896] has been migrated to Git commit 5a34b70
1 parent 61300a9 commit 0b8bcfa

1 file changed

Lines changed: 175 additions & 87 deletions

File tree

0 commit comments

Comments
 (0)