In this convention, sequence names begin with one or more digits, followed by “f” for forward or “r” for reverse, followed by zero or more characters, followed by “.abi.” Reads for which the first four digits match comprise a pair. The characters between the “f” or “r” and the period indicate which replicate, if any, the read represents. Therefore, only the first one or more digits need match to qualify two reads as a pair:
Forward Names |
Reverse Names |
1f1.abi |
1r.abi |
124f.abi |
124r.abi, 124n1.abi |
2779f3.abi |
2779r3.abi |
An expression used for this naming system is:
Forward Name |
Reverse Name |
(\d+)f(.*)\.abi |
(\d+)r(.*)\.abi |
To be more rigorous and ensure that the matched digits are indeed the first ones preceding the f, the “^” character can be placed at the beginning of the expression:
Forward Name |
Reverse Name |
^(\d+)f(.*)\.abi |
^(\d+)r(.*)\.abi |