Convention 3

In this convention, sequence names begin with one or more digits, followed by “f” for forward or “r” for reverse, followed by zero or more characters, followed by “.abi.” Reads for which the first four digits match comprise a pair. The characters between the “f” or “r” and the period indicate which replicate, if any, the read represents. Therefore, only the first one or more digits need match to qualify two reads as a pair:

 

Forward Names

Reverse Names

1f1.abi

1r.abi

124f.abi

124r.abi, 124n1.abi

2779f3.abi

2779r3.abi

 

An expression used for this naming system is:

 

Forward Name

Reverse Name

(\d+)f(.*)\.abi

(\d+)r(.*)\.abi

 

To be more rigorous and ensure that the matched digits are indeed the first ones preceding the f, the “^” character can be placed at the beginning of the expression:

 

Forward Name

Reverse Name

^(\d+)f(.*)\.abi

^(\d+)r(.*)\.abi