somehow I have some sequences duplicated in my groups file-I don’t know how that happened. the lines are completely identical
sequencexyz groupA
sequencexyz groupA
any ideas how to find and remove the duplicates? I tried list.seqs on the names then get seqs but it selects both lines since they match the sequence name. This is a huge dataset 7.5M reads post cleaning and trimming so don’t want to just rerun trim.seqs unless there’s no other way