remove duplicate entries from groups file

somehow I have some sequences duplicated in my groups file-I don’t know how that happened. the lines are completely identical

sequencexyz groupA
sequencexyz groupA

any ideas how to find and remove the duplicates? I tried list.seqs on the names then get seqs but it selects both lines since they match the sequence name. This is a huge dataset 7.5M reads post cleaning and trimming so don’t want to just rerun trim.seqs unless there’s no other way

If you’re using mac/unix you can do…


sort file.groups | uniq > newfile.groups

thanks