BAITS.VDJ.tl.cluster_group

Contents

BAITS.VDJ.tl.cluster_group#

BAITS.VDJ.tl.cluster_group(cdr3_list, threshold=0.85)#

Cluster a list of CDR3 nucleotide sequences based on sequence identity.

Two sequences are clustered together if their pairwise identity is greater than or equal to the specified threshold.

Parameters:
  • cdr3_list (list of str) – List of unique CDR3 nucleotide sequences.

  • threshold (float, default=0.85) – Minimum pairwise identity required to connect two sequences.

Returns:

Each set contains sequences belonging to the same cluster.

Return type:

list of sets