Abstract Background Automated methods for assembling families of orthologous genes include those based on sequence similarity scores and those based on phylogenetic approaches.The first are easy to automate but usually they do not distinguish between paralogs and orthologs or have restriction on the Gaiters number of taxa.Phylogenetic methods often