Laboratory of Mathematic methods and models in bioinformatics,
Русский :: English
Identification of highly conserved elements (HCEs) in the set of genomes
The software package iHCE implements the method described in . It is aimed at the identification of HCEs in the set of relatively well assembled complete genomes. These programs have been evaluated on nuclear genomes of the superphylum Alveolata  as well as mitochondrial genomes of infusoria (the phylum Ciliophora)  and monocotyledonous plants. The package consists of the following programs intended for MPI-enabled supercomputer:
These programs assume the processing of Big Data and are intended for only 64-bit CPUs and operating systems. Different stages of the algorithm have different computational complexity and scalability. This is why the package has been split into separate programs. In order to decrease the size of files and speed up computation, the programs often use specific data formats almost without any checks. The user is fully responsible for correct compiling and interpreting of such formatted files. For example, he might create a database to store the source data in any desired format and develop himself a database application or script to get source files in the required format. This is a way we used, but we do not describe it in detail.
All programs are written in C++ and have the command line interface to specify most important parameters. Settings made in the command line have the highest priority. All adjustable parameters can be set in the configuration file, which is required and used by these programs for all parameters except modified through the command line. If the parameter is specified in neither command line nor configuration file, the default value will be set in a program, though not for every parameter. A template configuration file is provided in downloadable examples below. Short help on the command line options will be displayed if running the program with argument -? or --help.
Windows 64-bit executables (variants with and without MPI) and source codes for Linux can be downloaded through the below links. The source codes are compatible with most implementations of MPI v.1.2 and above; they are provided under the GNU General Public License (GPL) v.3.
 Rubanov L.I., Seliverstov A.V., Zverkov O.A. and Lyubetsky V.A. Method for identification of highly conserved elements and evolutionary analysis of superphylum Alveolata. (2016) BMC Bioinformatics 17:385. Open Access« back