Data
Classes and utilities for manipulating genomics data.
Fold Intervals
Subset of the data.
Returns the names of all data folds.
Returns the data folds used for the model version.
Returns the training intervals for the model version.
Genome
Represents the strand of a DNA sequence.
Represents a genomic interval.
Represents a genomic variant/mutation.
Represents a splice junction.
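To make the descriptions above concrete, here is a minimal, self-contained sketch of how a strand, an interval, and a variant might be modelled. It is not this package's implementation: the names `DnaStrand`, `GenomicInterval`, and `SimpleVariant`, their fields, and the coordinate conventions (0-based half-open intervals, 1-based variant positions) are all assumptions made for illustration.

```python
from dataclasses import dataclass
from enum import Enum


class DnaStrand(Enum):
    """Strand of a DNA sequence (hypothetical enum, for illustration only)."""
    POSITIVE = "+"
    NEGATIVE = "-"
    UNSTRANDED = "."


@dataclass(frozen=True)
class GenomicInterval:
    """Genomic interval; assumed 0-based and half-open: [start, end)."""
    chromosome: str
    start: int
    end: int
    strand: DnaStrand = DnaStrand.UNSTRANDED

    def __post_init__(self):
        # Reject negative starts and intervals that end before they begin.
        if self.start < 0 or self.end < self.start:
            raise ValueError(f"invalid interval: {self.start}-{self.end}")

    @property
    def width(self) -> int:
        """Number of bases covered by the interval."""
        return self.end - self.start

    def overlaps(self, other: "GenomicInterval") -> bool:
        """True if both intervals share at least one base (strand ignored)."""
        return (
            self.chromosome == other.chromosome
            and self.start < other.end
            and other.start < self.end
        )


@dataclass(frozen=True)
class SimpleVariant:
    """Genomic variant; position is assumed 1-based, as in VCF files."""
    chromosome: str
    position: int
    reference_bases: str
    alternate_bases: str

    def reference_interval(self) -> GenomicInterval:
        """Interval spanned by the reference allele, in 0-based coordinates."""
        start = self.position - 1
        return GenomicInterval(
            self.chromosome, start, start + len(self.reference_bases)
        )
```

Making the dataclasses frozen keeps them hashable and comparable by value, which is convenient when intervals are used as dictionary keys or collected into sets.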
Gene annotation
Valid Transcript types available in the GENCODE GTF.
Extract transcription start sites (TSS) from a DataFrame.
Filter GTF entries by transcript types.
Filter GTF entries to only protein-coding genes.
Filter GTF entries to only the longest transcript per gene.
Filter GTF to only transcripts with specific GENCODE support levels.
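The filters and the TSS extraction described above can be sketched in plain Python. This is an illustrative sketch, not this package's API: the function names, the use of a list of dicts in place of a DataFrame, and the keys (`gene_id`, `transcript_id`, `start`, `end`, `strand`, `gene_type`) are assumptions. Transcript length is measured here as genomic span, which may differ from how the package defines it.

```python
def keep_protein_coding(transcripts):
    """Keep only rows whose (assumed) 'gene_type' key is 'protein_coding'."""
    return [row for row in transcripts if row["gene_type"] == "protein_coding"]


def keep_longest_transcript(transcripts):
    """Keep one row per gene: the transcript with the largest genomic span.

    Ties are resolved in favour of the first transcript encountered.
    """
    longest = {}
    for row in transcripts:
        # GTF coordinates are 1-based and inclusive, hence the +1.
        length = row["end"] - row["start"] + 1
        best = longest.get(row["gene_id"])
        if best is None or length > best["end"] - best["start"] + 1:
            longest[row["gene_id"]] = row
    return list(longest.values())


def extract_tss_rows(transcripts):
    """Return one transcription start site (TSS) record per transcript."""
    records = []
    for row in transcripts:
        # On the '+' strand transcription begins at 'start';
        # on the '-' strand it begins at 'end'.
        position = row["start"] if row["strand"] == "+" else row["end"]
        records.append(
            {
                "gene_id": row["gene_id"],
                "transcript_id": row["transcript_id"],
                "strand": row["strand"],
                "tss": position,
            }
        )
    return records


# Made-up example rows; the identifiers and coordinates are not real annotations.
rows = [
    {"gene_id": "G1", "transcript_id": "T1", "start": 100, "end": 500,
     "strand": "+", "gene_type": "protein_coding"},
    {"gene_id": "G1", "transcript_id": "T2", "start": 100, "end": 900,
     "strand": "+", "gene_type": "protein_coding"},
    {"gene_id": "G2", "transcript_id": "T3", "start": 2000, "end": 2600,
     "strand": "-", "gene_type": "lncRNA"},
]

longest = keep_longest_transcript(rows)
tss = extract_tss_rows(longest)
```

Because each filter takes and returns the same row structure, the filters can be chained in any order before the TSS extraction.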
Ontology
Supported ontology types.
A single biological ontology term.
Track data
Container for storing track values and metadata.
Concatenates multiple
Interleaves multiple
Converts track metadata to a
Creates track metadata from a
Creates a
Transcript
Represents a transcript object containing attributes from a GTF file.
Transcript extractor from a GTF file.