SeqAn3  3.0.3
The Modern C++ library for sequence analysis.
seqan3::sam_dna16 Class Reference

A 16 letter DNA alphabet, containing all IUPAC symbols minus the gap and plus an equality sign ('='). More...

#include <seqan3/alphabet/nucleotide/sam_dna16.hpp>

+ Inheritance diagram for seqan3::sam_dna16:

Public Member Functions

Constructors, destructor and assignment
constexpr sam_dna16 () noexcept=default
 Defaulted.
 
constexpr sam_dna16 (sam_dna16 const &) noexcept=default
 Defaulted.
 
constexpr sam_dna16 (sam_dna16 &&) noexcept=default
 Defaulted.
 
constexpr sam_dna16operator= (sam_dna16 const &) noexcept=default
 Defaulted.
 
constexpr sam_dna16operator= (sam_dna16 &&) noexcept=default
 Defaulted.
 
 ~sam_dna16 () noexcept=default
 Defaulted.
 
- Public Member Functions inherited from seqan3::nucleotide_base< sam_dna16, 16 >
constexpr sam_dna16 complement () const noexcept
 Return the complement of the letter. More...
 
constexpr nucleotide_base (other_nucl_type const &other) noexcept
 Allow explicit construction from any other nucleotide type and convert via the character representation.
 
- Public Member Functions inherited from seqan3::alphabet_base< derived_type, size, char_t >
constexpr alphabet_base () noexcept=default
 Defaulted.
 
constexpr alphabet_base (alphabet_base const &) noexcept=default
 Defaulted.
 
constexpr alphabet_base (alphabet_base &&) noexcept=default
 Defaulted.
 
constexpr alphabet_baseoperator= (alphabet_base const &) noexcept=default
 Defaulted.
 
constexpr alphabet_baseoperator= (alphabet_base &&) noexcept=default
 Defaulted.
 
 ~alphabet_base () noexcept=default
 Defaulted.
 
constexpr char_type to_char () const noexcept
 Return the letter as a character of char_type. More...
 
constexpr rank_type to_rank () const noexcept
 Return the letter's numeric value (rank in the alphabet). More...
 
constexpr derived_type & assign_char (char_type const c) noexcept
 Assign from a character, implicitly converts invalid characters. More...
 
constexpr derived_type & assign_rank (rank_type const c) noexcept
 Assign from a numeric value. More...
 

Private Types

using base_t = nucleotide_base< sam_dna16, 16 >
 The base class.
 

Private Attributes

friend base_t
 Befriend seqan3::nucleotide_base.
 

Static Private Attributes

static constexpr std::array< rank_type, 256 > char_to_rank
 Char to value conversion table. More...
 
static const std::array< sam_dna16, alphabet_sizecomplement_table
 The complement table. More...
 
static constexpr char_type rank_to_char [alphabet_size]
 The representation is the same as in the SAM specifications (which is NOT in alphabetical order). More...
 

Related Functions

(Note that these are not member functions.)

using sam_dna16_vector = std::vector< sam_dna16 >
 Alias for an std::vector of seqan3::sam_dna16.
 
Literals
constexpr sam_dna16 operator""_sam_dna16 (char const c) noexcept
 The seqan3::sam_dna16 char literal. More...
 
sam_dna16_vector operator""_sam_dna16 (char const *s, size_t n)
 The seqan3::sam_dna16 string literal. More...
 

Additional Inherited Members

- Static Public Member Functions inherited from seqan3::nucleotide_base< sam_dna16, 16 >
static constexpr bool char_is_valid (char_type const c) noexcept
 Validate whether a character value has a one-to-one mapping to an alphabet value. More...
 
- Static Public Attributes inherited from seqan3::alphabet_base< derived_type, size, char_t >
static constexpr detail::min_viable_uint_t< size > alphabet_size = size
 The size of the alphabet, i.e. the number of different values it can take. More...
 
- Protected Types inherited from seqan3::alphabet_base< derived_type, size, char_t >
using char_type = std::conditional_t< std::same_as< char_t, void >, char, char_t >
 The char representation; conditional needed to make semi alphabet definitions legal. More...
 
using rank_type = detail::min_viable_uint_t< size - 1 >
 The type of the alphabet when represented as a number (e.g. via to_rank()). More...
 

Detailed Description

A 16 letter DNA alphabet, containing all IUPAC symbols minus the gap and plus an equality sign ('=').

The seqan3::sam_dna16 alphabet is the nucleotide alphabet used inside the SAM, BAM and CRAM formats. It has all the letters of the seqan3::dna15 alphabet and the extra alphabet character '=' which denotes a nucleotide character identical to the reference. Without the context of this reference sequence, no assumptions can be made about the actual value of '=' letter.

Note that you can assign 'U' as a character to sam_dna16 and it will silently be converted to 'T'. Lower case letters are accepted when assigning from char (just like seqan3::dna15) and unknown characters are silently converted to 'N'.

The complement is the same as for seqan3::dna15, with the addition that the complement of '=' is unknown and therefore set to 'N'.

int main()
{
using seqan3::operator""_sam_dna16;
seqan3::sam_dna16 my_letter{'A'_sam_dna16};
my_letter.assign_char('=');
my_letter.assign_char('F'); // unknown characters are implicitly converted to N.
seqan3::debug_stream << my_letter << '\n'; // "N";
}
constexpr derived_type & assign_char(char_type const c) noexcept
Assign from a character, implicitly converts invalid characters.
Definition: alphabet_base.hpp:159
A 16 letter DNA alphabet, containing all IUPAC symbols minus the gap and plus an equality sign ('=').
Definition: sam_dna16.hpp:46
Provides seqan3::debug_stream and related types.
debug_stream_type debug_stream
A global instance of seqan3::debug_stream_type.
Definition: debug_stream.hpp:42
Provides seqan3::sam_dna16.

Friends And Related Function Documentation

◆ operator""_sam_dna16() [1/2]

sam_dna16_vector operator""_sam_dna16 ( char const *  s,
size_t  n 
)
related

The seqan3::sam_dna16 string literal.

Returns
seqan3::sam_dna16_vector
Parameters
[in]sThe string literal to assign from.
[in]nThe length of the string literal s.

You can use this string literal to easily assign to seqan3::sam_dna16_vector:

int main()
{
using seqan3::operator""_sam_dna16;
seqan3::sam_dna16_vector foo{"ACgtTA"_sam_dna16};
seqan3::sam_dna16_vector bar = "ACG==A"_sam_dna16;
auto bax = "A=GTT!"_sam_dna16;
seqan3::debug_stream << foo << "\n" << bar << "\n" << bax << "\n";
}

◆ operator""_sam_dna16() [2/2]

constexpr sam_dna16 operator""_sam_dna16 ( char const  c)
related

The seqan3::sam_dna16 char literal.

Returns
seqan3::sam_dna16
Parameters
[in]cThe character to assign from.

Member Data Documentation

◆ char_to_rank

constexpr std::array<rank_type, 256> seqan3::sam_dna16::char_to_rank
staticconstexprprivate
Initial value:
{
[] () constexpr
{
for (auto & c : ret)
c = 15;
for (size_t rnk = 0u; rnk < alphabet_size; ++rnk)
{
ret[ rank_to_char[rnk] ] = rnk;
ret[to_lower(rank_to_char[rnk])] = rnk;
}
ret['U'] = ret['T']; ret['u'] = ret['t'];
return ret;
}()
}
static constexpr detail::min_viable_uint_t< size > alphabet_size
The size of the alphabet, i.e. the number of different values it can take.
Definition: alphabet_base.hpp:198
static constexpr char_type rank_to_char[alphabet_size]
The representation is the same as in the SAM specifications (which is NOT in alphabetical order).
Definition: sam_dna16.hpp:76
constexpr char_type to_lower(char_type const c) noexcept
Converts 'A'-'Z' to 'a'-'z' respectively; other characters are returned as is.
Definition: transform.hpp:81

Char to value conversion table.

◆ complement_table

constexpr std::array< sam_dna16, sam_dna16::alphabet_size > seqan3::sam_dna16::complement_table
staticconstexprprivate
Initial value:
{
'N'_sam_dna16,
'T'_sam_dna16,
'G'_sam_dna16,
'K'_sam_dna16,
'C'_sam_dna16,
'Y'_sam_dna16,
'S'_sam_dna16,
'B'_sam_dna16,
'A'_sam_dna16,
'W'_sam_dna16,
'R'_sam_dna16,
'D'_sam_dna16,
'M'_sam_dna16,
'H'_sam_dna16,
'V'_sam_dna16,
'N'_sam_dna16
}

The complement table.

◆ rank_to_char

constexpr char_type seqan3::sam_dna16::rank_to_char[alphabet_size]
staticconstexprprivate
Initial value:
{
'=',
'A',
'C',
'M',
'G',
'R',
'S',
'V',
'T',
'W',
'Y',
'H',
'K',
'D',
'B',
'N'
}

The representation is the same as in the SAM specifications (which is NOT in alphabetical order).


The documentation for this class was generated from the following file: