fishy.models.deep.mamba¶
Mamba model for spectral classification.
Classes
- class fishy.models.deep.mamba.Mamba(input_dim: int, output_dim: int, hidden_dim: int = 128, num_layers: int = 4, dropout: float = 0.2, d_state: int = 16, d_conv: int = 4, expand: int = 2, **kwargs)[source]¶
Bases:
ModuleMamba-based model for spectral data classification.
- __init__(input_dim: int, output_dim: int, hidden_dim: int = 128, num_layers: int = 4, dropout: float = 0.2, d_state: int = 16, d_conv: int = 4, expand: int = 2, **kwargs) None[source]¶
Initializes the Mamba model.
- Parameters:
input_dim (int) – Number of input features.
output_dim (int) – Number of output classes.
hidden_dim (int, optional) – Hidden layer dimension. Defaults to 128.
num_layers (int, optional) – Number of Mamba layers. Defaults to 4.
dropout (float, optional) – Dropout rate. Defaults to 0.2.
d_state (int, optional) – State dimension. Defaults to 16.
d_conv (int, optional) – Convolution kernel size. Defaults to 4.
expand (int, optional) – Expansion factor. Defaults to 2.
- class fishy.models.deep.mamba.MambaBlock(d_model: int, d_state: int = 16, d_conv: int = 4, expand: int = 2, dropout: float = 0.2, layer_norm_eps: float = 1e-05)[source]¶
Bases:
ModuleMamba block implementing the inner and outer functions of the Mamba model.
- class fishy.models.deep.mamba.SiameseMamba(input_dim: int, output_dim: int, hidden_dim: int = 128, num_layers: int = 4, dropout: float = 0.2, **kwargs)[source]¶
Bases:
ModuleSiamese Mamba architecture for instance recognition.
- __init__(input_dim: int, output_dim: int, hidden_dim: int = 128, num_layers: int = 4, dropout: float = 0.2, **kwargs) None[source]¶
Initialize internal Module state, shared by both nn.Module and ScriptModule.
- forward(x1: Tensor, x2: Tensor) Tensor | tuple[source]¶
Define the computation performed at every call.
Should be overridden by all subclasses.
Note
Although the recipe for forward pass needs to be defined within this function, one should call the
Moduleinstance afterwards instead of this since the former takes care of running the registered hooks while the latter silently ignores them.
s