publications
publications by category in reverse chronological order, auto-generated from Google Scholar
2025
- Generalizable, real-time neural decoding with hybrid state-space models. Avery Hee-Woon Ryoo, Nanda H. Krishna, Ximeng Mao, Mehdi Azabou, Eva L. Dyer, and 2 more authors. 2025
@misc{ryoo2025generalizablerealtimeneuraldecoding, title = {Generalizable, real-time neural decoding with hybrid state-space models}, author = {Ryoo, Avery Hee-Woon and Krishna, Nanda H. and Mao, Ximeng and Azabou, Mehdi and Dyer, Eva L. and Perich, Matthew G. and Lajoie, Guillaume}, year = {2025}, url = {https://arxiv.org/abs/2506.05320}, eprint = {2506.05320}, archiveprefix = {arXiv}, primaryclass = {q-bio.NC}, }
- MesaNet: Sequence Modeling by Locally Optimal Test-Time Training. Johannes Oswald, Nino Scherrer, Seijin Kobayashi, Luca Versari, Songlin Yang, and 11 more authors. 2025
@misc{vonoswald2025mesanetsequencemodelinglocally, title = {MesaNet: Sequence Modeling by Locally Optimal Test-Time Training}, author = {von Oswald, Johannes and Scherrer, Nino and Kobayashi, Seijin and Versari, Luca and Yang, Songlin and Schlegel, Maximilian and Maile, Kaitlin and Schimpf, Yanick and Sieberling, Oliver and Meulemans, Alexander and Saurous, Rif A. and Lajoie, Guillaume and Frenkel, Charlotte and Pascanu, Razvan and y Arcas, Blaise Agüera and Sacramento, João}, year = {2025}, url = {https://arxiv.org/abs/2506.05233}, eprint = {2506.05233}, archiveprefix = {arXiv}, primaryclass = {cs.LG}, }
- Bidirectional Information Flow (BIF) – A Sample Efficient Hierarchical Gaussian Process for Bayesian Optimization. Juan D. Guerra, Thomas Garbay, Guillaume Lajoie, and Marco Bonizzato. 2025
@misc{guerra2025bidirectionalinformationflowbif, title = {Bidirectional Information Flow (BIF) -- A Sample Efficient Hierarchical Gaussian Process for Bayesian Optimization}, author = {Guerra, Juan D. and Garbay, Thomas and Lajoie, Guillaume and Bonizzato, Marco}, year = {2025}, url = {https://arxiv.org/abs/2505.11294}, eprint = {2505.11294}, archiveprefix = {arXiv}, primaryclass = {cs.LG}, }
- Multi-agent cooperation through learning-aware policy gradients. Alexander Meulemans, Seijin Kobayashi, Johannes Von Oswald, Nino Scherrer, Eric Elmoznino, and 4 more authors. In The Thirteenth International Conference on Learning Representations, 2025
@inproceedings{meulemans2025multi-agent, title = {Multi-agent cooperation through learning-aware policy gradients}, author = {Meulemans, Alexander and Kobayashi, Seijin and Oswald, Johannes Von and Scherrer, Nino and Elmoznino, Eric and Richards, Blake Aaron and Lajoie, Guillaume and y Arcas, Blaise Aguera and Sacramento, Joao}, year = {2025}, booktitle = {The Thirteenth International Conference on Learning Representations}, url = {https://openreview.net/forum?id=GkWA6NjePN}, }
- Accelerating Training with Neuron Interaction and Nowcasting Networks. Boris Knyazev, Abhinav Moudgil, Guillaume Lajoie, Eugene Belilovsky, and Simon Lacoste-Julien. In The Thirteenth International Conference on Learning Representations, 2025
@inproceedings{knyazev2025accelerating, title = {Accelerating Training with Neuron Interaction and Nowcasting Networks}, author = {Knyazev, Boris and Moudgil, Abhinav and Lajoie, Guillaume and Belilovsky, Eugene and Lacoste-Julien, Simon}, year = {2025}, booktitle = {The Thirteenth International Conference on Learning Representations}, url = {https://openreview.net/forum?id=cUFIil6hEG}, }
- Expressivity of Neural Networks with Random Weights and Learned Biases. Ezekiel Williams, Alexandre Payeur, Avery Hee-Woon Ryoo, Thomas Jiralerspong, Matthew G Perich, and 2 more authors. In The Thirteenth International Conference on Learning Representations, 2025
@inproceedings{williams2025expressivity, title = {Expressivity of Neural Networks with Random Weights and Learned Biases}, author = {Williams, Ezekiel and Payeur, Alexandre and Ryoo, Avery Hee-Woon and Jiralerspong, Thomas and Perich, Matthew G and Mazzucato, Luca and Lajoie, Guillaume}, year = {2025}, booktitle = {The Thirteenth International Conference on Learning Representations}, url = {https://openreview.net/forum?id=5xwx1Myosu}, }
- Latent Representation Learning for Multimodal Brain Activity Translation. Arman Afrasiyabi, Dhananjay Bhaskar, Erica L. Busch, Laurent Caplette, Rahul Singh, and 3 more authors. In ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2025
Neuroscience employs diverse neuroimaging techniques, each offering distinct insights into brain activity, from electrophysiological recordings such as EEG, which have high temporal resolution, to hemodynamic modalities such as fMRI, which have increased spatial precision. However, integrating these heterogeneous data sources remains a challenge, which limits a comprehensive understanding of brain function. We present the Spatiotemporal Alignment of Multimodal Brain Activity (SAMBA) framework, which bridges the spatial and temporal resolution gaps across modalities by learning a unified latent space free of modality-specific biases. SAMBA introduces a novel attention-based wavelet decomposition for spectral filtering of electrophysiological recordings, graph attention networks to model functional connectivity between functional brain units, and recurrent layers to capture temporal autocorrelations in brain signals. We show that training SAMBA, aside from achieving translation, also yields a rich representation of brain information processing. We showcase this by classifying external stimuli driving brain activity from the representation learned in the hidden layers of SAMBA, paving the way for broad downstream applications in neuroscience research and clinical contexts.
@inproceedings{afrasiyabi2025latent, title = {Latent Representation Learning for Multimodal Brain Activity Translation}, author = {Afrasiyabi, Arman and Bhaskar, Dhananjay and Busch, Erica L. and Caplette, Laurent and Singh, Rahul and Lajoie, Guillaume and Turk-Browne, Nicholas B. and Krishnaswamy, Smita}, year = {2025}, month = may, booktitle = {ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)}, pages = {1--5}, doi = {10.1109/icassp49660.2025.10887834}, issn = {2379-190x}, keywords = {Wavelet transforms;Training;Translation;Neuroscience;Soft sensors;Transformers;Spatiotemporal phenomena;Recording;Spatial resolution;Speech processing}, }
- In-Context Parametric Inference: Point or Distribution Estimators? Sarthak Mittal, Yoshua Bengio, Nikolay Malkin, and Guillaume Lajoie. May 2025
@misc{mittal2025in-context, title = {In-Context Parametric Inference: Point or Distribution Estimators?}, author = {Mittal, Sarthak and Bengio, Yoshua and Malkin, Nikolay and Lajoie, Guillaume}, year = {2025}, url = {https://arxiv.org/abs/2502.11617}, eprint = {2502.11617}, archiveprefix = {arXiv}, primaryclass = {cs.LG}, }
- Amortized In-Context Bayesian Posterior Estimation. Sarthak Mittal, Niels Leif Bracher, Guillaume Lajoie, Priyank Jaini, and Marcus Brubaker. May 2025
@misc{mittal2025amortized, title = {Amortized In-Context Bayesian Posterior Estimation}, author = {Mittal, Sarthak and Bracher, Niels Leif and Lajoie, Guillaume and Jaini, Priyank and Brubaker, Marcus}, year = {2025}, url = {https://arxiv.org/abs/2502.06601}, eprint = {2502.06601}, archiveprefix = {arXiv}, primaryclass = {cs.LG}, }
- Robust prior-biased acquisition function for human-in-the-loop Bayesian optimization. Rose Guay-Hottin, Lison Kardassevitch, Hugo Pham, Guillaume Lajoie, and Marco Bonizzato. Knowledge-Based Systems, May 2025
In diverse fields of application, Bayesian Optimization (BO) has been proposed to find the optimum of black-box functions, surpassing human-driven searches. BO’s appeal lies in its data efficiency, making it suitable for optimizing costly-to-evaluate functions without requiring extensive training data. While BO can perform well in closed loop, domain experts frequently have hypotheses about which parameter combinations are more likely to yield optimal results. Hence, for BO to be truly relevant and adopted by practitioners, such prior knowledge needs to be efficiently and seamlessly integrated into the optimization framework. Some methods were recently developed to address this challenge, but they suffer from robustness issues when provided with erroneous insight. Building on the idea of an element-wise prior-weighted acquisition function, we propose to use a fixed-weight effective prior that distills expert user knowledge with minimal computational cost. Comprehensive investigation across diverse task conditions and prior quality levels revealed that our method, α-πBO, surpasses vanilla BO when provided with insights of good quality while maintaining robustness against misleading information. Moreover, unlike other methods, α-πBO typically requires no hyperparameter tuning, largely simplifying its implementation in diverse tasks.
@article{guay-hottin2025robust, title = {Robust prior-biased acquisition function for human-in-the-loop Bayesian optimization}, author = {Guay-Hottin, Rose and Kardassevitch, Lison and Pham, Hugo and Lajoie, Guillaume and Bonizzato, Marco}, year = {2025}, journal = {Knowledge-Based Systems}, volume = {311}, pages = {113039}, doi = {10.1016/j.knosys.2025.113039}, issn = {0950-7051}, url = {https://www.sciencedirect.com/science/article/pii/S0950705125000863}, keywords = {Bayesian optimization, Domain knowledge integration, Prior-weighted acquisition function, Region of interest, Human-in-the-loop}, }
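The fixed-weight, prior-biased acquisition idea described in the entry above can be illustrated with a short sketch: a standard acquisition function (expected improvement here) is multiplied by a user-supplied prior raised to a fixed exponent. This is a minimal, hypothetical illustration of the general recipe, not the paper's α-πBO implementation; the function names and the toy candidate grid are assumptions.

```python
import numpy as np
from scipy.stats import norm

def expected_improvement(mu, sigma, best_f):
    """Standard EI acquisition from a GP posterior (maximization)."""
    sigma = np.maximum(sigma, 1e-12)
    z = (mu - best_f) / sigma
    return (mu - best_f) * norm.cdf(z) + sigma * norm.pdf(z)

def prior_weighted_acquisition(mu, sigma, best_f, prior, alpha=1.0):
    """Bias EI toward regions the expert believes are promising.
    `prior` is the user's belief pi(x) evaluated on the candidate points,
    `alpha` is a fixed weight; alpha=0 recovers vanilla EI.
    Hypothetical illustration of a fixed-weight prior-biased acquisition."""
    return expected_improvement(mu, sigma, best_f) * np.power(prior, alpha)

# toy usage on a small grid of candidate parameter settings
mu = np.array([0.2, 0.5, 0.4])      # GP posterior mean per candidate
sigma = np.array([0.3, 0.1, 0.2])   # GP posterior std per candidate
prior = np.array([0.1, 0.8, 0.1])   # expert prior over the candidates
scores = prior_weighted_acquisition(mu, sigma, best_f=0.45, prior=prior, alpha=1.0)
next_idx = int(np.argmax(scores))   # candidate to evaluate next
```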
- Learning Versatile Optimizers on a Compute Diet. Abhinav Moudgil, Boris Knyazev, Guillaume Lajoie, and Eugene Belilovsky. May 2025
@misc{moudgil2025learning, title = {Learning Versatile Optimizers on a Compute Diet}, author = {Moudgil, Abhinav and Knyazev, Boris and Lajoie, Guillaume and Belilovsky, Eugene}, year = {2025}, url = {https://arxiv.org/abs/2501.12670}, eprint = {2501.12670}, archiveprefix = {arXiv}, primaryclass = {cs.LG}, }
- Accelerated learning of a noninvasive human brain-computer interface via manifold geometry. Erica L. Busch, E. Chandra Fincke, Guillaume Lajoie, Smita Krishnaswamy, and Nicholas B. Turk-Browne. bioRxiv, May 2025
Brain-computer interfaces (BCIs) promise to restore and enhance a wide range of human capabilities. However, a barrier to the adoption of BCIs is how long it can take users to learn to control them. We hypothesized that human BCI learning could be accelerated by leveraging the naturally occurring geometric structure of brain activity, or its intrinsic manifold, extracted using a data-diffusion process. We trained participants on a noninvasive BCI that allowed them to gain real-time control of an avatar in a virtual reality game by modulating functional magnetic resonance imaging (fMRI) activity in brain regions that support spatial navigation. We then perturbed the mapping between fMRI activity patterns and the movement of the avatar to test our manifold hypothesis. When the new mapping respected the intrinsic manifold, participants succeeded in regaining control of the BCI by aligning their brain activity within the manifold. When the new mapping did not respect the intrinsic manifold, participants could not learn to control the avatar again. These findings show that the manifold geometry of brain activity constrains human learning of a complex cognitive task in higher-order brain regions. Manifold geometry may be a critical ingredient for unlocking the potential of future human neurotechnologies.
@article{busch2025accelerated, title = {Accelerated learning of a noninvasive human brain-computer interface via manifold geometry}, author = {Busch, Erica L. and Fincke, E. Chandra and Lajoie, Guillaume and Krishnaswamy, Smita and Turk-Browne, Nicholas B.}, year = {2025}, journal = {bioRxiv}, publisher = {Cold Spring Harbor Laboratory}, doi = {10.1101/2025.03.29.646109}, url = {https://www.biorxiv.org/content/early/2025/04/03/2025.03.29.646109}, elocation-id = {2025.03.29.646109}, eprint = {https://www.biorxiv.org/content/early/2025/04/03/2025.03.29.646109.full.pdf}, }
- A Complexity-Based Theory of Compositionality. Eric Elmoznino, Thomas Jiralerspong, Yoshua Bengio, and Guillaume Lajoie. May 2025
@misc{elmoznino2025complexity-based, title = {A Complexity-Based Theory of Compositionality}, author = {Elmoznino, Eric and Jiralerspong, Thomas and Bengio, Yoshua and Lajoie, Guillaume}, year = {2025}, url = {https://arxiv.org/abs/2410.14817}, eprint = {2410.14817}, archiveprefix = {arXiv}, primaryclass = {cs.CL}, }
- In-context learning and Occam’s razor. Eric Elmoznino, Tom Marty, Tejas Kasetty, Leo Gagnon, Sarthak Mittal, and 3 more authors. May 2025
@misc{elmoznino2025in-context, title = {In-context learning and Occam's razor}, author = {Elmoznino, Eric and Marty, Tom and Kasetty, Tejas and Gagnon, Leo and Mittal, Sarthak and Fathi, Mahan and Sridhar, Dhanya and Lajoie, Guillaume}, year = {2025}, url = {https://arxiv.org/abs/2410.14086}, eprint = {2410.14086}, archiveprefix = {arXiv}, primaryclass = {cs.LG}, }
- The oneirogen hypothesis: modeling the hallucinatory effects of classical psychedelics in terms of replay-dependent plasticity mechanisms. Colin Bredenberg, Fabrice Normandin, Blake Richards, and Guillaume Lajoie. bioRxiv, May 2025
Classical psychedelics induce complex visual hallucinations in humans, generating percepts that are coherent at a low level, but which have surreal, dream-like qualities at a high level. While there are many hypotheses as to how classical psychedelics could induce these effects, there are no concrete mechanistic models that capture the variety of observed effects in humans, while remaining consistent with the known pharmacological effects of classical psychedelics on neural circuits. In this work, we propose the “oneirogen hypothesis”, which posits that the perceptual effects of classical psychedelics are a result of their pharmacological actions inducing neural activity states that truly are more similar to dream-like states. We simulate classical psychedelics’ effects by manipulating neural network models trained on perceptual tasks with the Wake-Sleep algorithm. This established machine learning algorithm leverages two activity phases: a perceptual phase (wake) where sensory inputs are encoded, and a generative phase (dream) where the network internally generates activity consistent with stimulus-evoked responses. We simulate the action of psychedelics by partially shifting the model to the sleep state, which entails a greater influence of top-down connections, in line with the impact of psychedelics on apical dendrites. The effects resulting from this manipulation capture a number of experimentally observed phenomena, including the emergence of hallucinations, increases in stimulus-conditioned variability, and large increases in synaptic plasticity. We further provide a number of testable predictions which could be used to validate or invalidate our oneirogen hypothesis.
@article{bredenberg2025oneirogen, title = {The oneirogen hypothesis: modeling the hallucinatory effects of classical psychedelics in terms of replay-dependent plasticity mechanisms}, author = {Bredenberg, Colin and Normandin, Fabrice and Richards, Blake and Lajoie, Guillaume}, year = {2025}, journal = {bioRxiv}, publisher = {Cold Spring Harbor Laboratory}, doi = {10.1101/2024.09.27.615483}, url = {https://www.biorxiv.org/content/early/2025/01/13/2024.09.27.615483}, elocation-id = {2024.09.27.615483}, eprint = {https://www.biorxiv.org/content/early/2025/01/13/2024.09.27.615483.full.pdf}, }
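The core manipulation described in the entry above, partially shifting a Wake-Sleep-trained network toward its generative (dream) phase, can be sketched schematically. Everything here (layer sizes, the tanh nonlinearities, and the convex blend between bottom-up and internally generated drive) is an illustrative assumption, not the paper's model.

```python
import numpy as np

rng = np.random.default_rng(0)

# toy one-hidden-layer recognition/generative pair (hypothetical sizes)
n_x, n_h = 64, 32
W_rec = rng.normal(scale=0.1, size=(n_h, n_x))   # bottom-up (recognition) weights
W_gen = rng.normal(scale=0.1, size=(n_x, n_h))   # top-down (generative) weights

def perceive(x, psychedelic_level=0.0):
    """Hidden-layer activity as a blend of stimulus-driven (wake) and
    internally generated (sleep/dream) drive. psychedelic_level=0 is normal
    perception, 1 is a fully dream-like state; the blend is an illustrative
    stand-in for the paper's manipulation of top-down influence."""
    bottom_up = np.tanh(W_rec @ x)                        # wake-phase drive from the stimulus
    h_sampled = rng.normal(size=n_h)                      # latent sample, as in the sleep phase
    dream = np.tanh(W_rec @ np.tanh(W_gen @ h_sampled))   # percept generated top-down
    lam = psychedelic_level
    return (1.0 - lam) * bottom_up + lam * dream

x = rng.normal(size=n_x)                                  # a toy sensory input
sober = perceive(x, psychedelic_level=0.0)
hallucinating = perceive(x, psychedelic_level=0.6)        # partial shift toward the dream state
```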
- Assistive sensory-motor perturbations influence learned neural representations. Pavithra Rajeswaran, Alexandre Payeur, Guillaume Lajoie, and Amy L. Orsborn. bioRxiv, May 2025
Task errors are used to learn and refine motor skills. We investigated how task assistance influences learned neural representations using Brain-Computer Interfaces (BCIs), which map neural activity into movement via a decoder. We analyzed motor cortex activity as monkeys practiced BCI with a decoder that adapted to improve or maintain performance over days. Over time, task-relevant information became concentrated in fewer neurons, unlike with fixed decoders. At the population level, task information also became largely confined to a few neural modes that accounted for an unexpectedly small fraction of the population variance. A neural network model suggests the adaptive decoders directly contribute to forming these more compact neural representations. Our findings show that assistive decoders manipulate error information used for long-term learning computations like credit assignment, which informs our understanding of motor learning and has implications for designing real-world BCIs. Competing interest statement: A.L.O. is a scientific advisor for Meta Reality Labs; G.L. is a scientific advisor for BIOS Health.
@article{rajeswaran2025assistive, title = {Assistive sensory-motor perturbations influence learned neural representations}, author = {Rajeswaran, Pavithra and Payeur, Alexandre and Lajoie, Guillaume and Orsborn, Amy L.}, year = {2025}, journal = {bioRxiv}, publisher = {Cold Spring Harbor Laboratory}, doi = {10.1101/2024.03.20.585972}, url = {https://www.biorxiv.org/content/early/2025/04/02/2024.03.20.585972}, elocation-id = {2024.03.20.585972}, eprint = {https://www.biorxiv.org/content/early/2025/04/02/2024.03.20.585972.full.pdf}, }
2024
- Brain-like neural dynamics for behavioral control develop through reinforcement learning. Olivier Codol, Nanda H. Krishna, Guillaume Lajoie, and Matthew G. Perich. bioRxiv, May 2024
During development, neural circuits are shaped continuously as we learn to control our bodies. The ultimate goal of this process is to produce neural dynamics that enable the rich repertoire of behaviors we perform with our limbs. What begins as a series of “babbles” coalesces into skilled motor output as the brain rapidly learns to control the body. However, the nature of the teaching signal underlying this normative learning process remains elusive. Here, we test two well-established and biologically plausible theories—supervised learning (SL) and reinforcement learning (RL)—that could explain how neural circuits develop the capacity for skilled movements. We trained recurrent neural networks to control a biomechanical model of a primate arm using either SL or RL and compared the resulting neural dynamics to populations of neurons recorded from the motor cortex of monkeys performing the same movements. Intriguingly, only RL-trained networks produced neural activity that matched their biological counterparts in terms of both the geometry and dynamics of population activity. We show that the similarity between RL-trained networks and biological brains depends critically on matching biomechanical properties of the limb. We then demonstrated that monkeys and RL-trained networks, but not SL-trained networks, show a strikingly similar capacity for robust short-term behavioral adaptation to a movement perturbation, indicating a fundamental and general commonality in the neural control policy. Together, our results support the hypothesis that neural dynamics for behavioral control emerge through a process akin to reinforcement learning. The resulting neural circuits offer numerous advantages for adaptable behavioral control over simpler and more efficient learning rules and expand our understanding of how developmental processes shape neural dynamics.
@article{codol2024brain-like, title = {Brain-like neural dynamics for behavioral control develop through reinforcement learning}, author = {Codol, Olivier and Krishna, Nanda H. and Lajoie, Guillaume and Perich, Matthew G.}, year = {2024}, journal = {bioRxiv}, publisher = {Cold Spring Harbor Laboratory}, doi = {10.1101/2024.10.04.616712}, url = {https://www.biorxiv.org/content/early/2024/10/06/2024.10.04.616712}, elocation-id = {2024.10.04.616712}, eprint = {https://www.biorxiv.org/content/early/2024/10/06/2024.10.04.616712.full.pdf}, }
- Neural networks with optimized single-neuron adaptation uncover biologically plausible regularization. Victor Geadah, Stefan Horoi, Giancarlo Kerg, Guy Wolf, and Guillaume Lajoie. PLOS Computational Biology, Dec 2024
Neurons in the brain have rich and adaptive input-output properties. Features such as heterogeneous f-I curves and spike frequency adaptation are known to place single neurons in optimal coding regimes when facing changing stimuli. Yet, it is still unclear how brain circuits exploit single-neuron flexibility, and how network-level requirements may have shaped such cellular function. To answer this question, a multi-scaled approach is needed where the computations of single neurons and neural circuits must be considered as a complete system. In this work, we use artificial neural networks to systematically investigate single-neuron input-output adaptive mechanisms, optimized in an end-to-end fashion. Throughout the optimization process, each neuron has the liberty to modify its nonlinear activation function parametrized to mimic f-I curves of biological neurons, either by learning an individual static function or via a learned and shared adaptation mechanism to modify activation functions in real-time during a task. We find that such adaptive networks show much-improved robustness to noise and changes in input statistics. Using tools from dynamical systems theory, we analyze the role of these emergent single-neuron properties and argue that neural diversity and adaptation play an active regularization role, enabling neural circuits to optimally propagate information across time. Finally, we outline similarities between these optimized solutions and known coding strategies found in biological neurons, such as gain scaling and fractional order differentiation/integration.
@article{geadah2024neural, title = {Neural networks with optimized single-neuron adaptation uncover biologically plausible regularization}, author = {Geadah, Victor and Horoi, Stefan and Kerg, Giancarlo and Wolf, Guy and Lajoie, Guillaume}, year = {2024}, month = dec, journal = {PLOS Computational Biology}, publisher = {Public Library of Science}, volume = {20}, number = {12}, pages = {1--23}, doi = {10.1371/journal.pcbi.1012567}, url = {https://doi.org/10.1371/journal.pcbi.1012567}, }
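The idea in the entry above, end-to-end optimization of per-neuron input-output nonlinearities, can be sketched as a small learnable activation module. The two-parameter form below (a per-unit gain plus a learned mix between saturating and non-saturating branches) is a hypothetical stand-in for the paper's parametrized, f-I-curve-like activation functions.

```python
import torch
import torch.nn as nn

class AdaptiveActivation(nn.Module):
    """Per-neuron nonlinearity with learnable shape parameters, loosely
    mimicking heterogeneous f-I curves. This parametrization is
    illustrative, not necessarily the one used in the paper."""

    def __init__(self, n_units):
        super().__init__()
        self.gain = nn.Parameter(torch.ones(n_units))   # per-neuron input gain
        self.mix = nn.Parameter(torch.zeros(n_units))   # pre-sigmoid mixing coefficient

    def forward(self, x):
        s = torch.sigmoid(self.mix)                     # in (0, 1), learned per neuron
        z = self.gain * x
        return s * torch.tanh(z) + (1.0 - s) * torch.relu(z)

# usage: each hidden unit learns its own activation shape along with the weights
layer = nn.Linear(16, 32)
act = AdaptiveActivation(32)
h = act(layer(torch.randn(8, 16)))   # (batch, units)
```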
- Brain-like learning with exponentiated gradients. Jonathan Cornford, Roman Pogodin, Arna Ghosh, Kaiwen Sheng, Brendan A. Bicknell, and 4 more authors. bioRxiv, Dec 2024
Computational neuroscience relies on gradient descent (GD) for training artificial neural network (ANN) models of the brain. The advantage of GD is that it is effective at learning difficult tasks. However, it produces ANNs that are a poor phenomenological fit to biology, making them less relevant as models of the brain. Specifically, it violates Dale’s law by allowing synapses to change from excitatory to inhibitory, and leads to synaptic weights that are not log-normally distributed, contradicting experimental data. Here, starting from first principles of optimisation theory, we present an alternative learning algorithm, exponentiated gradient (EG), that respects Dale’s law and produces log-normal weights, without losing the power of learning with gradients. We also show that in biologically relevant settings EG outperforms GD, including learning from sparsely relevant signals and dealing with synaptic pruning. Altogether, our results show that EG is a superior learning algorithm for modelling the brain with ANNs.
@article{cornford2024brain-like, title = {Brain-like learning with exponentiated gradients}, author = {Cornford, Jonathan and Pogodin, Roman and Ghosh, Arna and Sheng, Kaiwen and Bicknell, Brendan A. and Codol, Olivier and Clark, Beverley A. and Lajoie, Guillaume and Richards, Blake A.}, year = {2024}, journal = {bioRxiv}, publisher = {Cold Spring Harbor Laboratory}, doi = {10.1101/2024.10.25.620272}, url = {https://www.biorxiv.org/content/early/2024/10/26/2024.10.25.620272}, elocation-id = {2024.10.25.620272}, eprint = {https://www.biorxiv.org/content/early/2024/10/26/2024.10.25.620272.full.pdf}, }
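The contrast between additive gradient descent and the multiplicative exponentiated-gradient update can be shown in a few lines. This is a minimal sketch of the generic EG rule for sign-constrained weights, not the paper's training setup; the toy weight and gradient vectors are placeholders.

```python
import numpy as np

def gd_update(w, grad, lr=0.1):
    """Standard additive gradient descent; weights can cross zero and flip sign."""
    return w - lr * grad

def eg_update(w, grad, lr=0.1):
    """Exponentiated-gradient update: each weight is rescaled multiplicatively,
    so it keeps its initial sign (consistent with Dale's law) and its magnitude
    evolves as a multiplicative, log-additive process. Minimal sketch, not the
    paper's exact algorithm."""
    return w * np.exp(-lr * np.sign(w) * grad)

rng = np.random.default_rng(1)
signs = np.where(rng.random(1000) < 0.8, 1.0, -1.0)          # mostly excitatory, some inhibitory
w = rng.lognormal(mean=-1.0, sigma=0.5, size=1000) * signs    # log-normal magnitudes
grad = rng.normal(size=1000)                                  # stand-in for a loss gradient

w_gd = gd_update(w, grad)
w_eg = eg_update(w, grad)
assert np.all(np.sign(w_eg) == np.sign(w))   # EG preserves excitatory/inhibitory identity
```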
- Task-Optimized Artificial Neural Networks Align with Human Brain Activity in a Visual Working Memory Task. Pravish Sainath, Guillaume Lajoie, and Pierre Bellec. PsyArXiv, Dec 2024
@article{sainath2024task-optimized, title = {Task-Optimized Artificial Neural Networks Align with Human Brain Activity in a Visual Working Memory Task}, author = {Sainath, Pravish and Lajoie, Guillaume and Bellec, Pierre}, year = {2024}, journal = {PsyArXiv}, doi = {10.31234/osf.io/7g9ej_v1}, url = {http://dx.doi.org/10.31234/osf.io/7g9ej_v1}, }
- When can transformers compositionally generalize in-context? Seijin Kobayashi, Simon Schug, Yassir Akram, Florian Redhardt, Johannes Oswald, and 3 more authors. Dec 2024
@misc{kobayashi2024when, title = {When can transformers compositionally generalize in-context?}, author = {Kobayashi, Seijin and Schug, Simon and Akram, Yassir and Redhardt, Florian and von Oswald, Johannes and Pascanu, Razvan and Lajoie, Guillaume and Sacramento, Jo\~{a}o}, year = {2024}, url = {https://arxiv.org/abs/2407.12275}, eprint = {2407.12275}, archiveprefix = {arXiv}, primaryclass = {cs.LG}, }
- A benchmark of individual auto-regressive models in a massive fMRI dataset. François Paugam, Basile Pinsard, Guillaume Lajoie, and Pierre Bellec. Imaging Neuroscience, Jul 2024
Dense functional magnetic resonance imaging datasets open new avenues to create auto-regressive models of brain activity. Individual idiosyncrasies are obscured by group models, but can be captured by purely individual models given sufficient amounts of training data. In this study, we compared several deep and shallow individual models on the temporal auto-regression of BOLD time-series recorded during a natural video-watching task. The best performing models were then analyzed in terms of their data requirements and scaling, subject specificity, and the space-time structure of their predicted dynamics. We found the Chebnets, a type of graph convolutional neural network, to be best suited for temporal BOLD auto-regression, closely followed by linear models. Chebnets demonstrated an increase in performance with increasing amounts of data, with no complete saturation at 9 h of training data. Good generalization to other kinds of video stimuli and to resting-state data marked the Chebnets’ ability to capture intrinsic brain dynamics rather than only stimulus-specific autocorrelation patterns. Significant subject specificity was found at short prediction time lags. The Chebnets were found to capture lower frequencies at longer prediction time lags, and the spatial correlations in predicted dynamics were found to match traditional functional connectivity networks. Overall, these results demonstrate that large individual functional magnetic resonance imaging (fMRI) datasets can be used to efficiently train purely individual auto-regressive models of brain activity, and that massive amounts of individual data are required to do so. The excellent performance of the Chebnets likely reflects their ability to combine spatial and temporal interactions on large time scales at a low complexity cost. The non-linearities of the models did not appear as a key advantage. In fact, surprisingly, linear versions of the Chebnets appeared to outperform the original non-linear ones. Individual temporal auto-regressive models have the potential to improve the predictability of the BOLD signal. This study is based on a massive, publicly-available dataset, which can serve for future benchmarks of individual auto-regressive modeling.
@article{paugam2024benchmark, title = {A benchmark of individual auto-regressive models in a massive fMRI dataset}, author = {Paugam, Fran\c{c}ois and Pinsard, Basile and Lajoie, Guillaume and Bellec, Pierre}, year = {2024}, month = jul, journal = {Imaging Neuroscience}, volume = {2}, pages = {1--23}, doi = {10.1162/imag_a_00228}, issn = {2837-6056}, url = {https://doi.org/10.1162/imag\%5Fa\%5F00228}, eprint = {https://direct.mit.edu/imag/article-pdf/doi/10.1162/imag\_a\_00228/2461525/imag\_a\_00228.pdf}, }
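The modelling idea in the entry above, predicting the next BOLD frame from a graph filter applied over a parcellation graph, can be sketched with a Chebyshev-polynomial graph filter, the building block of ChebNet-style graph convolutions. The toy graph, filter order, and single-lag history are assumptions; the study's actual models use learned, multi-lag, deeper architectures.

```python
import numpy as np

def normalized_laplacian(A):
    """Symmetric normalized graph Laplacian L = I - D^{-1/2} A D^{-1/2}."""
    d = A.sum(axis=1)
    d_inv_sqrt = 1.0 / np.sqrt(np.maximum(d, 1e-12))
    return np.eye(A.shape[0]) - (A * d_inv_sqrt[:, None]) * d_inv_sqrt[None, :]

def cheb_filter(L, x, theta):
    """Chebyshev polynomial graph filter sum_k theta_k T_k(L_scaled) x.
    x: (n_nodes,) signal on the graph; theta: (K,) filter coefficients."""
    lmax = np.linalg.eigvalsh(L).max()
    L_s = 2.0 * L / lmax - np.eye(L.shape[0])            # rescale spectrum to [-1, 1]
    T_prev, T_curr = x, L_s @ x
    out = theta[0] * T_prev + (theta[1] * T_curr if len(theta) > 1 else 0.0)
    for k in range(2, len(theta)):
        T_prev, T_curr = T_curr, 2.0 * L_s @ T_curr - T_prev
        out = out + theta[k] * T_curr
    return out

# toy auto-regressive step: predict the next BOLD frame from the current one
# (the parcellation graph, coefficients, and one-lag history are illustrative)
rng = np.random.default_rng(0)
A = (rng.random((10, 10)) < 0.3).astype(float)
A = np.triu(A, 1); A = A + A.T                           # symmetric toy parcel graph
L = normalized_laplacian(A)
bold_t = rng.normal(size=10)                             # current BOLD value per parcel
theta = rng.normal(scale=0.1, size=3)                    # learned by regression in practice
bold_next_pred = cheb_filter(L, bold_t, theta)
```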
- Using neural biomarkers to personalize dosing of vagus nerve stimulation. Antonin Berthon, Lorenz Wernisch, Myrta Stoukidi, Michael Thornton, Olivier Tessier-Lariviere, and 18 more authors. Bioelectronic Medicine, Jun 2024
@article{berthon2024using, title = {Using neural biomarkers to personalize dosing of vagus nerve stimulation}, author = {Berthon, Antonin and Wernisch, Lorenz and Stoukidi, Myrta and Thornton, Michael and Tessier-Lariviere, Olivier and Fortier-Poisson, Pascal and Mamen, Jorin and Pinkney, Max and Lee, Susannah and Sarkans, Elvijs and Annecchino, Luca and Appleton, Ben and Garsed, Philip and Patterson, Bret and Gonshaw, Samuel and Jakopec, Matjaz and Shunmugam, Sudhakaran and Edwards, Tristan and Tukiainen, Aleksi and Jennings, Joel and Lajoie, Guillaume and Hewage, Emil and Armitage, Oliver}, year = {2024}, month = jun, journal = {Bioelectronic Medicine}, publisher = {Springer Science and Business Media LLC}, volume = {10}, number = {1}, doi = {10.1186/s42234-024-00147-4}, issn = {2332-8886}, url = {http://dx.doi.org/10.1186/s42234-024-00147-4}, }
- Does learning the right latent variables necessarily improve in-context learning? Sarthak Mittal, Eric Elmoznino, Leo Gagnon, Sangnie Bhardwaj, Dhanya Sridhar, and 1 more author. Jun 2024
@misc{mittal2024does, title = {Does learning the right latent variables necessarily improve in-context learning?}, author = {Mittal, Sarthak and Elmoznino, Eric and Gagnon, Leo and Bhardwaj, Sangnie and Sridhar, Dhanya and Lajoie, Guillaume}, year = {2024}, url = {https://arxiv.org/abs/2405.19162}, eprint = {2405.19162}, archiveprefix = {arXiv}, primaryclass = {cs.LG}, }
- Amortizing intractable inference in large language models. Edward J Hu, Moksh Jain, Eric Elmoznino, Younesse Kaddar, Guillaume Lajoie, and 2 more authors. In The Twelfth International Conference on Learning Representations, Jun 2024
@inproceedings{jhu2024amortizing, title = {Amortizing intractable inference in large language models}, author = {Hu, Edward J and Jain, Moksh and Elmoznino, Eric and Kaddar, Younesse and Lajoie, Guillaume and Bengio, Yoshua and Malkin, Nikolay}, year = {2024}, booktitle = {The Twelfth International Conference on Learning Representations}, url = {https://openreview.net/forum?id=Ouj6p4ca60}, }
- Synaptic Weight Distributions Depend on the Geometry of Plasticity. Roman Pogodin, Jonathan Cornford, Arna Ghosh, Gauthier Gidel, Guillaume Lajoie, and 1 more author. In The Twelfth International Conference on Learning Representations, Jun 2024
@inproceedings{pogodin2024synaptic, title = {Synaptic Weight Distributions Depend on the Geometry of Plasticity}, author = {Pogodin, Roman and Cornford, Jonathan and Ghosh, Arna and Gidel, Gauthier and Lajoie, Guillaume and Richards, Blake Aaron}, year = {2024}, booktitle = {The Twelfth International Conference on Learning Representations}, url = {https://openreview.net/forum?id=x5txICnnjC}, }
- Leveraging Unpaired Data for Vision-Language Generative Models via Cycle Consistency. Tianhong Li, Sangnie Bhardwaj, Yonglong Tian, Han Zhang, Jarred Barber, and 4 more authors. In The Twelfth International Conference on Learning Representations, Jun 2024
@inproceedings{li2024leveraging, title = {Leveraging Unpaired Data for Vision-Language Generative Models via Cycle Consistency}, author = {Li, Tianhong and Bhardwaj, Sangnie and Tian, Yonglong and Zhang, Han and Barber, Jarred and Katabi, Dina and Lajoie, Guillaume and Chang, Huiwen and Krishnan, Dilip}, year = {2024}, booktitle = {The Twelfth International Conference on Learning Representations}, url = {https://openreview.net/forum?id=kNjrhD67LP}, }
- Sufficient conditions for offline reactivation in recurrent neural networks. Nanda H Krishna, Colin Bredenberg, Daniel Levenstein, Blake Aaron Richards, and Guillaume Lajoie. In The Twelfth International Conference on Learning Representations, Jun 2024
@inproceedings{hkrishna2024sufficient, title = {Sufficient conditions for offline reactivation in recurrent neural networks}, author = {Krishna, Nanda H and Bredenberg, Colin and Levenstein, Daniel and Richards, Blake Aaron and Lajoie, Guillaume}, year = {2024}, booktitle = {The Twelfth International Conference on Learning Representations}, url = {https://openreview.net/forum?id=RVrINT6MT7}, }
- Delta-AI: Local objectives for amortized inference in sparse graphical models. Jean-Pierre René Falet, Hae Beom Lee, Nikolay Malkin, Chen Sun, Dragos Secrieru, and 3 more authors. In The Twelfth International Conference on Learning Representations, Jun 2024
@inproceedings{renfalet2024delta-ai, title = {Delta-{AI}: Local objectives for amortized inference in sparse graphical models}, author = {Falet, Jean-Pierre Ren{\'e} and Lee, Hae Beom and Malkin, Nikolay and Sun, Chen and Secrieru, Dragos and Zhang, Dinghuai and Lajoie, Guillaume and Bengio, Yoshua}, year = {2024}, booktitle = {The Twelfth International Conference on Learning Representations}, url = {https://openreview.net/forum?id=LemSSn8htt}, }
- How connectivity structure shapes rich and lazy learning in neural circuits. Yuhan Helena Liu, Aristide Baratin, Jonathan Cornford, Stefan Mihalas, Eric Todd SheaBrown, and 1 more author. In The Twelfth International Conference on Learning Representations, Jun 2024
@inproceedings{helenaliu2024how, title = {How connectivity structure shapes rich and lazy learning in neural circuits}, author = {Liu, Yuhan Helena and Baratin, Aristide and Cornford, Jonathan and Mihalas, Stefan and SheaBrown, Eric Todd and Lajoie, Guillaume}, year = {2024}, booktitle = {The Twelfth International Conference on Learning Representations}, url = {https://openreview.net/forum?id=slSmYGc8ee}, }
- Learning and Aligning Structured Random Feature Networks. Vivian White, Muawiz Sajjad Chaudhary, Guy Wolf, Guillaume Lajoie, and Kameron Decker Harris. In ICLR 2024 Workshop on Representational Alignment, Jun 2024
@inproceedings{white2024learning, title = {Learning and Aligning Structured Random Feature Networks}, author = {White, Vivian and Chaudhary, Muawiz Sajjad and Wolf, Guy and Lajoie, Guillaume and Harris, Kameron Decker}, year = {2024}, booktitle = {ICLR 2024 Workshop on Representational Alignment}, url = {https://openreview.net/forum?id=vWhUQXQoFF}, }
- Online Bayesian optimization of vagus nerve stimulation. Lorenz Wernisch, Tristan Edwards, Antonin Berthon, Olivier Tessier-Lariviere, Elvijs Sarkans, and 19 more authors. Journal of Neural Engineering, Apr 2024
Objective. In bioelectronic medicine, neuromodulation therapies induce neural signals to the brain or organs, modifying their function. Stimulation devices capable of triggering exogenous neural signals using electrical waveforms require a complex and multi-dimensional parameter space to control such waveforms. Determining the best combination of parameters (waveform optimization or dosing) for treating a particular patient’s illness is therefore challenging. Comprehensive parameter searching for an optimal stimulation effect is often infeasible in a clinical setting due to the size of the parameter space. Restricting this space, however, may lead to suboptimal therapeutic results, reduced responder rates, and adverse effects. Approach. As an alternative to a full parameter search, we present a flexible machine learning, data acquisition, and processing framework for optimizing neural stimulation parameters, requiring as few steps as possible using Bayesian optimization. This optimization builds a model of the neural and physiological responses to stimulations, enabling it to optimize stimulation parameters and provide estimates of the accuracy of the response model. The vagus nerve (VN) innervates, among other thoracic and visceral organs, the heart, thus controlling heart rate (HR), making it an ideal candidate for demonstrating the effectiveness of our approach. Main results. The efficacy of our optimization approach was first evaluated on simulated neural responses, then applied to VN stimulation intraoperatively in porcine subjects. Optimization converged quickly on parameters achieving target HRs and optimizing neural B-fiber activations despite high intersubject variability. Significance. An optimized stimulation waveform was achieved in real time with far fewer stimulations than required by alternative optimization strategies, thus minimizing exposure to side effects. Uncertainty estimates helped avoid stimulations outside a safe range. Our approach shows that a complex set of neural stimulation parameters can be optimized in real time for a patient to achieve personalized precision dosing.
@article{wernisch2024online, title = {Online Bayesian optimization of vagus nerve stimulation}, author = {Wernisch, Lorenz and Edwards, Tristan and Berthon, Antonin and Tessier-Lariviere, Olivier and Sarkans, Elvijs and Stoukidi, Myrta and Fortier-Poisson, Pascal and Pinkney, Max and Thornton, Michael and Hanley, Catherine and Lee, Susannah and Jennings, Joel and Appleton, Ben and Garsed, Phillip and Patterson, Bret and Buttinger, Will and Gonshaw, Samuel and Jakopec, Matja\v{z} and Shunmugam, Sudhakaran and Mamen, Jorin and Tukiainen, Aleksi and Lajoie, Guillaume and Armitage, Oliver and Hewage, Emil}, year = {2024}, month = apr, journal = {Journal of Neural Engineering}, publisher = {IOP Publishing}, volume = {21}, number = {2}, pages = {026019}, doi = {10.1088/1741-2552/ad33ae}, url = {https://dx.doi.org/10.1088/1741-2552/ad33ae}, }
- Personalized inference for neurostimulation with meta-learning: a case study of vagus nerve stimulation. Ximeng Mao, Yao-Chuan Chang, Stavros Zanos, and Guillaume Lajoie. Journal of Neural Engineering, Jan 2024
Objective. Neurostimulation is emerging as treatment for several diseases of the brain and peripheral organs. Due to variability arising from placement of stimulation devices, underlying neuroanatomy and physiological responses to stimulation, it is essential that neurostimulation protocols are personalized to maximize efficacy and safety. Building such personalized protocols would benefit from accumulated information in increasingly large datasets of other individuals’ responses. Approach. To address that need, we propose a meta-learning family of algorithms to conduct few-shot optimization of key fitting parameters of physiological and neural responses in new individuals. While our method is agnostic to neurostimulation setting, here we demonstrate its effectiveness on the problem of physiological modeling of fiber recruitment during vagus nerve stimulation (VNS). Using data from acute VNS experiments, the mapping between amplitudes of stimulus-evoked compound action potentials (eCAPs) and physiological responses, such as heart rate and breathing interval modulation, is inferred. Main results. Using additional synthetic data sets to complement experimental results, we demonstrate that our meta-learning framework is capable of directly modeling the physiology-eCAP relationship for individual subjects with much fewer individually queried data points than standard methods. Significance. Our meta-learning framework is general and can be adapted to many input–response neurostimulation mapping problems. Moreover, this method leverages information from growing data sets of past patients, as a treatment is deployed. It can also be combined with several model types, including regression, Gaussian processes with Bayesian optimization, and beyond.
@article{mao2024personalized, title = {Personalized inference for neurostimulation with meta-learning: a case study of vagus nerve stimulation}, author = {Mao, Ximeng and Chang, Yao-Chuan and Zanos, Stavros and Lajoie, Guillaume}, year = {2024}, month = jan, journal = {Journal of Neural Engineering}, publisher = {IOP Publishing}, volume = {21}, number = {1}, pages = {016004}, doi = {10.1088/1741-2552/ad17f4}, url = {https://dx.doi.org/10.1088/1741-2552/ad17f4}, }
- Connectome-based reservoir computing with the conn2res toolbox. Laura E. Suárez, Agoston Mihalik, Filip Milisav, Kenji Marshall, Mingze Li, and 3 more authors. Nature Communications, Jan 2024
@article{suárez2024connectome-based, title = {Connectome-based reservoir computing with the conn2res toolbox}, author = {Su\'{a}rez, Laura E. and Mihalik, Agoston and Milisav, Filip and Marshall, Kenji and Li, Mingze and V\'{e}rtes, Petra E. and Lajoie, Guillaume and Misic, Bratislav}, year = {2024}, month = jan, journal = {Nature Communications}, publisher = {Springer Science and Business Media LLC}, volume = {15}, number = {1}, doi = {10.1038/s41467-024-44900-4}, issn = {2041-1723}, url = {http://dx.doi.org/10.1038/s41467-024-44900-4}, }
- Discrete, compositional, and symbolic representations through attractor dynamics. Andrew Nam, Eric Elmoznino, Nikolay Malkin, James McClelland, Yoshua Bengio, and 1 more author. Jan 2024
@misc{nam2024discrete, title = {Discrete, compositional, and symbolic representations through attractor dynamics}, author = {Nam, Andrew and Elmoznino, Eric and Malkin, Nikolay and McClelland, James and Bengio, Yoshua and Lajoie, Guillaume}, year = {2024}, url = {https://arxiv.org/abs/2310.01807}, eprint = {2310.01807}, archiveprefix = {arXiv}, primaryclass = {cs.AI}, }
- Sources of richness and ineffability for phenomenally conscious states. Xu Ji, Eric Elmoznino, George Deane, Axel Constant, Guillaume Dumas, and 3 more authors. Neuroscience of Consciousness, Mar 2024
Conscious states (states that there is something it is like to be in) seem both rich, or full of detail, and ineffable, or hard to fully describe or recall. The problem of ineffability, in particular, is a longstanding issue in philosophy that partly motivates the explanatory gap: the belief that consciousness cannot be reduced to underlying physical processes. Here, we provide an information theoretic dynamical systems perspective on the richness and ineffability of consciousness. In our framework, the richness of conscious experience corresponds to the amount of information in a conscious state, and ineffability corresponds to the amount of information lost at different stages of processing. We describe how attractor dynamics in working memory would induce impoverished recollections of our original experiences, how the discrete symbolic nature of language is insufficient for describing the rich and high-dimensional structure of experiences, and how similarity in the cognitive function of two individuals relates to improved communicability of their experiences to each other. While our model may not settle all questions relating to the explanatory gap, it makes progress toward a fully physicalist explanation of the richness and ineffability of conscious experience: two important aspects that seem to be part of what makes qualitative character so puzzling.
@article{ji2024sources, title = {Sources of richness and ineffability for phenomenally conscious states}, author = {Ji, Xu and Elmoznino, Eric and Deane, George and Constant, Axel and Dumas, Guillaume and Lajoie, Guillaume and Simon, Jonathan and Bengio, Yoshua}, year = {2024}, month = mar, journal = {Neuroscience of Consciousness}, volume = {2024}, number = {1}, pages = {niae001}, doi = {10.1093/nc/niae001}, issn = {2057-2107}, url = {https://doi.org/10.1093/nc/niae001}, eprint = {https://academic.oup.com/nc/article-pdf/2024/1/niae001/60791056/niae001.pdf}, }
2023
- Formalizing locality for normative synaptic plasticity models. Colin Bredenberg, Ezekiel Williams, Cristina Savin, Blake Richards, and Guillaume Lajoie. In Advances in Neural Information Processing Systems, Mar 2023
@inproceedings{bredenberg2023formalizing, title = {Formalizing locality for normative synaptic plasticity models}, author = {Bredenberg, Colin and Williams, Ezekiel and Savin, Cristina and Richards, Blake and Lajoie, Guillaume}, year = {2023}, booktitle = {Advances in Neural Information Processing Systems}, publisher = {Curran Associates, Inc.}, volume = {36}, pages = {5653--5684}, url = {https://proceedings.neurips.cc/paper\%5Ffiles/paper/2023/file/120339238f293d4ae53a7167403abc4b-Paper-Conference.pdf}, editor = {Oh, A. and Naumann, T. and Globerson, A. and Saenko, K. and Hardt, M. and Levine, S.}, }
- A Unified, Scalable Framework for Neural Population Decoding. Mehdi Azabou, Vinam Arora, Venkataramana Ganesh, Ximeng Mao, Santosh Nachimuthu, and 5 more authors. In Advances in Neural Information Processing Systems, Mar 2023
@inproceedings{azabou2023unified, title = {A Unified, Scalable Framework for Neural Population Decoding}, author = {Azabou, Mehdi and Arora, Vinam and Ganesh, Venkataramana and Mao, Ximeng and Nachimuthu, Santosh and Mendelson, Michael and Richards, Blake and Perich, Matthew and Lajoie, Guillaume and Dyer, Eva}, year = {2023}, booktitle = {Advances in Neural Information Processing Systems}, publisher = {Curran Associates, Inc.}, volume = {36}, pages = {44937--44956}, url = {https://proceedings.neurips.cc/paper\%5Ffiles/paper/2023/file/8ca113d122584f12a6727341aaf58887-Paper-Conference.pdf}, editor = {Oh, A. and Naumann, T. and Globerson, A. and Saenko, K. and Hardt, M. and Levine, S.}, }
- Flexible Phase Dynamics for Bio-Plausible Contrastive Learning. Ezekiel Williams, Colin Bredenberg, and Guillaume Lajoie. In Proceedings of the 40th International Conference on Machine Learning, 23–29 Jul 2023
Many learning algorithms used as normative models in neuroscience or as candidate approaches for learning on neuromorphic chips learn by contrasting one set of network states with another. These Contrastive Learning (CL) algorithms are traditionally implemented with rigid, temporally non-local, and periodic learning dynamics that could limit the range of physical systems capable of harnessing CL. In this study, we build on recent work exploring how CL might be implemented by biological or neuromorphic systems and show that this form of learning can be made temporally local, and can still function even if many of the dynamical requirements of standard training procedures are relaxed. Thanks to a set of general theorems corroborated by numerical experiments across several CL models, our results provide theoretical foundations for the study and development of CL methods for biological and neuromorphic neural networks.
@inproceedings{williams2023flexible, title = {Flexible Phase Dynamics for Bio-Plausible Contrastive Learning}, author = {Williams, Ezekiel and Bredenberg, Colin and Lajoie, Guillaume}, year = {2023}, month = {23--29 Jul}, booktitle = {Proceedings of the 40th International Conference on Machine Learning}, publisher = {Pmlr}, series = {Proceedings of Machine Learning Research}, volume = {202}, pages = {37042--37065}, url = {https://proceedings.mlr.press/v202/williams23a.html}, editor = {Krause, Andreas and Brunskill, Emma and Cho, Kyunghyun and Engelhardt, Barbara and Sabato, Sivan and Scarlett, Jonathan}, }
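The two-phase structure that the entry above builds on, a "positive" phase where the network is nudged toward a target and a "negative" free-running phase, with weights updated from the difference between the two, can be sketched as a generic contrastive Hebbian rule. The single-layer network and the random nudging below are illustrative assumptions, not one of the specific CL models analyzed in the paper.

```python
import numpy as np

def contrastive_update(W, x, h_free, h_clamped, lr=0.01):
    """Contrastive-learning-style weight update: a Hebbian term from the
    positive (clamped) phase minus an anti-Hebbian term from the negative
    (free) phase. Generic two-phase rule, not the paper's algorithm."""
    return W + lr * (np.outer(h_clamped, x) - np.outer(h_free, x))

rng = np.random.default_rng(0)
W = rng.normal(scale=0.1, size=(8, 16))
x = rng.normal(size=16)                                       # input pattern
h_free = np.tanh(W @ x)                                       # negative phase: network runs freely
h_clamped = np.tanh(W @ x + rng.normal(scale=0.5, size=8))    # positive phase: activity nudged toward a target
W = contrastive_update(W, x, h_free, h_clamped)
```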
- Autonomous optimization of neuroprosthetic stimulation parameters that drive the motor cortex and spinal cord outputs in rats and monkeys. Marco Bonizzato, Rose Guay Hottin, Sandrine L. Côté, Elena Massai, Léo Choinière, and 7 more authors. Cell Reports Medicine, Apr 2023
@article{bonizzato2023autonomous, title = {Autonomous optimization of neuroprosthetic stimulation parameters that drive the motor cortex and spinal cord outputs in rats and monkeys}, author = {Bonizzato, Marco and Guay Hottin, Rose and C\^{o}t\'{e}, Sandrine L. and Massai, Elena and Choini\`{e}re, L\'{e}o and Macar, Uzay and Laferri\`{e}re, Samuel and Sirpal, Parikshat and Quessy, Stephan and Lajoie, Guillaume and Martinez, Marina and Dancause, Numa}, year = {2023}, month = apr, journal = {Cell Reports Medicine}, publisher = {Elsevier BV}, volume = {4}, number = {4}, pages = {101008}, doi = {10.1016/j.xcrm.2023.101008}, issn = {2666-3791}, url = {http://dx.doi.org/10.1016/j.xcrm.2023.101008}, }
- Use of Invasive Brain-Computer Interfaces in Pediatric Neurosurgery: Technical and Ethical Considerations. David Bergeron, Christian Iorio-Morin, Marco Bonizzato, Guillaume Lajoie, Nathalie Orr Gaucher, and 2 more authors. Journal of Child Neurology, Apr 2023. PMID: 37116888
Invasive brain-computer interfaces hold promise to alleviate disabilities in individuals with neurologic injury, with fully implantable brain-computer interface systems expected to reach the clinic in the upcoming decade. Children with severe neurologic disabilities, like quadriplegic cerebral palsy or cervical spine trauma, could benefit from this technology. However, they have been excluded from clinical trials of intracortical brain-computer interface to date. In this manuscript, we discuss the ethical considerations related to the use of invasive brain-computer interface in children with severe neurologic disabilities. We first review the technical hardware and software considerations for the application of intracortical brain-computer interface in children. We then discuss ethical issues related to motor brain-computer interface use in pediatric neurosurgery. Finally, based on the input of a multidisciplinary panel of experts in fields related to brain-computer interface (functional and restorative neurosurgery, pediatric neurosurgery, mathematics and artificial intelligence research, neuroengineering, pediatric ethics, and pragmatic ethics), we then formulate initial recommendations regarding the clinical use of invasive brain-computer interfaces in children.
@article{bergeron2023use, title = {Use of Invasive Brain-Computer Interfaces in Pediatric Neurosurgery: Technical and Ethical Considerations}, author = {Bergeron, David and Iorio-Morin, Christian and Bonizzato, Marco and Lajoie, Guillaume and Gaucher, Nathalie Orr and \'{E}ric Racine and Weil, Alexander G.}, year = {2023}, journal = {Journal of Child Neurology}, volume = {38}, number = {3-4}, pages = {223--238}, doi = {10.1177/08830738231167736}, url = {https://doi.org/10.1177/08830738231167736}, note = {Pmid: 37116888}, eprint = {https://doi.org/10.1177/08830738231167736}, }
- Multi-view manifold learning of human brain-state trajectories. Erica L. Busch, Jessie Huang, Andrew Benz, Tom Wallenstein, Guillaume Lajoie, and 3 more authors. Nature Computational Science, Mar 2023
@article{busch2023multi-view, title = {Multi-view manifold learning of human brain-state trajectories}, author = {Busch, Erica L. and Huang, Jessie and Benz, Andrew and Wallenstein, Tom and Lajoie, Guillaume and Wolf, Guy and Krishnaswamy, Smita and Turk-Browne, Nicholas B.}, year = {2023}, month = mar, journal = {Nature Computational Science}, publisher = {Springer Science and Business Media LLC}, volume = {3}, number = {3}, pages = {240–253}, doi = {10.1038/s43588-023-00419-0}, issn = {2662-8457}, url = {http://dx.doi.org/10.1038/s43588-023-00419-0}, }
- Steerable Equivariant Representation Learning. Sangnie Bhardwaj, Willie McClinton, Tongzhou Wang, Guillaume Lajoie, Chen Sun, and 2 more authors. Mar 2023
@misc{bhardwaj2023steerable, title = {Steerable Equivariant Representation Learning}, author = {Bhardwaj, Sangnie and McClinton, Willie and Wang, Tongzhou and Lajoie, Guillaume and Sun, Chen and Isola, Phillip and Krishnan, Dilip}, year = {2023}, url = {https://arxiv.org/abs/2302.11349}, eprint = {2302.11349}, archiveprefix = {arXiv}, primaryclass = {cs.CV}, }
- Learning to Optimize with Recurrent Hierarchical Transformers. Abhinav Moudgil, Boris Knyazev, Guillaume Lajoie, and Eugene Belilovsky. In ICML Workshop on New Frontiers in Learning, Control, and Dynamical Systems, Mar 2023
@inproceedings{moudgil2023learning, title = {Learning to Optimize with Recurrent Hierarchical Transformers}, author = {Moudgil, Abhinav and Knyazev, Boris and Lajoie, Guillaume and Belilovsky, Eugene}, year = {2023}, booktitle = {ICML Workshop on New Frontiers in Learning, Control, and Dynamical Systems}, url = {https://openreview.net/forum?id=MusMaHCrs2}, }
- Neural manifolds and learning regimes in neural-interface tasks. Alexandre Payeur, Amy L. Orsborn, and Guillaume Lajoie. bioRxiv, Mar 2023
Neural activity tends to reside on manifolds whose dimension is lower than the dimension of the whole neural state space. Experiments using brain-computer interfaces (BCIs) with microelectrode arrays implanted in the motor cortex of nonhuman primates have provided ways to test whether neural manifolds influence learning-related neural computations. Starting from a learned BCI-controlled motor task, these experiments explored the effect of changing the BCI decoder to implement perturbations that were either “aligned” or not with the pre-existing neural manifold. In a series of studies, researchers found that within-manifold perturbations (WMPs) evoked fast reassociations of existing neural patterns for rapid adaptation, while outside-manifold perturbations (OMPs) triggered a slower adaptation process that led to the emergence of new neural patterns. Together, these findings have been interpreted as suggesting that these different rates of adaptation might be associated with distinct learning mechanisms. Here, we investigated whether gradient-descent learning could alone explain these differences. Using an idealized model that captures the fixed-point dynamics of recurrent neural networks, we uncovered gradient-based learning dynamics consistent with experimental findings. Crucially, this experimental match arose only when the network was initialized in a lazier learning regime, a concept inherited from deep learning theory. A lazy learning regime—in contrast with a rich regime—implies small changes on synaptic strengths throughout learning. For OMPs, these small changes were less effective at increasing performance and could lead to unstable adaptation with a heightened sensitivity to learning rates. For WMPs, they helped reproduce the reassociation mechanism on short adaptation time scales, especially with large input variances. Since gradient descent has many biologically plausible variants, our findings establish lazy gradient-based learning as a plausible mechanism for adaptation under network-level constraints and unify several experimental results from the literature. Competing interest statement: ALO is a scientific advisor for Meta Reality Labs; GL is a scientific advisor for BIOS Health.
@article{payeur2023neural, title = {Neural manifolds and learning regimes in neural-interface tasks}, author = {Payeur, Alexandre and Orsborn, Amy L. and Lajoie, Guillaume}, year = {2023}, journal = {bioRxiv}, publisher = {Cold Spring Harbor Laboratory}, doi = {10.1101/2023.03.11.532146}, url = {https://www.biorxiv.org/content/early/2023/12/23/2023.03.11.532146}, elocation-id = {2023.03.11.532146}, eprint = {https://www.biorxiv.org/content/early/2023/12/23/2023.03.11.532146.full.pdf}, }
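The two decoder perturbations discussed in the entry above can be made concrete with a toy construction: estimate an "intrinsic manifold" from the top principal components of simulated population activity, then build a within-manifold perturbation that permutes how manifold dimensions drive the cursor, and an outside-manifold perturbation that reads out low-variance dimensions instead. The dimensions and random data are placeholders; this mirrors the experimental logic, not the paper's model.

```python
import numpy as np

rng = np.random.default_rng(0)
# simulated population activity (1000 time points, 100 neurons) with most
# variance confined to a low-dimensional intrinsic manifold
latent = rng.normal(size=(1000, 10))
Z = latent @ rng.normal(size=(10, 100)) + rng.normal(scale=0.1, size=(1000, 100))

_, _, Vt = np.linalg.svd(Z - Z.mean(axis=0), full_matrices=False)
manifold = Vt[:10]      # top components: the intrinsic manifold
outside = Vt[-10:]      # low-variance components outside the manifold

decoder = rng.normal(size=(2, 10))    # maps 10 latent dimensions to 2D cursor velocity
perm = rng.permutation(10)

# within-manifold perturbation: shuffle which manifold dimensions drive the cursor
cursor_wmp = (Z @ manifold.T)[:, perm] @ decoder.T
# outside-manifold perturbation: drive the cursor from off-manifold dimensions
cursor_omp = (Z @ outside.T) @ decoder.T
```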
- LEAD: Min-Max Optimization from a Physical Perspective. Reyhane Askari Hemmat, Amartya Mitra, Guillaume Lajoie, and Ioannis Mitliagkas. Transactions on Machine Learning Research, Mar 2023. Featured Certification
@article{askarihemmat2023lead, title = {{LEAD}: Min-Max Optimization from a Physical Perspective}, author = {Hemmat, Reyhane Askari and Mitra, Amartya and Lajoie, Guillaume and Mitliagkas, Ioannis}, year = {2023}, journal = {Transactions on Machine Learning Research}, issn = {2835-8856}, url = {https://openreview.net/forum?id=vXSsTYs6ZB}, note = {Featured Certification}, }
- How gradient estimator variance and bias impact learning in neural networks. Arna Ghosh, Yuhan Helena Liu, Guillaume Lajoie, Konrad Kording, and Blake Aaron Richards. In The Eleventh International Conference on Learning Representations, Mar 2023
@inproceedings{ghosh2023how, title = {How gradient estimator variance and bias impact learning in neural networks}, author = {Ghosh, Arna and Liu, Yuhan Helena and Lajoie, Guillaume and Kording, Konrad and Richards, Blake Aaron}, year = {2023}, booktitle = {The Eleventh International Conference on Learning Representations}, url = {https://openreview.net/forum?id=EBC60mxBwyw}, }
- Reliability of CKA as a Similarity Measure in Deep Learning. MohammadReza Davari, Stefan Horoi, Amine Natik, Guillaume Lajoie, Guy Wolf, and 1 more author. In The Eleventh International Conference on Learning Representations, Mar 2023
@inproceedings{davari2023reliability, title = {Reliability of {CKA} as a Similarity Measure in Deep Learning}, author = {Davari, MohammadReza and Horoi, Stefan and Natik, Amine and Lajoie, Guillaume and Wolf, Guy and Belilovsky, Eugene}, year = {2023}, booktitle = {The Eleventh International Conference on Learning Representations}, url = {https://openreview.net/forum?id=8HRvyxc606}, }
- Transfer Entropy Bottleneck: Learning Sequence to Sequence Information Transfer. Damjan Kalajdzievski, Ximeng Mao, Pascal Fortier-Poisson, Guillaume Lajoie, and Blake Aaron Richards. Transactions on Machine Learning Research, Mar 2023
@article{kalajdzievski2023transfer, title = {Transfer Entropy Bottleneck: Learning Sequence to Sequence Information Transfer}, author = {Kalajdzievski, Damjan and Mao, Ximeng and Fortier-Poisson, Pascal and Lajoie, Guillaume and Richards, Blake Aaron}, year = {2023}, journal = {Transactions on Machine Learning Research}, issn = {2835-8856}, url = {https://openreview.net/forum?id=kJcwlP7BRs}, }
2022
- Is a Modular Architecture Enough?Sarthak Mittal, Yoshua Bengio, and Guillaume LajoieIn Advances in Neural Information Processing Systems, Mar 2022
@inproceedings{mittal2022is, title = {Is a Modular Architecture Enough?}, author = {Mittal, Sarthak and Bengio, Yoshua and Lajoie, Guillaume}, year = {2022}, booktitle = {Advances in Neural Information Processing Systems}, publisher = {Curran Associates, Inc.}, volume = {35}, pages = {28747--28760}, url = {https://proceedings.neurips.cc/paper\%5Ffiles/paper/2022/file/b8d1d741f137d9b6ac4f3c1683791e4a-Paper-Conference.pdf}, editor = {Koyejo, S. and Mohamed, S. and Agarwal, A. and Belgrave, D. and Cho, K. and Oh, A.}, }
- Beyond accuracy: generalization properties of bio-plausible temporal credit assignment rulesYuhan Helena Liu, Arna Ghosh, Blake Richards, Eric Shea-Brown, and Guillaume LajoieIn Advances in Neural Information Processing Systems, Mar 2022
@inproceedings{liu2022accuracy, title = {Beyond accuracy: generalization properties of bio-plausible temporal credit assignment rules}, author = {Liu, Yuhan Helena and Ghosh, Arna and Richards, Blake and Shea-Brown, Eric and Lajoie, Guillaume}, year = {2022}, booktitle = {Advances in Neural Information Processing Systems}, publisher = {Curran Associates, Inc.}, volume = {35}, pages = {23077--23097}, url = {https://proceedings.neurips.cc/paper\%5Ffiles/paper/2022/file/9226f8122feb9c229c1efd9270ce7021-Paper-Conference.pdf}, editor = {Koyejo, S. and Mohamed, S. and Agarwal, A. and Belgrave, D. and Cho, K. and Oh, A.}, }
- A connectomics-based taxonomy of mammalsLaura E Suarez, Yossi Yovel, Martijn P Heuvel, Olaf Sporns, Yaniv Assaf, and 2 more authorseLife, Nov 2022
Mammalian taxonomies are conventionally defined by morphological traits and genetics. How species differ in terms of neural circuits and whether inter-species differences in neural circuit organization conform to these taxonomies is unknown. The main obstacle to the comparison of neural architectures has been differences in network reconstruction techniques, yielding species-specific connectomes that are not directly comparable to one another. Here, we comprehensively chart connectome organization across the mammalian phylogenetic spectrum using a common reconstruction protocol. We analyse the mammalian MRI (MaMI) data set, a database that encompasses high-resolution ex vivo structural and diffusion MRI scans of 124 species across 12 taxonomic orders and 5 superorders, collected using a unified MRI protocol. We assess similarity between species connectomes using two methods: similarity of Laplacian eigenspectra and similarity of multiscale topological features. We find greater inter-species similarities among species within the same taxonomic order, suggesting that connectome organization reflects established taxonomic relationships defined by morphology and genetics. While all connectomes retain hallmark global features and relative proportions of connection classes, inter-species variation is driven by local regional connectivity profiles. By encoding connectomes into a common frame of reference, these findings establish a foundation for investigating how neural circuits change over phylogeny, forging a link from genes to circuits to behaviour.
@article{suarez2022connectomics-based, title = {A connectomics-based taxonomy of mammals}, author = {Suarez, Laura E and Yovel, Yossi and van den Heuvel, Martijn P and Sporns, Olaf and Assaf, Yaniv and Lajoie, Guillaume and Misic, Bratislav}, year = {2022}, month = nov, journal = {eLife}, publisher = {eLife Sciences Publications, Ltd}, volume = {11}, pages = {e78635}, doi = {10.7554/eLife.78635}, issn = {2050-084x}, url = {https://doi.org/10.7554/eLife.78635}, article_type = {journal}, editor = {Baker, Chris I and Jbabdi, Saad and Heuer, Katja}, pub_date = {2022-11-07}, citation = {eLife 2022;11:e78635}, keywords = {connectomics, mammals, taxonomy}, }
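One of the two similarity measures named in the abstract is similarity of Laplacian eigenspectra. A generic numpy sketch of that idea (quantile-interpolated spectra compared with a Euclidean distance; the paper's exact spectral statistic may differ):

```python
import numpy as np

def laplacian_spectrum(A):
    """Eigenvalues of the symmetric normalized Laplacian of adjacency matrix A."""
    deg = A.sum(axis=1)
    d_inv_sqrt = np.where(deg > 0, 1.0 / np.sqrt(deg), 0.0)
    L = np.eye(len(A)) - d_inv_sqrt[:, None] * A * d_inv_sqrt[None, :]
    return np.sort(np.linalg.eigvalsh(L))

def spectral_distance(A1, A2):
    """Compare two connectomes of possibly different sizes via interpolated spectra."""
    s1, s2 = laplacian_spectrum(A1), laplacian_spectrum(A2)
    grid = np.linspace(0, 1, 100)                          # common quantile grid
    q1 = np.interp(grid, np.linspace(0, 1, len(s1)), s1)
    q2 = np.interp(grid, np.linspace(0, 1, len(s2)), s2)
    return np.linalg.norm(q1 - q2)

# Example with two small random undirected graphs standing in for connectomes.
rng = np.random.default_rng(1)
A1 = (rng.random((30, 30)) < 0.2).astype(float); A1 = np.triu(A1, 1); A1 += A1.T
A2 = (rng.random((40, 40)) < 0.2).astype(float); A2 = np.triu(A2, 1); A2 += A2.T
print(spectral_distance(A1, A2))
```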
- From Points to Functions: Infinite-dimensional Representations in Diffusion ModelsSarthak Mittal, Guillaume Lajoie, Stefan Bauer, and Arash MehrjouNov 2022
@misc{mittal2022points, title = {From Points to Functions: Infinite-dimensional Representations in Diffusion Models}, author = {Mittal, Sarthak and Lajoie, Guillaume and Bauer, Stefan and Mehrjou, Arash}, year = {2022}, url = {https://arxiv.org/abs/2210.13774}, eprint = {2210.13774}, archiveprefix = {arXiv}, primaryclass = {cs.LG}, }
- Lazy vs hasty: linearization in deep networks impacts learning schedule based on example difficultyThomas George, Guillaume Lajoie, and Aristide BaratinTransactions on Machine Learning Research, Nov 2022
@article{george2022lazy, title = {Lazy vs hasty: linearization in deep networks impacts learning schedule based on example difficulty}, author = {George, Thomas and Lajoie, Guillaume and Baratin, Aristide}, year = {2022}, journal = {Transactions on Machine Learning Research}, issn = {2835-8856}, url = {https://openreview.net/forum?id=lukVf4VrfP}, }
- Learning Shared Neural Manifolds from Multi-Subject FMRI DataJessie Huang, Erica Busch, Tom Wallenstein, Michal Gerasimiuk, Andrew Benz, and 4 more authorsIn 2022 IEEE 32nd International Workshop on Machine Learning for Signal Processing (MLSP), Aug 2022
Functional magnetic resonance imaging (fMRI) data is collected in millions of noisy, redundant dimensions. To understand how different brains process the same stimulus, we aim to denoise the fMRI signal via a meaningful embedding space that captures the data’s intrinsic structure as shared across brains. We assume that stimulus-driven responses share latent features common across subjects that are jointly discoverable. Previous approaches to this problem have relied on linear methods like principal component analysis and shared response modeling. We propose a neural network called MRMD-AE (manifold-regularized multiple-decoder autoencoder) that learns a common embedding from multi-subject fMRI data while retaining the ability to decode individual responses. Our latent common space represents an extensible manifold (where untrained data can be mapped) and improves classification accuracy of stimulus features of unseen timepoints, as well as cross-subject translation of fMRI signals.
@inproceedings{huang2022learning, title = {Learning Shared Neural Manifolds from Multi-Subject FMRI Data}, author = {Huang, Jessie and Busch, Erica and Wallenstein, Tom and Gerasimiuk, Michal and Benz, Andrew and Lajoie, Guillaume and Wolf, Guy and Turk-Browne, Nicholas and Krishnaswamy, Smita}, year = {2022}, month = aug, booktitle = {2022 IEEE 32nd International Workshop on Machine Learning for Signal Processing (MLSP)}, pages = {01--06}, doi = {10.1109/mlsp55214.2022.9943383}, issn = {2161-0371}, keywords = {Manifolds;Conferences;Machine learning;Functional magnetic resonance imaging;Signal processing;Brain modeling;Decoding}, }
- Multi-scale Feature Learning Dynamics: Insights for Double DescentMohammad Pezeshki, Amartya Mitra, Yoshua Bengio, and Guillaume LajoieIn Proceedings of the 39th International Conference on Machine Learning, 17–23 jul 2022
An intriguing phenomenon that arises from the high-dimensional learning dynamics of neural networks is the phenomenon of “double descent”. The more commonly studied aspect of this phenomenon corresponds to model-wise double descent where the test error exhibits a second descent with increasing model complexity, beyond the classical U-shaped error curve. In this work, we investigate the origins of the less studied epoch-wise double descent in which the test error undergoes two non-monotonous transitions, or descents as the training time increases. We study a linear teacher-student setup exhibiting epoch-wise double descent similar to that in deep neural networks. In this setting, we derive closed-form analytical expressions describing the generalization error in terms of low-dimensional scalar macroscopic variables. We find that double descent can be attributed to distinct features being learned at different scales: as fast-learning features overfit, slower-learning features start to fit, resulting in a second descent in test error. We validate our findings through numerical simulations where our theory accurately predicts empirical findings and remains consistent with observations in deep neural networks.
@inproceedings{pezeshki2022multi-scale, title = {Multi-scale Feature Learning Dynamics: Insights for Double Descent}, author = {Pezeshki, Mohammad and Mitra, Amartya and Bengio, Yoshua and Lajoie, Guillaume}, year = {2022}, month = {17--23 Jul}, booktitle = {Proceedings of the 39th International Conference on Machine Learning}, publisher = {Pmlr}, series = {Proceedings of Machine Learning Research}, volume = {162}, pages = {17669--17690}, url = {https://proceedings.mlr.press/v162/pezeshki22a.html}, editor = {Chaudhuri, Kamalika and Jegelka, Stefanie and Song, Le and Szepesvari, Csaba and Niu, Gang and Sabato, Sivan}, }
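The abstract attributes epoch-wise double descent to features learned at different speeds. The toy below only sets up that ingredient: a linear teacher-student task whose two feature groups have very different input scales, with test error tracked over gradient-descent epochs. It is an assumption-laden sketch, not the paper's analytical setup, and whether a clear second descent appears depends on the chosen noise level and scales.

```python
import numpy as np

rng = np.random.default_rng(0)

# Two feature groups with very different input scales -> very different learning speeds.
n_train, n_test, d_fast, d_slow = 50, 1000, 20, 20
scales = np.concatenate([np.full(d_fast, 5.0), np.full(d_slow, 0.2)])
w_teacher = rng.normal(size=d_fast + d_slow)

def sample(n):
    X = rng.normal(size=(n, d_fast + d_slow)) * scales
    y = X @ w_teacher + 2.0 * rng.normal(size=n)   # noisy teacher labels
    return X, y

X_tr, y_tr = sample(n_train)
X_te, y_te = sample(n_test)

w = np.zeros(d_fast + d_slow)
lr, epochs = 2e-3, 20000
test_err = []
for t in range(epochs):
    grad = X_tr.T @ (X_tr @ w - y_tr) / n_train
    w -= lr * grad
    if t % 100 == 0:
        test_err.append(np.mean((X_te @ w - y_te) ** 2))

# Inspect the test-error trajectory; a non-monotonic shape, when it appears,
# reflects fast features fitting (and overfitting) before slow ones catch up.
print(np.round(test_err[:10], 3), "...", np.round(test_err[-3:], 3))
```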
- On Neural Architecture Inductive Biases for Relational TasksGiancarlo Kerg, Sarthak Mittal, David Rolnick, Yoshua Bengio, Blake Richards, and 1 more author17–23 jul 2022
@misc{kerg2022neural, title = {On Neural Architecture Inductive Biases for Relational Tasks}, author = {Kerg, Giancarlo and Mittal, Sarthak and Rolnick, David and Bengio, Yoshua and Richards, Blake and Lajoie, Guillaume}, year = {2022}, url = {https://arxiv.org/abs/2206.05056}, eprint = {2206.05056}, archiveprefix = {arXiv}, primaryclass = {cs.NE}, }
- Gradient-based learning drives robust representations in recurrent neural networks by balancing compression and expansionMatthew Farrell, Stefano Recanatesi, Timothy Moore, Guillaume Lajoie, and Eric Shea-BrownNature Machine Intelligence, Jun 2022
@article{farrell2022gradient-based, title = {Gradient-based learning drives robust representations in recurrent neural networks by balancing compression and expansion}, author = {Farrell, Matthew and Recanatesi, Stefano and Moore, Timothy and Lajoie, Guillaume and Shea-Brown, Eric}, year = {2022}, month = jun, journal = {Nature Machine Intelligence}, publisher = {Springer Science and Business Media LLC}, volume = {4}, number = {6}, pages = {564–573}, doi = {10.1038/s42256-022-00498-0}, issn = {2522-5839}, url = {http://dx.doi.org/10.1038/s42256-022-00498-0}, }
- Performance-gated deliberation: A context-adapted strategy in which urgency is opportunity costMaximilian Puelma Touzel, Paul Cisek, and Guillaume LajoiePLOS Computational Biology, May 2022
Finding the right amount of deliberation, between insufficient and excessive, is a hard decision making problem that depends on the value we place on our time. Average-reward, putatively encoded by tonic dopamine, serves in existing reinforcement learning theory as the opportunity cost of time, including deliberation time. Importantly, this cost can itself vary with the environmental context and is not trivial to estimate. Here, we propose how the opportunity cost of deliberation can be estimated adaptively on multiple timescales to account for non-stationary contextual factors. We use it in a simple decision-making heuristic based on average-reward reinforcement learning (AR-RL) that we call Performance-Gated Deliberation (PGD). We propose PGD as a strategy used by animals wherein deliberation cost is implemented directly as urgency, a previously characterized neural signal effectively controlling the speed of the decision-making process. We show PGD outperforms AR-RL solutions in explaining behaviour and urgency of non-human primates in a context-varying random walk prediction task and is consistent with relative performance and urgency in a context-varying random dot motion task. We make readily testable predictions for both neural activity and behaviour.
@article{puelmatouzel2022performance-gated, title = {Performance-gated deliberation: A context-adapted strategy in which urgency is opportunity cost}, author = {Puelma Touzel, Maximilian and Cisek, Paul and Lajoie, Guillaume}, year = {2022}, month = may, journal = {PLOS Computational Biology}, publisher = {Public Library of Science}, volume = {18}, number = {5}, pages = {1--33}, doi = {10.1371/journal.pcbi.1010080}, url = {https://doi.org/10.1371/journal.pcbi.1010080}, }
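A small sketch of one ingredient described above: estimating an average-reward opportunity cost adaptively on multiple timescales. The combination rule and all constants here are assumptions for illustration, not PGD as specified in the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Reward stream whose context (mean reward rate) switches halfway through.
rewards = np.concatenate([rng.poisson(2.0, 500), rng.poisson(0.5, 500)]).astype(float)

# Exponential moving averages of reward on a fast and a slow timescale.
alpha_fast, alpha_slow = 0.05, 0.005
fast = slow = rewards[0]
opportunity_cost = []
for r in rewards:
    fast += alpha_fast * (r - fast)
    slow += alpha_slow * (r - slow)
    # One simple way to combine timescales: trust the slow estimate, but let
    # large fast/slow discrepancies (context switches) pull the estimate quickly.
    est = slow + 0.5 * (fast - slow)
    opportunity_cost.append(est)

# A deliberation policy could then stop deliberating once the expected gain of
# thinking longer drops below this reward-per-unit-time estimate times the extra time.
print(np.round(opportunity_cost[490:510], 3))
```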
- Embedding Signals on Graphs with Unbalanced Diffusion Earth Mover’s DistanceAlexander Tong, Guillaume Huguet, Dennis Shung, Amine Natik, Manik Kuchroo, and 3 more authorsIn ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2022
In modern relational machine learning it is common to encounter large graphs that arise via interactions or similarities between observations in many domains. Further, in many cases the target entities for analysis are actually signals on such graphs. We propose to compare and organize such datasets of graph signals by using an earth mover’s distance (EMD) with a geodesic cost over the underlying graph. Typically, EMD is computed by optimizing over the cost of transporting one probability distribution to another over an underlying metric space. However, this is inefficient when computing the EMD between many signals. Here, we propose an unbalanced graph EMD that efficiently embeds the unbalanced EMD on an underlying graph into an L1 space, whose metric we call unbalanced diffusion earth mover’s distance (UDEMD). Next, we show how this gives distances between graph signals that are robust to noise. Finally, we apply this to organizing patients based on clinical notes, embedding cells modeled as signals on a gene graph, and organizing genes modeled as signals over a large cell graph. In each case, we show that UDEMD-based embeddings find accurate distances that are highly efficient compared to other methods.
@inproceedings{tong2022embedding, title = {Embedding Signals on Graphs with Unbalanced Diffusion Earth Mover's Distance}, author = {Tong, Alexander and Huguet, Guillaume and Shung, Dennis and Natik, Amine and Kuchroo, Manik and Lajoie, Guillaume and Wolf, Guy and Krishnaswamy, Smita}, year = {2022}, month = may, booktitle = {ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)}, pages = {5647--5651}, doi = {10.1109/icassp43922.2022.9746556}, issn = {2379-190x}, keywords = {Earth;Costs;Conferences;Computational modeling;Biological system modeling;Machine learning;Signal processing;Optimal transport;graph signal processing;knowledge graph;graph diffusion}, }
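The core idea above is to embed graph signals so that an L1 distance between embeddings approximates an earth mover's distance under a diffusion-based ground metric. A rough numpy sketch of a multiscale diffusion embedding in that spirit; the scale weighting and the unbalanced-mass handling of UDEMD are not reproduced here.

```python
import numpy as np

def diffusion_operator(A):
    """Lazy random-walk diffusion operator P = (I + D^-1 A) / 2."""
    deg = A.sum(axis=1, keepdims=True)
    return 0.5 * (np.eye(len(A)) + A / np.maximum(deg, 1e-12))

def multiscale_embedding(P, signal, n_scales=4):
    """Stack diffusions of a graph signal at dyadic scales.

    An L1 distance between such stacks is one way to approximate an earth
    mover's distance with a diffusion ground metric; the weights below are
    illustrative and differ from UDEMD's.
    """
    feats = []
    for k in range(n_scales):
        diffused = np.linalg.matrix_power(P, 2**k) @ signal
        feats.append(2.0 ** (-k) * diffused)
    return np.concatenate(feats)

rng = np.random.default_rng(0)
A = (rng.random((50, 50)) < 0.1).astype(float); A = np.triu(A, 1); A += A.T
P = diffusion_operator(A)
s1, s2 = rng.random(50), rng.random(50)        # two (unnormalized) graph signals
emd_like = np.abs(multiscale_embedding(P, s1) - multiscale_embedding(P, s2)).sum()
print(emd_like)
```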
- Exploring the Geometry and Topology of Neural Network Loss LandscapesStefan Horoi, Jessie Huang, Bastian Rieck, Guillaume Lajoie, Guy Wolf, and 1 more authorIn Advances in Intelligent Data Analysis XX, May 2022
Recent work has established clear links between the generalization performance of trained neural networks and the geometry of their loss landscape near the local minima to which they converge. This suggests that qualitative and quantitative examination of the loss landscape geometry could yield insights about neural network generalization performance during training. To this end, researchers have proposed visualizing the loss landscape through the use of simple dimensionality reduction techniques. However, such visualization methods have been limited by their linear nature and only capture features in one or two dimensions, thus restricting sampling of the loss landscape to lines or planes. Here, we expand and improve upon these in three ways. First, we present a novel “jump and retrain” procedure for sampling relevant portions of the loss landscape. We show that the resulting sampled data holds more meaningful information about the network’s ability to generalize. Next, we show that non-linear dimensionality reduction of the jump and retrain trajectories via PHATE, a trajectory and manifold-preserving method, allows us to visualize differences between networks that are generalizing well vs poorly. Finally, we combine PHATE trajectories with a computational homology characterization to quantify trajectory differences.
@inproceedings{horoi2022exploring, title = {Exploring the Geometry and Topology of Neural Network Loss Landscapes}, author = {Horoi, Stefan and Huang, Jessie and Rieck, Bastian and Lajoie, Guillaume and Wolf, Guy and Krishnaswamy, Smita}, year = {2022}, booktitle = {Advances in Intelligent Data Analysis XX}, publisher = {Springer International Publishing}, address = {Cham}, pages = {171--184}, isbn = {978-3-031-01333-1}, editor = {Bouadi, Tassadit and Fromont, Elisa and H{\"u}llermeier, Eyke}, }
- Continuous-Time Meta-Learning with Forward Mode DifferentiationTristan Deleu, David Kanaa, Leo Feng, Giancarlo Kerg, Yoshua Bengio, and 2 more authorsIn International Conference on Learning Representations, May 2022
@inproceedings{deleu2022continuous-time, title = {Continuous-Time Meta-Learning with Forward Mode Differentiation}, author = {Deleu, Tristan and Kanaa, David and Feng, Leo and Kerg, Giancarlo and Bengio, Yoshua and Lajoie, Guillaume and Bacon, Pierre-Luc}, year = {2022}, booktitle = {International Conference on Learning Representations}, url = {https://openreview.net/forum?id=57PipS27Km}, }
- On Lyapunov Exponents for RNNs: Understanding Information Propagation Using Dynamical Systems ToolsRyan Vogt, Maximilian Puelma Touzel, Eli Shlizerman, and Guillaume LajoieFrontiers in Applied Mathematics and Statistics, May 2022
Recurrent neural networks (RNNs) have been successfully applied to a variety of problems involving sequential data, but their optimization is sensitive to parameter initialization, architecture, and optimizer hyperparameters. Considering RNNs as dynamical systems, a natural way to capture stability, i.e., the growth and decay over long iterates, is the Lyapunov Exponents (LEs), which form the Lyapunov spectrum. The LEs have a bearing on stability of RNN training dynamics since forward propagation of information is related to the backward propagation of error gradients. LEs measure the asymptotic rates of expansion and contraction of non-linear system trajectories, and generalize stability analysis to the time-varying attractors structuring the non-autonomous dynamics of data-driven RNNs. As a tool to understand and exploit stability of training dynamics, the Lyapunov spectrum fills an existing gap between prescriptive mathematical approaches of limited scope and computationally-expensive empirical approaches. To leverage this tool, we implement an efficient way to compute LEs for RNNs during training, discuss the aspects specific to standard RNN architectures driven by typical sequential datasets, and show that the Lyapunov spectrum can serve as a robust readout of training stability across hyperparameters. With this exposition-oriented contribution, we hope to draw attention to this under-studied, but theoretically grounded tool for understanding training stability in RNNs.
@article{10.3389/fams.2022.818799, title = {On Lyapunov Exponents for RNNs: Understanding Information Propagation Using Dynamical Systems Tools}, author = {Vogt, Ryan and Puelma Touzel, Maximilian and Shlizerman, Eli and Lajoie, Guillaume}, year = {2022}, journal = {Frontiers in Applied Mathematics and Statistics}, volume = {Volume 8 - 2022}, doi = {10.3389/fams.2022.818799}, issn = {2297-4687}, url = {https://www.frontiersin.org/journals/applied-mathematics-and-statistics/articles/10.3389/fams.2022.818799}, }
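The Lyapunov spectrum of a driven RNN can be estimated with the standard QR (Benettin-style) recursion on the hidden-state Jacobians, which is the core of what the paper computes efficiently during training. A minimal numpy sketch for a vanilla tanh RNN; the weights, input statistics, and sequence length are arbitrary choices here.

```python
import numpy as np

rng = np.random.default_rng(0)

# A small vanilla RNN driven by random inputs.
n_hidden, n_in, T = 32, 8, 2000
W = rng.normal(size=(n_hidden, n_hidden)) * 1.1 / np.sqrt(n_hidden)   # recurrent weights
U = rng.normal(size=(n_hidden, n_in)) / np.sqrt(n_in)                 # input weights
x = rng.normal(size=(T, n_in))

h = np.zeros(n_hidden)
Q = np.eye(n_hidden)
log_r = np.zeros(n_hidden)

for t in range(T):
    h = np.tanh(W @ h + U @ x[t])
    J = (1.0 - h**2)[:, None] * W      # Jacobian dh_{t+1}/dh_t for the tanh update
    Q, R = np.linalg.qr(J @ Q)         # re-orthonormalize the propagated frame
    log_r += np.log(np.abs(np.diag(R)))

lyapunov_spectrum = np.sort(log_r / T)[::-1]
print("largest exponents:", np.round(lyapunov_spectrum[:5], 3))
```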
- Clarifying MCMC-based training of modern EBMs : Contrastive Divergence versus Maximum LikelihoodLéo Gagnon and Guillaume LajoieMay 2022
@misc{gagnon2022clarifying, title = {Clarifying MCMC-based training of modern EBMs : Contrastive Divergence versus Maximum Likelihood}, author = {Gagnon, L\'{e}o and Lajoie, Guillaume}, year = {2022}, url = {https://arxiv.org/abs/2202.12176}, eprint = {2202.12176}, archiveprefix = {arXiv}, primaryclass = {cs.LG}, }
- Compositional Attention: Disentangling Search and RetrievalSarthak Mittal, Sharath Chandra Raparthy, Irina Rish, Yoshua Bengio, and Guillaume LajoieIn International Conference on Learning Representations, May 2022
@inproceedings{mittal2022compositional, title = {Compositional Attention: Disentangling Search and Retrieval}, author = {Mittal, Sarthak and Raparthy, Sharath Chandra and Rish, Irina and Bengio, Yoshua and Lajoie, Guillaume}, year = {2022}, booktitle = {International Conference on Learning Representations}, url = {https://openreview.net/forum?id=IwJPj2MBcIa}, }
- Efficient and robust multi-task learning in the brain with modular latent primitivesChristian David Márton, Léo Gagnon, Guillaume Lajoie, and Kanaka RajanMay 2022
@misc{davidmárton2022efficient, title = {Efficient and robust multi-task learning in the brain with modular latent primitives}, author = {M\'{a}rton, Christian David and Gagnon, L\'{e}o and Lajoie, Guillaume and Rajan, Kanaka}, year = {2022}, url = {https://arxiv.org/abs/2105.14108}, eprint = {2105.14108}, archiveprefix = {arXiv}, primaryclass = {cs.AI}, }
2021
- Gradient Starvation: A Learning Proclivity in Neural NetworksMohammad Pezeshki, Oumar Kaba, Yoshua Bengio, Aaron C Courville, Doina Precup, and 1 more authorIn Advances in Neural Information Processing Systems, May 2021
@inproceedings{pezeshki2021gradient, title = {Gradient Starvation: A Learning Proclivity in Neural Networks}, author = {Pezeshki, Mohammad and Kaba, Oumar and Bengio, Yoshua and Courville, Aaron C and Precup, Doina and Lajoie, Guillaume}, year = {2021}, booktitle = {Advances in Neural Information Processing Systems}, publisher = {Curran Associates, Inc.}, volume = {34}, pages = {1256--1272}, url = {https://proceedings.neurips.cc/paper\%5Ffiles/paper/2021/file/0987b8b338d6c90bbedd8631bc499221-Paper.pdf}, editor = {Ranzato, M. and Beygelzimer, A. and Dauphin, Y. and Liang, P.S. and Vaughan, J. Wortman}, }
- Learning function from structure in neuromorphic networksLaura E. Suárez, Blake A. Richards, Guillaume Lajoie, and Bratislav MisicNature Machine Intelligence, Aug 2021
@article{suárez2021learning, title = {Learning function from structure in neuromorphic networks}, author = {Su\'{a}rez, Laura E. and Richards, Blake A. and Lajoie, Guillaume and Misic, Bratislav}, year = {2021}, month = aug, journal = {Nature Machine Intelligence}, publisher = {Springer Science and Business Media LLC}, volume = {3}, number = {9}, pages = {771–786}, doi = {10.1038/s42256-021-00376-1}, issn = {2522-5839}, url = {http://dx.doi.org/10.1038/s42256-021-00376-1}, }
- Learning Brain Dynamics With Coupled Low-Dimensional Nonlinear Oscillators and Deep Recurrent NetworksGermán Abrevaya, Guillaume Dumas, Aleksandr Y. Aravkin, Peng Zheng, Jean-Christophe Gagnon-Audet, and 7 more authorsNeural Computation, Jul 2021
Many natural systems, especially biological ones, exhibit complex multivariate nonlinear dynamical behaviors that can be hard to capture by linear autoregressive models. On the other hand, generic nonlinear models such as deep recurrent neural networks often require large amounts of training data, not always available in domains such as brain imaging; also, they often lack interpretability. Domain knowledge about the types of dynamics typically observed in such systems, such as a certain type of dynamical systems models, could complement purely data-driven techniques by providing a good prior. In this work, we consider a class of ordinary differential equation (ODE) models known as van der Pol (VDP) oscillators and evaluate their ability to capture a low-dimensional representation of neural activity measured by different brain imaging modalities, such as calcium imaging (CaI) and fMRI, in different living organisms: larval zebrafish, rat, and human. We develop a novel and efficient approach to the nontrivial problem of parameter estimation for a network of coupled dynamical systems from multivariate data and demonstrate that the resulting VDP models are both accurate and interpretable, as VDP’s coupling matrix reveals anatomically meaningful excitatory and inhibitory interactions across different brain subsystems. VDP outperforms linear autoregressive models (VAR) in terms of both the data fit accuracy and the quality of insight provided by the coupling matrices and often tends to generalize better to unseen data when predicting future brain activity, being comparable to and sometimes better than the recurrent neural networks (LSTMs). Finally, we demonstrate that our (generative) VDP model can also serve as a data-augmentation tool leading to marked improvements in predictive accuracy of recurrent neural networks. Thus, our work contributes to both basic and applied dimensions of neuroimaging: gaining scientific insights and improving brain-based predictive models, an area of potentially high practical importance in clinical diagnosis and neurotechnology.
@article{abrevaya2021learning, title = {Learning Brain Dynamics With Coupled Low-Dimensional Nonlinear Oscillators and Deep Recurrent Networks}, author = {Abrevaya, Germ\'{a}n and Dumas, Guillaume and Aravkin, Aleksandr Y. and Zheng, Peng and Gagnon-Audet, Jean-Christophe and Kozloski, James and Polosecki, Pablo and Lajoie, Guillaume and Cox, David and Dawson, Silvina Ponce and Cecchi, Guillermo and Rish, Irina}, year = {2021}, month = jul, journal = {Neural Computation}, volume = {33}, number = {8}, pages = {2087--2127}, doi = {10.1162/neco_a_01401}, issn = {0899-7667}, url = {https://doi.org/10.1162/neco\%5Fa\%5F01401}, eprint = {https://direct.mit.edu/neco/article-pdf/33/8/2087/1930932/neco\_a\_01401.pdf}, }
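For readers unfamiliar with the model class, a coupled van der Pol network can be simulated in a few lines. The sketch below assumes a simple additive coupling on the positions and arbitrary parameters; the paper's contribution is estimating such parameters from multivariate recordings, and its exact parameterization may differ.

```python
import numpy as np
from scipy.integrate import solve_ivp

rng = np.random.default_rng(0)

n = 5                                   # number of coupled VDP oscillators
mu = 1.0                                # nonlinearity / damping parameter
Wc = 0.2 * rng.normal(size=(n, n))      # coupling matrix (the interpretable object)
np.fill_diagonal(Wc, 0.0)

def coupled_vdp(t, state):
    """state = [x_1..x_n, v_1..v_n]; each unit obeys x'' = mu (1 - x^2) x' - x + coupling."""
    x, v = state[:n], state[n:]
    dx = v
    dv = mu * (1.0 - x**2) * v - x + Wc @ x
    return np.concatenate([dx, dv])

state0 = 0.1 * rng.normal(size=2 * n)
sol = solve_ivp(coupled_vdp, (0.0, 50.0), state0, max_step=0.05)
print(sol.y.shape)   # (2n, timepoints): simulated "neural" trajectories
```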
- Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement LearningNan Rosemary Ke, Aniket Didolkar, Sarthak Mittal, Anirudh Goyal ALIAS PARTH GOYAL, Guillaume Lajoie, and 5 more authorsIn Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks, Jul 2021
@inproceedings{ke2021systematic, title = {Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning}, author = {Ke, Nan Rosemary and Didolkar, Aniket and Mittal, Sarthak and ALIAS PARTH GOYAL, Anirudh Goyal and Lajoie, Guillaume and Bauer, Stefan and Jimenez Rezende, Danilo and Mozer, Michael and Bengio, Yoshua and Pal, Chris}, year = {2021}, booktitle = {Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks}, volume = {1}, url = {https://datasets-benchmarks-proceedings.neurips.cc/paper\%5Ffiles/paper/2021/file/8f121ce07d74717e0b1f21d122e04521-Paper-round2.pdf}, editor = {Vanschoren, J. and Yeung, S.}, }
- PNS-GAN: Conditional Generation of Peripheral Nerve Signals in the Wavelet Domain via Adversarial NetworksOlivier Tessier-Larivière, Luke Y. Prince, Pascal Fortier-Poisson, Lorenz Wernisch, Oliver Armitage, and 3 more authorsIn 2021 10th International IEEE/EMBS Conference on Neural Engineering (NER), May 2021
Simulated datasets of neural recordings are a crucial tool in neural engineering for testing the ability of decoding algorithms to recover known ground-truth. In this work, we introduce PNS-GAN, a generative adversarial network capable of producing realistic nerve recordings conditioned on physiological biomarkers. PNS-GAN operates in the wavelet domain to preserve both the timing and frequency of neural events with high resolution. PNS-GAN generates sequences of scaleograms from noise using a recurrent neural network and 2D transposed convolution layers. PNS-GAN discriminates over stacks of scaleograms with a network of 3D convolution layers. We find that our generated signal reproduces a number of characteristics of the real signal, including similarity in a canonical time-series feature-space, and contains physiologically related neural events including respiration modulation and similar distributions of afferent and efferent signalling.
@inproceedings{tessier-larivière2021pns-gan, title = {PNS-GAN: Conditional Generation of Peripheral Nerve Signals in the Wavelet Domain via Adversarial Networks}, author = {Tessier-Larivi\`{e}re, Olivier and Prince, Luke Y. and Fortier-Poisson, Pascal and Wernisch, Lorenz and Armitage, Oliver and Hewage, Emil and Lajoie, Guillaume and Richards, Blake A.}, year = {2021}, month = may, booktitle = {2021 10th International IEEE/EMBS Conference on Neural Engineering (NER)}, pages = {778--782}, doi = {10.1109/ner49283.2021.9441284}, issn = {1948-3554}, keywords = {Wavelet domain;Three-dimensional displays;Recurrent neural networks;Convolution;Neural engineering;Tools;Physiology}, }
- Implicit Regularization via Neural Feature AlignmentAristide Baratin, Thomas George, César Laurent, R Devon Hjelm, Guillaume Lajoie, and 2 more authorsIn Proceedings of The 24th International Conference on Artificial Intelligence and Statistics, 13–15 apr 2021
We approach the problem of implicit regularization in deep learning from a geometrical viewpoint. We highlight a regularization effect induced by a dynamical alignment of the neural tangent features introduced by Jacot et al. (2018), along a small number of task-relevant directions. This can be interpreted as a combined mechanism of feature selection and compression. By extrapolating a new analysis of Rademacher complexity bounds for linear models, we motivate and study a heuristic complexity measure that captures this phenomenon, in terms of sequences of tangent kernel classes along optimization paths. The code for our experiments is available at https://github.com/tfjgeorge/ntk_alignment.
@inproceedings{baratin2021implicit, title = {Implicit Regularization via Neural Feature Alignment}, author = {Baratin, Aristide and George, Thomas and Laurent, C{\'e}sar and Devon Hjelm, R and Lajoie, Guillaume and Vincent, Pascal and Lacoste-Julien, Simon}, year = {2021}, month = {13--15 Apr}, booktitle = {Proceedings of The 24th International Conference on Artificial Intelligence and Statistics}, publisher = {Pmlr}, series = {Proceedings of Machine Learning Research}, volume = {130}, pages = {2269--2277}, url = {https://proceedings.mlr.press/v130/baratin21a.html}, editor = {Banerjee, Arindam and Fukumizu, Kenji}, }
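A hedged illustration of the kind of quantity involved: centered kernel alignment between a Gram matrix and the label kernel yy^T. Here an ordinary feature Gram matrix stands in for the tangent kernel studied in the paper.

```python
import numpy as np

def centered_alignment(K1, K2):
    """Centered kernel alignment <K1c, K2c>_F / (||K1c||_F ||K2c||_F)."""
    n = len(K1)
    H = np.eye(n) - np.ones((n, n)) / n          # centering matrix
    K1c, K2c = H @ K1 @ H, H @ K2 @ H
    return np.sum(K1c * K2c) / (np.linalg.norm(K1c) * np.linalg.norm(K2c))

# Toy check: alignment between a feature Gram matrix and the label kernel yy^T.
rng = np.random.default_rng(0)
y = np.sign(rng.normal(size=40))
features = np.outer(y, rng.normal(size=10)) + 0.5 * rng.normal(size=(40, 10))
K_features = features @ features.T               # stands in for a tangent kernel
K_labels = np.outer(y, y)
print(centered_alignment(K_features, K_labels))
```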
- Predictive learning as a network mechanism for extracting low-dimensional latent space representationsStefano Recanatesi, Matthew Farrell, Guillaume Lajoie, Sophie Deneve, Mattia Rigotti, and 1 more authorNature Communications, Mar 2021
@article{recanatesi2021predictive, title = {Predictive learning as a network mechanism for extracting low-dimensional latent space representations}, author = {Recanatesi, Stefano and Farrell, Matthew and Lajoie, Guillaume and Deneve, Sophie and Rigotti, Mattia and Shea-Brown, Eric}, year = {2021}, month = mar, journal = {Nature Communications}, publisher = {Springer Science and Business Media LLC}, volume = {12}, number = {1}, doi = {10.1038/s41467-021-21696-1}, issn = {2041-1723}, url = {http://dx.doi.org/10.1038/s41467-021-21696-1}, }
2020
- Learning to Combine Top-Down and Bottom-Up Signals in Recurrent Neural Networks with Attention over ModulesSarthak Mittal, Alex Lamb, Anirudh Goyal, Vikram Voleti, Murray Shanahan, and 3 more authorsIn Proceedings of the 37th International Conference on Machine Learning, 13–18 jul 2020
Robust perception relies on both bottom-up and top-down signals. Bottom-up signals consist of what’s directly observed through sensation. Top-down signals consist of beliefs and expectations based on past experience and the current reportable short-term memory, such as how the phrase ‘peanut butter and ...’ will be completed. The optimal combination of bottom-up and top-down information remains an open question, but the manner of combination must be dynamic and both context and task dependent. To effectively utilize the wealth of potential top-down information available, and to prevent the cacophony of intermixed signals in a bidirectional architecture, mechanisms are needed to restrict information flow. We explore deep recurrent neural net architectures in which bottom-up and top-down signals are dynamically combined using attention. Modularity of the architecture further restricts the sharing and communication of information. Together, attention and modularity direct information flow, which leads to reliable performance improvements in perceptual and language tasks, and in particular improves robustness to distractions and noisy data. We demonstrate on a variety of benchmarks in language modeling, sequential image classification, video prediction and reinforcement learning that the bidirectional information flow can improve results over strong baselines.
@inproceedings{mittal2020learning, title = {Learning to Combine Top-Down and Bottom-Up Signals in Recurrent Neural Networks with Attention over Modules}, author = {Mittal, Sarthak and Lamb, Alex and Goyal, Anirudh and Voleti, Vikram and Shanahan, Murray and Lajoie, Guillaume and Mozer, Michael and Bengio, Yoshua}, year = {2020}, month = {13--18 Jul}, booktitle = {Proceedings of the 37th International Conference on Machine Learning}, publisher = {Pmlr}, series = {Proceedings of Machine Learning Research}, volume = {119}, pages = {6972--6986}, url = {https://proceedings.mlr.press/v119/mittal20a.html}, editor = {III, Hal Daum\'{e} and Singh, Aarti}, }
- Advantages of biologically-inspired adaptive neural activation in RNNs during learningVictor Geadah, Giancarlo Kerg, Stefan Horoi, Guy Wolf, and Guillaume Lajoie13–18 jul 2020
@misc{geadah2020advantages, title = {Advantages of biologically-inspired adaptive neural activation in RNNs during learning}, author = {Geadah, Victor and Kerg, Giancarlo and Horoi, Stefan and Wolf, Guy and Lajoie, Guillaume}, year = {2020}, url = {https://arxiv.org/abs/2006.12253}, eprint = {2006.12253}, archiveprefix = {arXiv}, primaryclass = {cs.LG}, }
- Hierarchical Bayesian Optimization of Spatiotemporal Neurostimulations for Targeted Motor OutputsSamuel Laferrière, Marco Bonizzato, Sandrine L. Côté, Numa Dancause, and Guillaume LajoieIEEE Transactions on Neural Systems and Rehabilitation Engineering, Jun 2020
The development of neurostimulation techniques to evoke motor patterns is an active area of research. It serves as a crucial experimental tool to probe computation in neural circuits, and has applications in neuroprostheses used to aid recovery of motor function after stroke or injury to the nervous system. There are two important challenges when designing algorithms to unveil and control neurostimulation-to-motor correspondences, thereby linking spatiotemporal patterns of neural stimulation to muscle activation: (1) the exploration of motor maps needs to be fast and efficient (exhaustive search is to be avoided for clinical and experimental reasons), and (2) online learning needs to be flexible enough to deal with noise and occasional spurious responses. We propose a stimulation search algorithm to address these issues, and demonstrate its efficacy with experiments in the motor cortex (M1) of a non-human primate model. Our solution is a novel iterative process using Bayesian Optimization via Gaussian Processes on a hierarchy of increasingly complex signal spaces. We show that our algorithm can successfully and rapidly learn correspondences between complex stimulation patterns and evoked muscle activation patterns, where standard approaches fail. Importantly, we uncover nonlinear circuit-level computations in M1 that would have been difficult to identify using conventional mapping techniques.
@article{laferriere2020hierarchical, title = {Hierarchical Bayesian Optimization of Spatiotemporal Neurostimulations for Targeted Motor Outputs}, author = {Laferri\`{e}re, Samuel and Bonizzato, Marco and C\^{o}t\'{e}, Sandrine L. and Dancause, Numa and Lajoie, Guillaume}, year = {2020}, month = jun, journal = {IEEE Transactions on Neural Systems and Rehabilitation Engineering}, volume = {28}, number = {6}, pages = {1452--1460}, doi = {10.1109/tnsre.2020.2987001}, issn = {1558-0210}, keywords = {Electromyography;Muscles;Optimization;Electrodes;Bayes methods;Gaussian processes;Spatiotemporal phenomena;Neural engineering;machine learning algorithms;optimization methods;motor cortex (M1);neural stimulation}, }
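The algorithm above searches a hierarchy of increasingly complex stimulation spaces; its innermost ingredient is Gaussian-process Bayesian optimization. A self-contained numpy sketch of a plain GP-UCB loop on a hypothetical 1-D stimulation parameter (the response function, kernel, and constants are all illustrative assumptions, not the paper's hierarchical method):

```python
import numpy as np

rng = np.random.default_rng(0)

def rbf(A, B, length=0.15):
    """Squared-exponential kernel between row vectors in A and B."""
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / length**2)

def gp_posterior(X_obs, y_obs, X_query, noise=0.05):
    """Standard GP regression posterior mean and variance with an RBF kernel."""
    K = rbf(X_obs, X_obs) + noise**2 * np.eye(len(X_obs))
    Ks = rbf(X_query, X_obs)
    mean = Ks @ np.linalg.solve(K, y_obs)
    var = 1.0 - np.sum(Ks * np.linalg.solve(K, Ks.T).T, axis=1)   # prior variance is 1
    return mean, np.maximum(var, 1e-12)

def response(p):
    """Hypothetical noisy stimulation-to-EMG map over a single parameter in [0, 1]."""
    return np.exp(-30.0 * (p - 0.63) ** 2) + 0.05 * rng.normal()

candidates = np.linspace(0.0, 1.0, 200)[:, None]
X_obs = rng.random((3, 1))                              # a few initial stimulations
y_obs = np.array([response(p) for p in X_obs[:, 0]])

for _ in range(15):                                     # GP-UCB acquisition loop
    mean, var = gp_posterior(X_obs, y_obs, candidates)
    i_next = int(np.argmax(mean + 2.0 * np.sqrt(var)))  # upper confidence bound
    X_obs = np.vstack([X_obs, candidates[i_next]])
    y_obs = np.append(y_obs, response(candidates[i_next, 0]))

print("best stimulation parameter found:", X_obs[int(np.argmax(y_obs)), 0])
```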
- Internal representation dynamics and geometry in recurrent neural networksStefan Horoi, Guillaume Lajoie, and Guy WolfJun 2020
@misc{horoi2020internal, title = {Internal representation dynamics and geometry in recurrent neural networks}, author = {Horoi, Stefan and Lajoie, Guillaume and Wolf, Guy}, year = {2020}, url = {https://arxiv.org/abs/2001.03255}, eprint = {2001.03255}, archiveprefix = {arXiv}, primaryclass = {cs.LG}, }
- Untangling tradeoffs between recurrence and self-attention in artificial neural networksGiancarlo Kerg, Bhargav Kanuparthi, Anirudh Goyal ALIAS PARTH GOYAL, Kyle Goyette, Yoshua Bengio, and 1 more authorIn Advances in Neural Information Processing Systems, Jun 2020
@inproceedings{kerg2020untangling, title = {Untangling tradeoffs between recurrence and self-attention in artificial neural networks}, author = {Kerg, Giancarlo and Kanuparthi, Bhargav and ALIAS PARTH GOYAL, Anirudh Goyal and Goyette, Kyle and Bengio, Yoshua and Lajoie, Guillaume}, year = {2020}, booktitle = {Advances in Neural Information Processing Systems}, publisher = {Curran Associates, Inc.}, volume = {33}, pages = {19443--19454}, url = {https://proceedings.neurips.cc/paper\%5Ffiles/paper/2020/file/e2065cb56f5533494522c46a72f1dfb0-Paper.pdf}, editor = {Larochelle, H. and Ranzato, M. and Hadsell, R. and Balcan, M.F. and Lin, H.}, }
- Low-Dimensional Dynamics of Encoding and Learning in Recurrent Neural NetworksStefan Horoi, Victor Geadah, Guy Wolf, and Guillaume LajoieIn Advances in Artificial Intelligence, Jun 2020
In this paper, we use dimensionality reduction techniques to study how a recurrent neural network (RNN) processes and encodes information in the context of a classification task, and we explain our findings using tools from dynamical systems theory. We observe that internal representations develop a task-relevant structure as soon as significant information is provided as input and this structure remains for some time even if we let the dynamics drift. However, the structure is only interpretable by the final classifying layer at the fixed time step for which the network was trained. We measure that throughout the training, the recurrent weights matrix is modified so that the resulting dynamical system associated with the network’s neural activations evolves into a non-trivial attractor, reminiscent of neural oscillations in the brain. Our findings suggest that RNNs change their internal dynamics throughout training so that information is stored in low-dimensional cycles, rather than in high-dimensional clusters.
@inproceedings{horoi2020low-dimensional, title = {Low-Dimensional Dynamics of Encoding and Learning in Recurrent Neural Networks}, author = {Horoi, Stefan and Geadah, Victor and Wolf, Guy and Lajoie, Guillaume}, year = {2020}, booktitle = {Advances in Artificial Intelligence}, publisher = {Springer International Publishing}, address = {Cham}, pages = {276--282}, isbn = {978-3-030-47358-7}, editor = {Goutte, Cyril and Zhu, Xiaodan}, }
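A generic version of the analysis described above, projecting RNN hidden-state trajectories onto their leading principal components. The network here is small, untrained, and randomly driven, purely to show the mechanics; the paper additionally trains the network on a classification task and tracks how this low-dimensional structure evolves.

```python
import numpy as np

rng = np.random.default_rng(0)

# Roll out a small tanh RNN on a batch of input sequences and
# project its hidden states onto their top principal components.
n_hidden, n_in, T, batch = 64, 3, 100, 20
W = rng.normal(size=(n_hidden, n_hidden)) / np.sqrt(n_hidden)
U = rng.normal(size=(n_hidden, n_in)) / np.sqrt(n_in)
x = rng.normal(size=(batch, T, n_in))

states = []
h = np.zeros((batch, n_hidden))
for t in range(T):
    h = np.tanh(h @ W.T + x[:, t] @ U.T)
    states.append(h.copy())
H = np.concatenate(states)                  # (T*batch, n_hidden) hidden trajectories

Hc = H - H.mean(axis=0)
_, s, Vt = np.linalg.svd(Hc, full_matrices=False)
explained = s**2 / np.sum(s**2)
print("variance explained by top 3 PCs:", np.round(explained[:3], 3))
low_dim = Hc @ Vt[:3].T                     # trajectories in the top-3 PC space
```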
2019
- Dimensionality compression and expansion in Deep Neural NetworksStefano Recanatesi, Matthew Farrell, Madhu Advani, Timothy Moore, Guillaume Lajoie, and 1 more authorJun 2019
@misc{recanatesi2019dimensionality, title = {Dimensionality compression and expansion in Deep Neural Networks}, author = {Recanatesi, Stefano and Farrell, Matthew and Advani, Madhu and Moore, Timothy and Lajoie, Guillaume and Shea-Brown, Eric}, year = {2019}, url = {https://arxiv.org/abs/1906.00443}, eprint = {1906.00443}, archiveprefix = {arXiv}, primaryclass = {cs.LG}, }
- Cortical network mechanisms of anodal and cathodal transcranial direct current stimulation in awake primatesAndrew R. Bogaard, Guillaume Lajoie, Hayley Boyd, Andrew Morse, Stavros Zanos, and 1 more authorbioRxiv, Jun 2019
Transcranial direct current stimulation (tDCS) is a non-invasive neuromodulation technique that is widely used to stimulate the sensorimotor cortex, and yet the mechanism by which it influences the natural activity of cortical networks is still under debate. Here, we characterize the effects of anodal and cathodal tDCS on underlying neurons in active macaque sensorimotor cortex across a range of doses. We find changes in spike rates that are sensitive to both current intensity and polarity, behavioral state, and that are cell-type specific. At high currents, effects persist after the offset of stimulation, and the spatiotemporal activity associated with motor activity of the contralateral limb, measured by dynamics of neural ensembles, are altered. These data suggest that tDCS induces reproducible and noticeable changes in cortical neuron activity and support the theory that it affects brain activity through a combination of single neuron polarization and network interactions.
@article{bogaard2019cortical, title = {Cortical network mechanisms of anodal and cathodal transcranial direct current stimulation in awake primates}, author = {Bogaard, Andrew R. and Lajoie, Guillaume and Boyd, Hayley and Morse, Andrew and Zanos, Stavros and Fetz, Eberhard E.}, year = {2019}, journal = {bioRxiv}, publisher = {Cold Spring Harbor Laboratory}, doi = {10.1101/516260}, url = {https://www.biorxiv.org/content/early/2019/07/08/516260}, elocation-id = {516260}, eprint = {https://www.biorxiv.org/content/early/2019/07/08/516260.full.pdf}, }
- Non-normal Recurrent Neural Network (nnRNN): learning long time dependencies while improving expressivity with transient dynamicsGiancarlo Kerg, Kyle Goyette, Maximilian Puelma Touzel, Gauthier Gidel, Eugene Vorontsov, and 2 more authorsIn Advances in Neural Information Processing Systems, Jun 2019
@inproceedings{kerg2019non-normal, title = {Non-normal Recurrent Neural Network (nnRNN): learning long time dependencies while improving expressivity with transient dynamics}, author = {Kerg, Giancarlo and Goyette, Kyle and Puelma Touzel, Maximilian and Gidel, Gauthier and Vorontsov, Eugene and Bengio, Yoshua and Lajoie, Guillaume}, year = {2019}, booktitle = {Advances in Neural Information Processing Systems}, publisher = {Curran Associates, Inc.}, volume = {32}, url = {https://proceedings.neurips.cc/paper\%5Ffiles/paper/2019/file/9d7099d87947faa8d07a272dd6954b80-Paper.pdf}, editor = {Wallach, H. and Larochelle, H. and Beygelzimer, A. and d\textquotesingle Alch\'{e}-Buc, F. and Fox, E. and Garnett, R.}, }
For older publications, visit Google Scholar.