Research Interests

Generally, my interests lie in the area of data analysis; using mathematical, statistical and computational techniques to learn as much as possible from a given set of data. Neuroscience, an exciting frontier of modern science, is an ideal area to explore these interests, and provides the opportunity to provide new understanding to the fundamental problem of how our brains work. Experimental advances in modern neuroscience provide a rich source of data sets, which can often be large (or worse, small!), noisy and difficult to collect. This provides a great challenge to support the work of my experimental colleagues by extracting the maximum value from the data they obtain.

Specifically my work to date has primarily consisted of applying tools from information theory to the problem of neural coding; that is how neurons represent information about the outside world. Information theory has a number of advantages for this; it is a measure of dependence between variables that is sensitive to both linear and non-linear effects, is non-parametric, placing no assumptions on the underlying system of study and provides quantitative results on a scale that can be meaningfully compared between different systems. It can be used to evaluate the timing precision of spikes, evaluate different candidate codes (for example spike rate vs temporal spike pattern; population pooled spike count vs labelled line population code), as well as to quantify the different effects of interactions between variables in a system with multivariate input or output (for example, interactions between spiking neurons). I am also interested in applying a wider variety of techniques from machine learning to these large scale problems of neurological data analysis in general, and in particular the issue of neural coding.

Technically, I am interested in practical techniques for improving calculation or estimation of information theoretic quantities both in terms of improving statistical properties, for example by correcting for the limited sampling bias and in terms of improving computational performance, which is important since many analyses require Monte Carlo style controls which involve repeating information calculations many times on shuffled or modelled data and can be a significant bottleneck. I am also interested in using numerical optimisation techniques to maximise information theoretic measures such as entropy and mutual information. I believe the properties of these quantities as intuitively appealing measures of uncertainty and dependence respectively suggest their value as objective functions for optimising models or other types of analysis. This approach as already been used successfully as a way to map neuronal receptive fields with correlated natural stimuli (Sharpee et al. 2002), is a foundation of the ID3 and C4.5 algorithms for fitting decision trees, and has been used to fit hidden Markov model predictors for discrete random sequences (Shalizi & Shalizi, 2004). During my PhD I developed an algorithm for investigating interactions of different orders in a system by efficiently finding probability distributions that maximize entropy subject to marginal constraints (included in pyEntropy) and I am keen to explore this approach of applying brute force optimisation to information theoretical objectives more in the future.

Current Projects

Information theoretic analysis tools

A major difficulty when estimating information theoretic quantities from experimental data is the problem of bias, a systematic error caused by limited sampling. While many techniques have been developed to correct for this effect, implementing them can be an involved task. I have developed pyEntropy, an open source Python library which implements a range of bias corrections. Having these tools freely available is important to encourage wider use of the techniques, and easily allow people to try a range of corrections or measures on their data (or on simulated data with similar statistical properties). Additionally, this library implements the information breakdown technique (Pola et al. 2003) which quantifies the effect of different types of interaction between the multivariate outputs (or inputs) of a system, as well as a tool for computing maximum entropy solutions subject to marginal equality constraints which can be used to obtain further details of the effect of interactions of different orders.

I am currently working on the statistical interpretation of mutual information as a test of independence, characterising the distribution under the null hypothesis and investigating the effects of temporal dependence in signals. I am also looking at applications of conditional mutual information in neural data analysis, for example when dealing with correlated stimuli, or to provide a form of robust group inference.

Information theoretic analysis of fMRI data

There are two fundamental approaches to the statistical detection of activated regions in fMRI imaging experiments; by using external stimulus information to find regions where activity correlates with stimulus changes (e.g. the General Linear Model [GLM]), or by purely data driven approaches which attempt to extract commonly activated areas from the data without any external information (e.g. Independent Component Analysis [ICA]). I am developing information theoretic techniques for both of these approaches.

In the first case, one can view mutual information between a stimulus condition and the BOLD response as the effect size for a statistical test of independence. If there is a significant deviation from independence, then the response is modulated by the stimulus and hence that region can be considered activated during the task. In some ways this is more general than traditional statistical comparisons such as the t-test; for example if the mean of the response did not change, but the variance did, mutual information would be sensitive to such an effect. P-values can be obtained directly from the information value, which allows direct comparison with results from other methods, such as the GLM. The advantage is that no assumptions are made about the response (for example normality), the mechanism of activation (no requirement for a HRF to be specified) or the linearity of the effect. A disadvantage is that if responses to different conditions overlap in time it is difficult to tease out the effect of the different conditions (the beauty of the GLM approach is that it allows such separation). I am currently working to develop the best way to apply these techniques to different block or event related designs, how to account for temporal dependence in the statistical inference and how the information approach can be extended to group analysis as well as producing high performance code to implement the analysis.

To address the second case, where stimulus information is not available, I am investigating graph-theoretic clustering methods together with information based dependency measures between individual voxels to obtain, in a purely data-driven way, areas that are activated together.


Google Scholar Citations

Publications

2011

  • RAA Ince*, A Mazzoni*, A Bartels, NK Logothetis and S Panzeri
    A novel test to determine the significance of neural selectivity to single and multiple potentially correlated features
    Journal of Neuroscience Methods (in press)
  • S Panzeri and RAA Ince (2011) Information theoretic approaches to the analysis of neural population recordings
    in N Kriegeskorte and G Kreiman (Eds) Visual population codes: toward a common multivariate framework for cell recording and functional imaging, MIT Press
    [ LINK (amazon) ]

2010

  • RAA Ince, R Senatore, E Arabzadeh, F Montani, ME Diamond and S Panzeri
    Information theoretic methods for studying population codes
    Neural Networks (2010) 23:6 p. 713-727
    [ LINK (Subscription Required) | PDF ]
  • RAA Ince, A Mazzoni, R Petersen and S Panzeri
    Open source tools for the information theoretic analysis of neural data
    Frontiers in Neuroscience (2010) 4:1 p. 62-70
    [ LINK (Open Access) ]

2009

  • RAA Ince, F Montani, E Arabzadeh, ME Diamond and S Panzeri
    On the presence of high-order interactions in somatosensory cortex and their effect on information transmission
    Journal of Physics: Conference Series (2009) 197 012013 (1pp)
    [ LINK (Open Access) ]
  • RAA Ince, C Bartolozzi and S Panzeri
    An information theoretic library for analysis of neural codes
    The Neuromorphic Engineer (2009) doi: 10.2417/1200906.1663
    [ LINK (Open Access) ]
  • F Montani*, RAA Ince*, R Senatore, E Arabzadeh, ME Diamond and S Panzeri
    The impact of high-order interactions on the rate of synchronous discharge and information transmission in somatosensory cortex
    Philosophical Transactions of the Royal Society A (2009) 367:1901 p. 3297-3310
    [ LINK (Subscription Required) | PDF ]
  • RAA Ince, RS Petersen, DC Swan and S Panzeri
    Python for information theoretic analysis of neural data
    Frontiers in Neuroinformatics (2009) 3:4.
    [ LINK (Open Access) ]

* - These authors contributed equally to this work.