Nick Ball - Research - Intro

This is a short nontechnical introduction to my (then) work that I made during my PhD research at Sussex.

Galaxies

The number of parameters describing a galaxy can vary from one to infinity. It is desirable to have as much information as possible about the physical processes involved in the least number of parameters. The smallest number here is of course one, and so the notion of a galaxy `type' is useful. The neural networks are a way of mapping in a general nonlinear way the parameters (e.g. the standard outputs from the Sloan Survey) onto a type.

Measures of type include:

Hubble Tuning Fork  

Image from Gene Smith's Astonomy Tutorial



Large Scale Structure

My original motivation for studying this area of astronomy was not neural networks but cosmology and the large scale structure of the universe. In particular the book `Large Scale Structures in the Universe' by Anthony Fairall, purchased at the Federation of Astronomical Societies meeting in Cambridge in 1996 was influential. The tricky part is finding something original to do in the area which is not already being done by other members of the Sloan Survey collaboration. Thus looking at galaxy types and large scale structure, which has been done but can be done in unprecendented detail with the Sloan data, and the use of a method for getting large samples of typed galaxies, sounded interesting. The neural nets are simply one way of doing this.


2dF redshift coneplot

Typical `cone plot' of galaxy position versus redshift. This is from the 2dF Galaxy Redshift Survey rather than the Sloan : ). Note that every galaxy is treated in the same way, as a dot.


The goal of the studies are not so much detailed maps of the local universe for their own sake, but the statistics, which give information about cosmology, structure and galaxy formation. An important factor in determining what can be done is whether we have the distance to a galaxy. For most galaxies (except those nearby), this requires a redshift, which requires a spectrum. These take more effort to obtain than images - the Sloan survey will acquire about one million galaxy spectra, but about 50 million galaxy images, and these to a much fainter magnitude. Statistics with the redshifts have the advantage of 3D information, and those with images have greater numbers of galaxies.

Neural Networks

Although originally motivated by models of biological neurons, the artificial neural networks here are simply a nonlinear statistical method. The idea is that a set of parameters is mapped onto an output or outputs by adjusting the weights between the interconnnected neurons by the use of a training algorithm. The networks used here are in the common form of one dimensional layers of neurons, each neuron in each layer being connected to each neuron in adjacent layers but no others. They are known as `multilayer perceptrons', although the neurons in fact represent a nonlinear operation on the sum of their inputs rather than a threshold, which is what a perceptron is.


Schematic neural network

Schematic artificial neural network (Storrie-Lombardi et al, MNRAS 259 8P)

The most commonly used training algorithm in astronomy has been `backpropagation', where the weights are adjusted according to an algorithm which is propagated back through the network. Here a faster algorithm is used, the Levenberg-Marquardt. In networks with up to a few hundred weights this is thought to be the fastest to converge to a minimum in the `error space'. This is the space of values of the `cost function', a measure of the difference between the network output and the desired type, which is given by a training set in which parameters corresponding to known types are presented in turn to the network. The network is then able to assign types to further sets of parameters of which the training set was representative. The plots in the results pages show how the networks here manage at doing this for target types for galaxies, by plotting the network type against the target type for galaxies which have a target type, but which the network has not seen - a test set.


Last modified: Mon Dec 20 17:18:05 CST 2004

Research
Home