Skip to Content
[top-nav]

Research Projects


Model Selection
Large-Scale 3D Vision
Sensor Data Fusion in Brake-by-Wire Technology
Robotic Aircraft
Optical Flow Calculation
Motion Segmentation
Range Segmentation

Model Selection

Problems in Model Selection for Range Segmentation
The available information theoretic model selection criteria are based on the assumptions that noise is very small and the data size is large enough. However, in visual data segmentation algorithms, one usually deals with noisy regions of data, some of which are small localised regions that are to be extended later. As a result, neither of the above assumptions is usually valid.

The other crucial problem is that almost all the information-theoretic model selection criteria measure the complexity of a model only by its number of parameters1. Hence, if there are different models with the same number of parameters (i.e. same complexity), there will be some difficulty in selecting between those models. In fact, the available model selection information-theoretic model selection criteria assume the set of candidate models to be nested, whereas, in many applications, such as range segmentation of curved objects, the set of candidate models may consist of non-nested models.

Finally, all the available model selection methods have their roots in statistics, the physical and geometrical characteristics of the problem have been ignored.

As a result none of the existing model selection criteria is capable of identifying the true underlying model for visual data in motion segmentation and range segmentation applications.

Surface Selection Criterion
To overcome the above problems in model selection, we proposes to view the sum of bending and twisting energies of a surface as a measure of the surfaces roughness and the sum of squared residuals as a measure of fidelity to the true data. There should be an acceptable compromise between these two factors. To formulate the proposed Surface model Selection Criterion (SSC), different points of a surface are viewed as hypothetical springs constraining the surface as shown in the below figure.

If the surface has little stiffness, then the surface bends and twists to meet the points. Hence, the surface fits itself to noise and the sum of squared residuals between the range measurements and their associated points on the surface will be small. This bending and twisting in turn increases the amount of strain energy accumulated by the surface.

Representation a malleable surface supported by hypothetical springs. The range measurements are shown by black circles.

Representation a malleable surface supported by hypothetical springs. The range measurements are shown by black circles.

As one may expect, increasing the number of parameters of a surface leads to larger bending and twisting energies as the surface has more degrees of freedom and consequently the surface can be fitted to the data by bending and twisting itself so that a closer fit to the measured data results. This can be inferred from the bending energy formula Equation 1). However, the higher the number of parameters for a surface model, the less the sum of squared residuals. For instance, in the extreme case, if the number of parameters is equal to the number of data points (which are used in the fitting process), then the sum of squared residuals will be zero whereas its sum of energies will be maximised.

If a plate is bent by a uniformly distributed bending moment so that the xy and yz planes are the principal planes of the deflected surface, then the strain energy (for bending and twisting) of the plate can be expressed as:

(1)

where D is the flexural rigidity of the surface and ν is Poisson’s ratio (ν should be very small because in real world-objects the twisting energy in comparison with the bending energy is small). In all the experiments reported in this thesis, n is assumed to be 0.01. As our experiments show, the performance of SSC is not sensitive to the small variation of n . The strain energy is computed as a measure of complexity of the model.

In order to scale the strain energy, it has been divided by the strain energy of the model with the highest number of parameters (Emax). Therefore, D has been eliminated from the following formulations.

To establish the trade-off between the sum of squared residuals and the strain energy EBending+Twist, a function SSC is defined such that:

where δ is the scale of noise for the highest surface (the surface with the highest number of parameters). The reason for using the scale of noise of the highest surface (as explained by Kanatani [1]) is that the scale of noise for the correct model and the scale of noise of the higher order models (higher than the correct model) must be close for the fitting to be meaningful. Therefore, it is the best estimation of the true scale of noise, which is available at this stage.

An accurate estimate of the scale of noise δcan be computed by where N is the number of data points and P is the number of parameters of the highest surface. Use of this formula for the scale of noise can be justified by the fact that if the model is correct, then is subjected to a χ<2 distribution with N-P degrees of freedom. The energy term has been multiplied by the number of parameters P in order to penalise the choice of a higher order (than necessary) model. Such a simple measure produces good discrimination and improves the accuracy of the model selection criterion.

Having devised a reasonable compromise between fidelity to data and the complexity of the model, the model selection task is then reduced to choosing the surface that has the minimum value of SSC.

top

Large-Scale 3D Vision

This research project is intended to provide the necessary resources for comprehensive large‑scale three‑dimensional mapping of man‑made environments. Achieving a complete 3‑D digital representation of the surrounding environment has, over the years, received substantial attention due to the plentifulness of its applications.

To achieve the goal of large-scale 3-D environmental mapping, a 3-D range scanner has been developed. The equipment is capable of producing range maps of far away objects (up to 300 meters) with relatively wide field-of-view. More importantly, range and intensity images produced by this equipment can be fused to have a complete set of (x, y, z, R, G, B) sextuplets of the scene.

The range scanner system is based upon a distance meter developed by MDL and a programmable pan-tilt unit manufactured by Directed Perception. The rangefinder is capable of producing distance measurement of 7-300 meters with repetition rate of 1000 Hz. The typical accuracy of the machine is 300 mm, which is 0.1% of the maximum measured distance.

In this project we mainly focus on development of an accurate and calibrated measurement device, which its measurements are sufficiently accurate for a wide range of scientific endeavours such as volumetric and surface recording of historical sites, virtual reality in environment building and modelling and architectural reproduction.

Following figures show an example of our experimental result. Range image, taken by range scanner device (left) and intensity image (middle), captured by video camera, has fused (right) to present a rich data set of the scene.

Range image taken by range scanner device intensity image captured by video camera


top

Sensor Data Fusion in Brake-by-Wire Technology

Brake-by-wire is a frontier technology that will allow many braking functions to switch to electronic actuation. It will lead to more effective and safe braking systems, elimination of hydraulic technology, release of space and reduction of maintenance. By the end of this decade, most international car manufacturers are envisaged to have adopted brake-by-wire systems across their range of passenger cars. A large variety of sensors are utilised in an Electro-Mechanical Braking (EMB) system and therefore their consistent operation is vital for the functionality of such system. To achieve a high level of coherency amongst such a large collection of sensors (mandated by the safety requirement of a brake system), the use of sophisticated data fusion techniques is unavoidable. Data fusion theory is generally considered the most appropriate tool to generate reliable and accurate information by virtue of combining all the available data. During the last two decades, theory of data fusion has been broadly developed and applied in many applications. Data fusion is generally defined as “synergistic combination of the information, provided by multiple sensory devices or multiple sources, to assist in the accomplishment of a task by a multi-sensor system”. Generally speaking, data fusion techniques are required in an EMB system for many important tasks that can be classified into the following three categories:

  • Effective Utilisation of Redundant Information

An EMB, by nature, is a safety critical system and therefore fault tolerance is a vitally important characteristic of this system. As a result, EMB systems are designed in such way that many of its essential information would be derived from a variety of sources (sensors) and be handled by more than the bare necessity hardware. Three main types of redundancy usually exist in an EMB system:

- Redundant sensors in safety-critical components such as the brake pedal

- Redundant copies of some signals that are of particular safety importance such as displacement and force measurements of the brake pedal

- Redundant hardware to perform important processing tasks such as central controllers

In order to utilise the existing redundancy, voting algorithms and probability‑based fusion techniques need to be evaluated, modified and adopted to meet the stringent requirements of an EMB system. Reliability, fault tolerance and accuracy are the main targeted outcomes of the fusion techniques that are developed especially for redundancy resolution inside an EMB system.

  • Estimation and Identification

In a typical EMB system, there are three categories of estimation and identification tasks that challenge the existing sensor data fusion techniques. Those categories are:

- Accurate and reliable estimation of environmental parameters (e.g. friction coefficient between the tyre and the road) by integration of multiple sensors or sources of data

-Parametric and non-parametric identification of unknown dynamics in the system (e.g. calliper dynamics)

-Estimation of system states or variables by fusing multi-source information (e.g. vehicle state estimation, estimation of the zero crossing point on the actuator characteristic curve and clamp force estimation).

  • Control

The central controllers in the system are mainly responsible for high level braking functions such as four-wheel proportioning, automatic hill hold, antilock brake control, traction control, vehicle stability control and so on. Each of these control tasks must be accomplished based on the data received from multiple sensors and sources. Hence, robust and accurate multisensor fusion methods are required to be developed if high level control tasks of the EMB system are to be successfully executed.

We have been working on different fusion problems that have arisen during the design and testing of an EMB system. A new voting algorithm for fusion of the brake pedal sensors, a novel predictive filter for compensation of the missing samples of sensor data (filed as an Australian patent), and a new angle tracking observer for estimation of the position and speed of brushless DC motors based on a resolver sinusoidal signals (filed as another Australian patent) have been developed sofar.

top

Robotic Aircraft

The main goal of this project is to develop reliable computer vision techniques to be utilised in the control of low altitude manoeuvres for autonomous small-scale aircraft. In particular, we intend to focus our attention on recovering the relative altitude of an aircraft by using the terrain texture seen from an on-board camera. We aim to develop techniques that would work reliably on almost any terrain and weather condition. We are also committed to provide evaluation techniques capable of verifying our estimates (as they are generated) and associate a probabilistic reliability measure to the sensor output.

Click here to watch a clip which is taken by the aircraft.

The model plane the engine mounting

The model plane

The Engine Mounting

the camera support  

The Camera Support

 

top

Optical Flow Calculation

Differential optic flow techniques try to relate local changes in image intensity (expressed as spatial and temporal derivatives of the image brightness function) to the optic flow. Differential techniques usually perform faster than image matching or phase based techniques and lead to a simple set of linear equations. The fundamental assumption behind all the optic flow techniques is the fact that the brightness intensity function of a moving object remains approximately constant at least for short duration of time Optical Flow is the motion of objects on the image plane as objects (of the camera) moves. This has applications in robotics and in video coding technology. Of particular recent interest, is the study of motion segmentation - how to break an image up into the parts corresponding to objects undergoing different motions. A significant factor in this research, of late, is the use of Robust Statistics. 
A major recent advance was the use of robust statistics to calculate the flow in small patches (with which the flow is assumed to be uniform) [Bab-Hadiashr & Suter 1998]. 

A (LARGE!) mpeg file shows a screen capture of our optic flow demonstration code - 

30Mb Mpeg file (created by Dr. David Suter

top

Motion Segmentation

Segmentation is a vital task in many computer vision problems. In our work we have concentrated upon applying our segmentation methods to the segmentation of optical flow (motion) and range data. However, our methods are sufficiently general and robust as to apply to virtually any segmentation problem. Our focus has shifted to motion segmentation to overcome the assumption that the flow is uniform in certain patches and on the selection of motion models. Major results in this direction can be found in [Bab-Hadiashr & Suter 1999]. 
Following figures show an example of our approach to the motion segmentation problem . 

Otte sequence and its measured flow field Otte sequence and its measured flow field

Otte sequence and its measured flow field

Segmentation Results - First and Last Stage Segmentation Results - First and Last Stage

Segmentation Results - First and Last Stage

A (LARGE!) mpeg file shows a sequence segmentation based upon motion. 

17Mb Mpeg file (created by Dr. David Suter)

17Mb Mpeg file (created by Dr. David Suter)

top

Range Segmentation

Range data are usually clean and accurate and although they are made of multi-structures (at least in our examples), they do not have many impulsive outliers. Thus, the range data segmentation task probably does not challenge our methods as greatly as the motion segmentation task. However, clearly there is still some utility in employing robust segmentation methods on range data. Following figures show an example of our approach to range data segmentation (for detail, see Bab-Hadiashr & Suter 1999).

Intensity image, original and segmented range data

top