
stereovslam

Feature-based visual simultaneous localization and mapping (vSLAM) and visual-inertial sensor fusion with stereo camera

Since R2024a

    Description

    Use the stereovslam object to perform visual simultaneous localization and mapping (vSLAM) with stereo camera data. To learn more about visual SLAM, see Implement Visual SLAM in MATLAB.

    The stereovslam object extracts Oriented FAST and Rotated BRIEF (ORB) features from incrementally read images, and then tracks those features to estimate camera poses, identify key frames, and reconstruct a 3-D environment. The vSLAM algorithm also searches for loop closures using the bag-of-features algorithm, and then optimizes the camera poses using pose graph optimization. You can enhance the accuracy and robustness of the SLAM by integrating this object with IMU data to perform visual-inertial sensor fusion.

    Creation

    Description

    vslam = stereovslam(intrinsics,baseline) creates a stereo visual SLAM object, vslam, using the rectified stereo camera intrinsic parameters intrinsics, and the baseline distance between the rectified left and right cameras.

    The object represents 3-D map points and camera poses in world coordinates, and assumes the camera pose of the first key frame is an identity rigidtform3d transform.

    Note

    The stereovslam object runs on multiple threads internally, which can delay the processing of an image frame added by using the addFrame function. Additionally, because the object runs on multiple threads, the current frame the object is processing can be different from the most recently added frame.


    vslam = stereovslam(reprojectionMatrix,imageSize) creates a stereo visual SLAM object vslam using the stereo camera reprojection matrix, reprojectionMatrix, and the image size, imageSize.

    vslam = stereovslam(___,imuParameters) performs stereo visual-inertial SLAM based on the specified IMU parameters, imuParameters. Using this argument with IMU data requires Navigation Toolbox™.

    vslam = stereovslam(___,PropertyName=Value) sets properties using one or more name-value arguments in addition to any combination of input arguments from previous syntaxes. For example, MaxNumPoints=850 sets the maximum number of ORB feature points to extract from each image to 850.
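    For example, a minimal creation sketch, using hypothetical intrinsic parameters and a hypothetical baseline chosen only for illustration:

    % Hypothetical rectified camera parameters: focal length, principal
    % point, and image size in pixels; baseline in meters
    intrinsics = cameraIntrinsics([800 800],[320 240],[480 640]);
    baseline = 0.12;

    % Create the stereo visual SLAM object and raise the ORB feature budget
    vslam = stereovslam(intrinsics,baseline,MaxNumPoints=850);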

    Input Arguments


    Rectified stereo camera intrinsic parameters, specified as a cameraIntrinsics object.

    This argument sets the Intrinsics property.

    Distance between the rectified left and right cameras, specified as a nonzero scalar. Stereo vSLAM algorithms typically track the primary (or left) camera, in which case the baseline is greater than zero. A negative baseline value indicates a negative disparity range, and the vSLAM algorithm tracks the secondary (or right) camera instead.

    This argument sets the Baseline property.

    Reprojection matrix, specified as a 4-by-4 matrix of the form:

    [1  0   0   -cx
     0  1   0   -cy
     0  0   0    f
     0  0  1/b   0]

    where f and (cx, cy) are the focal length and principal point of the rectified primary camera, respectively, and b is the baseline of the virtual rectified stereo camera.

    You can obtain the reprojection matrix by using the rectifyStereoImages function.
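    For example, a minimal sketch, assuming stereoParams is a stereoParameters object from stereo camera calibration and I1 and I2 are an unrectified left and right image pair:

    % Rectify the stereo pair and obtain the reprojection matrix
    [J1,J2,reprojectionMatrix] = rectifyStereoImages(I1,I2,stereoParams);

    % Create the stereo visual SLAM object from the reprojection matrix
    % and the rectified image size, [nrows ncols]
    vslam = stereovslam(reprojectionMatrix,size(J1,[1 2]));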

    Image size produced by the camera, in pixels, specified as a two-element vector of the form [nrows ncols]. The elements nrows and ncols represent the number of rows and columns, respectively.

    IMU parameters, specified as a factorIMUParameters (Navigation Toolbox) object. The object contains the noise, bias, and sample rate information about the inertial measurement unit (IMU). Using this argument with IMU data requires Navigation Toolbox.
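    For example, a minimal visual-inertial creation sketch, assuming Navigation Toolbox is installed and that intrinsics and baseline are defined as above; the sample rate and noise values are illustrative placeholders, not calibrated values:

    % IMU sample rate, noise, and bias parameters (illustrative values only)
    imuParams = factorIMUParameters(SampleRate=200, ...
        GyroscopeNoise=1e-4,AccelerometerNoise=1e-3, ...
        GyroscopeBiasNoise=1e-8,AccelerometerBiasNoise=1e-6);

    % Create a stereo visual-inertial SLAM object
    vslam = stereovslam(intrinsics,baseline,imuParams);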

    Properties


    Camera Parameters

    This property is read-only.

    Camera intrinsic parameters, stored as a cameraIntrinsics object.

    Use the intrinsics argument to set this property.

    Distance between the rectified left and right cameras, stored as a nonzero scalar. Stereo vSLAM algorithms typically track the primary (or left) camera, in which case the baseline is greater than zero. A negative baseline value indicates a negative disparity range, and the vSLAM algorithm tracks the secondary (or right) camera instead.

    Use the baseline argument to set this property.

    This property is read-only.

    Disparity range, specified as a two-element vector of integers of the form [min max]. The elements specify the minimum and maximum disparity, respectively. The range must be within the width of the image, and the difference between the minimum and the maximum must be divisible by 16.

    Minimum value of uniqueness, specified as a nonnegative integer.

    The function marks the estimated disparity value K for a pixel as unreliable if:

    v < V×(1 + 0.01×UniquenessThreshold),

    where V is the sum of absolute differences (SAD) corresponding to the disparity value K, and v is the smallest SAD value over the whole disparity range, excluding K, K–1, and K+1.

    Increasing the value of UniquenessThreshold results in the function marking disparity values for more pixels as unreliable. To turn off the uniqueness threshold, set this value to 0.
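    For example, a sketch that tightens stereo matching at creation time, assuming these two properties are settable as name-value arguments named DisparityRange and UniquenessThreshold, and that intrinsics and baseline are defined as above:

    % Limit the disparity search to [0, 48] pixels (difference divisible by 16)
    % and mark more ambiguous matches as unreliable
    vslam = stereovslam(intrinsics,baseline, ...
        DisparityRange=[0 48],UniquenessThreshold=35);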

    Feature Extraction

    This property is read-only after object creation.

    Scale factor for image decomposition, stored as a scalar greater than 1. The scale factor is also referred to as the pyramid decimation ratio. Increasing the value of ScaleFactor reduces the number of pyramid levels, which reduces computation time. Decreasing this value (down to just over 1) increases the number of pyramid levels, which can improve tracking performance at the cost of computation speed. The scale value at each level of decomposition is ScaleFactor^(level-1), where level is any value in the range [0, NumLevels-1]. Given an input image of size M-by-N, the image size at each level of decomposition is Mlevel-by-Nlevel, where:

    Mlevel = M / ScaleFactor^(level-1)
    Nlevel = N / ScaleFactor^(level-1)

    This property is read-only after object creation.

    Number of decomposition levels, specified as an integer greater than or equal to 1. Increase this value to extract keypoints from the image at more levels of decomposition. Along with the ScaleFactor value, NumLevels controls the number of pyramid levels on which the object evaluates feature points.

    The image size at each decomposition level limits the number of levels at which you can extract keypoints. The image size at a level of decomposition must be at least 63-by-63 for keypoint detection. The maximum level of decomposition is calculated as

    levelmax = floor((log(min(M,N)) - log(63)) / log(ScaleFactor)) + 1

    If either the default value or the specified value of NumLevels is greater than levelmax, the object modifies NumLevels to levelmax and returns a warning.
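    For example, a short computation of levelmax for a hypothetical 480-by-640 image using the formula above:

    % Maximum number of decomposition levels for a 480-by-640 image
    M = 480; N = 640;
    scaleFactor = 1.2;
    levelMax = floor((log(min(M,N)) - log(63))/log(scaleFactor)) + 1
    % levelMax is 12, so NumLevels values greater than 12 are reduced to 12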

    This property is read-only after object creation.

    Maximum number of ORB feature points uniformly extracted from each image, specified as a positive integer. Values are typically in the range [800, 2000], depending on the image resolution. When the number of extracted features is less than the value of MaxNumPoints, the object uses all extracted feature points.

    Tracking

    This property is read-only after object creation.

    Key frame feature point range, stored as a two-element vector of positive integers in the form [lowerLimit upperLimit]. This property specifies the minimum and maximum numbers of tracked feature points a frame must contain for the object to identify it as a key frame. The TrackFeatureRange and SkipMaxFrames properties enable you to control the frequency at which frames in the tracking process become key frames.

    The success of tracking depends on the number of tracked points in the current frame, with one of these results:

    • Tracking is lost — The number of tracked feature points in the current frame is less than the lowerLimit set by the TrackFeatureRange property. This indicates that the image does not contain enough features, or that the camera is moving too fast. To improve the tracking, you can increase the upperLimit value of the TrackFeatureRange property and decrease the SkipMaxFrames property to add key frames more frequently.

    • Tracking is successful — The object identifies the current frame as a key frame. The number of tracked feature points in the current frame is in the range set by TrackFeatureRange.

    • Tracking adds key frames too frequently — The number of tracked feature points in the current frame is greater than the upperLimit set by the TrackFeatureRange property. This indicates that the camera is moving very slowly, which produces an unnecessary number of key frames. To improve the tracking, you can reduce the frequency of adding key frames by increasing the value of the SkipMaxFrames property.

    For more details, see the addFrame object function.

    This property is read-only after object creation.

    Maximum number of skipped frames, stored as a positive integer. When the number of tracked features is consistently greater than the upperLimit set by the TrackFeatureRange property, use the SkipMaxFrames property to control the frequency at which the object adds new key frames. The object identifies the current frame as a key frame when the number of skipped frames since the most recently added key frame equals the value of SkipMaxFrames.
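    For example, a sketch that adds key frames less frequently for a slow-moving camera, following the tuning guidance above; the specific values are illustrative, and intrinsics and baseline are assumed to be defined as above:

    % Accept 50-200 tracked points for key frames, and allow up to 20 skipped
    % frames between key frames to reduce how often key frames are added
    vslam = stereovslam(intrinsics,baseline, ...
        TrackFeatureRange=[50 200],SkipMaxFrames=20);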

    This property is read-only after object creation.

    Minimum number of matched feature points between loop closure key frames, stored as a positive integer.

    This property is read-only after object creation.

    Custom bag of features for loop detection, specified as a bagOfFeaturesDBoW object. The bagOfFeaturesDBoW object enables you to create a custom bag of visual words (BoW) from feature descriptors, use the built-in vocabulary, or load a custom vocabulary from a specified file.

    This property is read-only after object creation.

    Progress information display, specified as [] (or false), 1 (or true), 2, or 3. When the object creates log files, it displays their paths in the Command Window.

    Verbose value | Display description | Display location
    [] or false   | Display is turned off | None
    1 or true     | Stages of vSLAM execution | Command Window
    2             | Stages of vSLAM execution, with details on how the frame is processed, such as the artifacts used to initialize the map | Log file in a temporary folder
    3             | Stages of vSLAM execution, artifacts used to initialize the map, poses and map points before and after bundle adjustment, and loop closure optimization data | Log file in a temporary folder
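    For example, a minimal sketch that logs detailed per-frame processing information to a log file in a temporary folder, assuming intrinsics and baseline are defined as above:

    % Verbose=2 displays vSLAM stages and writes frame-processing details
    % to a log file whose path is shown in the Command Window
    vslam = stereovslam(intrinsics,baseline,Verbose=2);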

    IMU Fusion

    This property is read-only after object creation.

    IMU sensor transformation, specified as a rigidtform3d object. The transformation describes the rotation and translation of the camera in the coordinate system of the IMU sensor.

    This property is read-only after object creation.

    Number of estimated camera poses to trigger camera-IMU alignment, specified as an integer equal to or greater than 2. The process of aligning camera and IMU data is initiated after a specific number of camera poses have been estimated. This alignment serves two primary purposes: first, to estimate a scale factor that translates the up-to-scale results from a monocular camera into actual world units (meters), and second, to synchronize the IMU and camera frames, effectively eliminating the influence of gravity on accelerometer data. The timing for this alignment, determined by a threshold for the number of camera poses, is key to its success. A threshold set too low may not provide enough data for accurate calibration, while one set too high risks incorporating noise from measurement drift into the calibration. For a more in-depth understanding of this calibration technique, see the estimateGravityRotationAndPoseScale (Navigation Toolbox) function.

    This property is read-only after object creation.

    Subset of pose estimates, specified as a scalar in the range of (0,1]. This value specifies a fraction of the number of recent pose estimates, calculated as round(NumPosesThreshold*AlignmentFraction), for use in the camera-IMU alignment process. It effectively filters out initial, potentially noisy pose estimates, ensuring only the most relevant data contributes to the alignment for improved accuracy.
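    For example, a sketch of how NumPosesThreshold and AlignmentFraction interact, assuming a visual-inertial configuration with imuParams, intrinsics, and baseline defined as in the Creation section; the values are illustrative:

    % With a pose threshold of 60 and an alignment fraction of 0.75, the most
    % recent round(60*0.75) = 45 pose estimates are used for camera-IMU alignment
    vslam = stereovslam(intrinsics,baseline,imuParams, ...
        NumPosesThreshold=60,AlignmentFraction=0.75);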

    Object Functions

    addFrame         Add pair of stereo images to stereo visual SLAM object
    checkStatus      Check status of stereo visual SLAM object
    hasNewKeyFrame   Check if new key frame added in stereo visual SLAM object
    isDone           End-of-processing status for stereo visual SLAM object
    mapPoints        Build 3-D map of world points from stereo visual SLAM object
    plot             Plot 3-D map points and estimated camera trajectory in stereo visual SLAM
    poses            Absolute camera poses of stereo key frames
    reset            Reset stereo visual SLAM object

    Examples


    Perform stereo visual simultaneous localization and mapping (vSLAM) using the data from the UTIAS Long-Term Localization and Mapping Dataset provided by the University of Toronto Institute for Aerospace Studies. You can download the data to a directory using a web browser, or by running this code:

    ftpObj = ftp("asrl3.utias.utoronto.ca");
    tempFolder = fullfile(tempdir);
    dataFolder = [tempFolder,'2020-vtr-dataset\UTIAS-In-The-Dark\'];
    zipFileName = [dataFolder,'run_000005.zip'];
    folderExists = exist(dataFolder,"dir");

    Create a folder in a temporary directory to save the downloaded file and extract its contents.

    if ~folderExists  
        mkdir(dataFolder) 
        disp("Downloading run_000005.zip (818 MB). This download can take a few minutes.") 
        mget(ftpObj,"/2020-vtr-dataset/UTIAS-In-The-Dark/run_000005.zip",tempFolder);
    
        disp("Extracting run_000005.zip (818 MB) ...") 
        unzip(zipFileName,dataFolder); 
    end

    Create two imageDatastore objects to store the stereo images.

    imgFolderLeft = [dataFolder,'\images\left\'];
    imgFolderRight = [dataFolder,'\images\right\'];
    imdsLeft = imageDatastore(imgFolderLeft);
    imdsRight = imageDatastore(imgFolderRight);

    Specify the intrinsic parameters and the baseline of the stereo camera, and use them to create a stereo visual SLAM object. The focal length, principal point, and image size are in pixels, and the baseline is in meters.

    focalLength = [387.777 387.777];  
    principalPoint = [257.446 197.718];  
    imageSize = [384 512];            
    intrinsics = cameraIntrinsics(focalLength,principalPoint,imageSize);
    baseline = 0.239965; 
    
    vslam = stereovslam(intrinsics,baseline,MaxNumPoints=600, ...
        TrackFeatureRange=[30 120],SkipMaxFrames=5);

    Process each pair of stereo images and visualize the camera poses and 3-D map points.

    for i = 1:numel(imdsLeft.Files)
        leftImage = readimage(imdsLeft,i);
        rightImage = readimage(imdsRight,i);
        addFrame(vslam,leftImage,rightImage);
    
        if hasNewKeyFrame(vslam)
            % Query 3-D map points and camera poses
            xyzPoints = mapPoints(vslam);
            [camPoses,viewIds] = poses(vslam);
    
            % Display 3-D map points and camera trajectory
            plot(vslam);
        end
    
        % Get current status of system
        status = checkStatus(vslam);
        
        % Stop adding frames when tracking is lost
        if status == uint8(0)
            break
        end
    end 

    Continue plotting until the object has processed all of the added frames, and then reset the system.

    while ~isDone(vslam)
        plot(vslam)
    end

    (Figure: 3-D map points and estimated camera trajectory)

    reset(vslam)

    Tips

    • The stereovslam object:

      • Does not account for lens distortion. You can undistort the images by using the undistortImage function before adding images.

      • Assumes images have been rectified. You can rectify undistorted stereo images by using the rectifyStereoImages function before adding images, as shown in the sketch after this list.

      • Runs on multiple threads internally, which can delay the processing of an image frame added by using the addFrame function. Additionally, because the object runs on multiple threads, the current frame the object is processing can be different from the most recently added frame.

    • The camera poses are the poses of the primary camera, which corresponds to the input image I1 added by the addFrame object function.

    • The object represents 3-D map points and camera poses in world coordinates. The object assumes the camera pose of the first key frame is an identity rigidtform3d transform.
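    A minimal preprocessing sketch, assuming vslam is an existing stereovslam object, stereoParams is a stereoParameters object from stereo camera calibration, and I1 and I2 are an undistorted left and right image pair:

    % Rectify each undistorted stereo pair before adding it to the object
    [J1,J2] = rectifyStereoImages(I1,I2,stereoParams);
    addFrame(vslam,J1,J2);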

    References

    [1] Mur-Artal, Raúl, and Juan D. Tardós. "ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras." IEEE Transactions on Robotics 33, no. 5 (October 2017): 1255–1262. https://doi.org/10.1109/TRO.2017.2705103.

    Extended Capabilities


    Version History

    Introduced in R2024a
