facebookresearch
diff --git a/website/docs-technical-specs/device/cad.mdx b/website/docs-technical-specs/device/cad.mdx
Lines changed: 1 addition & 1 deletion
diff --git a/website/docs-technical-specs/device/calibration.mdx b/website/docs-technical-specs/device/calibration.mdx
Lines changed: 2 additions & 0 deletions
diff --git a/website/docs-technical-specs/device/calibration_insights/_category_.json b/website/docs-technical-specs/device/calibration_insights/_category_.json
Lines changed: 4 additions & 0 deletions
diff --git a/website/docs-technical-specs/device/calibration_insights/camera_intrinsics_models.mdx b/website/docs-technical-specs/device/calibration_insights/camera_intrinsics_models.mdx
Lines changed: 143 additions & 0 deletions
diff --git a/website/docs-technical-specs/device/calibration_insights/sensor_measurement_model.mdx b/website/docs-technical-specs/device/calibration_insights/sensor_measurement_model.mdx
Lines changed: 35 additions & 0 deletions
diff --git a/website/static/img/docs-technical-specs/calibration/fisheye.png b/website/static/img/docs-technical-specs/calibration/fisheye.png
84 KB
diff --git a/website/static/img/docs-technical-specs/calibration/kb3.png b/website/static/img/docs-technical-specs/calibration/kb3.png
67.4 KB
diff --git a/website/static/img/docs-technical-specs/calibration/linear.png b/website/static/img/docs-technical-specs/calibration/linear.png
12.3 KB
diff --git a/website/static/img/docs-technical-specs/calibration/spherical.png b/website/static/img/docs-technical-specs/calibration/spherical.png
78 KB
@@ -1,5 +1,5 @@
 ---
-sidebar_position: 3
+sidebar_position: 4
 title: CAD File Downloads
 ---
 The table below outlines the 8 size versions for the Gen2 design, included are both states for the hinge: open and closed. Also, we do include a portion with camera FOV’s.
 
@@ -34,6 +34,7 @@ The following table shows which calibration data is available for each sensor ty
 | **Speakers** | `LSPK`, `RSPK` | ✅ Sensitivity (dBV) | ❌ |
 
 ### Camera Projection Models
+Details of the camera intrinsics models can be found in [Calibration insights](/technical-specs/device/calibration_insights/camera_intrinsics_models).
 
 - **FisheyeRadTanThinPrism**: Used for RGB and SLAM cameras
   - Parameters: focal length (f), principal point (cx, cy), radial distortion (k0-k5), tangential distortion (p0, p1), thin prism (s0-s2)
@@ -42,6 +43,7 @@ The following table shows which calibration data is available for each sensor ty
   - Parameters: focal lengths (fx, fy), principal point (cx, cy), Kannala-Brandt distortion coefficients (kb0-kb3)
 
 ### IMU Calibration Components
+Details of the IMU models can be found in [Calibration insights](/technical-specs/device/calibration_insights/sensor_measurement_model).
 
 - **Accelerometer**: Bias offset (m/s²) and 3x3 rectification matrix for scale/cross-axis corrections
 - **Gyroscope**: Bias offset (rad/s), rectification matrix, and G-sensitivity matrix
 
@@ -0,0 +1,4 @@
+{
+  "label": "Calibration insights",
+  "position": 3
+}
@@ -0,0 +1,143 @@
+---
+sidebar_position: 1
+title: Camera Intrinsic Models
+---
+# Camera Intrinsic Models for Project Aria devices
+
+This page provides an overview of the intrinsic models used by RGB, Eye Tracking and Mono Scene (aka SLAM) cameras in Project Aria glasses.
+
+A camera intrinsic model maps between a 3D world point, expressed in the camera coordinate frame, and its corresponding 2D pixel on the sensor. It supports mapping from the 3D point to the pixel (projection) and from the pixel to the ray that connects the point and the camera's optical center (unprojection).
+
+Our projection models are based on polar coordinates of 3D world points. Given a 3D world point in the device frame $\mathbf{P}_d$, we first transform it to the camera's local frame
+$$
+\mathbf{P}_c = (x, y, z) = T_\text{device}^\text{camera}\mathbf{P}_d
+$$
+
+We then compute the corresponding polar coordinates $\Phi = (\theta, \varphi)$, which satisfy
+$$
+    x/z = \tan(\theta)\cos(\varphi), \quad
+    y/z = \tan(\theta)\sin(\varphi).
+$$
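The relation above can be sketched in a few lines of Python. This is a minimal illustration, not code from `projectaria_tools`; the function name `to_polar` is hypothetical, and the point is assumed to be already expressed in the camera frame.

```python
import math

def to_polar(x, y, z):
    """Return (theta, phi) for a point (x, y, z) in the camera frame.

    theta is the angle between the ray and the optical (z) axis,
    phi is the azimuth of the ray around the optical axis.
    """
    theta = math.atan2(math.hypot(x, y), z)
    phi = math.atan2(y, x)
    return theta, phi

theta, phi = to_polar(1.0, 1.0, 2.0)
# The defining relations x/z = tan(theta)cos(phi), y/z = tan(theta)sin(phi) hold:
print(math.tan(theta) * math.cos(phi), math.tan(theta) * math.sin(phi))
```

Using `atan2` for `theta` keeps the result well defined for points beside or behind the camera, where `x/z` and `y/z` alone are ambiguous or undefined.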
+
+We assume the camera has a single optical center, so all points with the same polar coordinates map to the same 2D pixel $\mathbf{p}$:
+$$
+    \mathbf{p} = f(\Phi)
+$$
+Here $f$ is the camera projection model.
+
+Inversely, we can unproject from a 2D camera pixel to the polar coordinate by
+$$
+    \Phi = f^{-1}(\mathbf{p})
+$$
+
+In Aria we support four types of projection models: Linear, Spherical, KannalaBrandtK3, and FisheyeRadTanThinPrism. The linear camera model is the standard textbook intrinsic model and works well for image rectification. However, the cameras on Aria glasses all have fisheye lenses, and the spherical camera model is a much better approximation for these lenses. To calibrate the camera lenses to a high quality, we use two more sophisticated camera models that additionally model radial and tangential distortion.
+
+![Image](/img/docs-technical-specs/calibration/linear.png)
+![Image](/img/docs-technical-specs/calibration/spherical.png)
+![Image](/img/docs-technical-specs/calibration/kb3.png)
+![Image](/img/docs-technical-specs/calibration/fisheye.png)
+
+The following table shows which model is used for each type of Aria camera:
+
+| Camera Type               | Intrinsics Model        |
+|---------------------------|-------------------------|
+| SLAM Camera               | FisheyeRadTanThinPrism  |
+| RGB Camera                | FisheyeRadTanThinPrism  |
+| Eye-Tracking Camera       | KannalaBrandtK3         |
+
+## The linear camera model
+The linear camera model (a.k.a. the pinhole model) is parametrized by four coefficients: $f_x, f_y, c_x, c_y$.
+
+$(f_x, f_y)$ are the focal lengths, and $(c_x, c_y)$ are the pixel coordinates of the point where the optical axis intersects the image (the principal point).
+The model maps a world point $(x,y,z)$ to the 2D camera pixel $\mathbf{p}=(u, v)$ with the following formulae:
+$$
+    u = f_x x/z + c_x \\
+    v = f_y y/z + c_y
+$$
+Or, in polar coordinates:
+$$
+    u = f_x \tan(\theta) \cos(\varphi) + c_x, \\
+    v = f_y \tan(\theta) \sin(\varphi) + c_y.
+$$
+
+Inversely, we can unproject from the 2D camera pixel $\mathbf{p}=(u, v)$ to the homogeneous coordinates $(x/z, y/z)$ of the world point by
+$$
+x/z=(u-c_x)/f_x, \\
+y/z=(v-c_y)/f_y.
+$$
+The linear camera model preserves linearity in 3D space, so straight lines in the real world appear straight in images under the linear camera model.
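As an illustration of the formulae above, here is a minimal Python sketch of linear projection and unprojection. The function names and the intrinsic values are hypothetical and chosen only for the example; they are not taken from an Aria calibration.

```python
def project_linear(point, fx, fy, cx, cy):
    """Project a camera-frame point (x, y, z) to a pixel (u, v)."""
    x, y, z = point
    return fx * x / z + cx, fy * y / z + cy

def unproject_linear(pixel, fx, fy, cx, cy):
    """Unproject a pixel (u, v) to the ray (x/z, y/z, 1)."""
    u, v = pixel
    return (u - cx) / fx, (v - cy) / fy, 1.0

# Hypothetical intrinsics, for illustration only.
fx, fy, cx, cy = 500.0, 500.0, 320.0, 240.0
u, v = project_linear((0.2, -0.1, 2.0), fx, fy, cx, cy)
ray = unproject_linear((u, v), fx, fy, cx, cy)
print((u, v), ray)
```

Unprojection recovers the ray only up to scale: depth is lost in projection, so the sketch returns the ray normalized to `z = 1`.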
+
+## The spherical camera model
+
+The spherical camera model is, like the linear camera model, parametrized by four coefficients: $f_x, f_y, c_x, c_y$.
+Its pixel coordinates are linear in the angle $\theta$ from the optical axis rather than in the homogeneous coordinates $(x/z, y/z)$.
+The projection function can be written in polar coordinates as
+$$
+    u = f_x \theta \cos(\varphi) + c_x, \\
+    v = f_y \theta \sin(\varphi) + c_y.
+$$
+Note the difference from the linear camera model: under spherical projection, straight lines in 3D appear curved in images.
+
+Inversely, we can unproject from the 2D camera pixel $\mathbf{p}=(u, v)$ to the polar coordinates of the world point by
+$$
+    \theta = \sqrt{(u - c_x)^2/f_x^2 + (v - c_y)^2/f_y^2}, \\
+    \varphi = \operatorname{atan2}((v - c_y)/f_y, (u - c_x)/f_x).
+$$
+Here $\operatorname{atan2}(y, x)$ denotes the two-argument arctangent.
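The spherical projection and its inverse can be sketched as follows. This is a minimal illustration with hypothetical function names and intrinsics; it uses the `atan2(y, x)` argument order of Python's `math.atan2`, so the $v$ term is passed first.

```python
import math

def project_spherical(theta, phi, fx, fy, cx, cy):
    """Map polar coordinates (theta, phi) to a pixel (u, v)."""
    return fx * theta * math.cos(phi) + cx, fy * theta * math.sin(phi) + cy

def unproject_spherical(u, v, fx, fy, cx, cy):
    """Map a pixel (u, v) back to polar coordinates (theta, phi)."""
    a = (u - cx) / fx  # equals theta * cos(phi)
    b = (v - cy) / fy  # equals theta * sin(phi)
    return math.hypot(a, b), math.atan2(b, a)

# Hypothetical intrinsics, for illustration only.
fx, fy, cx, cy = 240.0, 240.0, 320.0, 240.0
u, v = project_spherical(0.7, -2.0, fx, fy, cx, cy)
theta, phi = unproject_spherical(u, v, fx, fy, cx, cy)
print(theta, phi)
```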
+
+## The KannalaBrandtK3 (KB3) model
+
+The KannalaBrandtK3 model adds radial distortion to the spherical model (it reduces to the spherical model when all distortion coefficients are zero):
+$$
+    u = f_x r(\theta) \cos(\varphi)  + c_x, \quad
+    v = f_y r(\theta) \sin(\varphi)  + c_y.
+$$
+where
+$$
+    r(\theta) = \theta + k_0 \theta^3 + k_1 \theta^5 + k_2 \theta^7 + k_3 \theta^9 + ...
+$$
+In the KannalaBrandtK3 model, we truncate this series at the 9th order, which leaves four radial distortion parameters $k_0, \dots, k_3$.
+
+To unproject from the camera pixel $(u, v)$ to the polar coordinates $(\theta, \varphi)$ of the world point, we first compute
+$$
+    \varphi = \operatorname{atan2}((v - c_y)/f_y, (u - c_x)/f_x) \\
+    r(\theta) = \sqrt{(u - c_x)^2/f_x^2 + (v - c_y)^2/f_y^2}
+$$
+Then we use Newton's method to invert the function $r(\theta)$ and compute $\theta$. See the code [here](https://github.com/facebookresearch/projectaria_tools/blob/afad1fe09dd1d89eee55ceb95ba1f2f577f9c606/core/calibration/camera_projections/KannalaBrandtK3.h#L131-L147).
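The Newton inversion of $r(\theta)$ can be sketched as below. This is a simplified illustration and not the linked `projectaria_tools` implementation; the function names, the iteration cap, the tolerance, and the coefficient values are all assumptions made for the example.

```python
def kb3_radius(theta, k):
    """r(theta) = theta + k0*theta^3 + k1*theta^5 + k2*theta^7 + k3*theta^9."""
    t2 = theta * theta
    r, power = theta, theta
    for ki in k:
        power *= t2          # theta^3, theta^5, ...
        r += ki * power
    return r

def kb3_radius_derivative(theta, k):
    """dr/dtheta = 1 + 3*k0*theta^2 + 5*k1*theta^4 + ..."""
    t2 = theta * theta
    d, power = 1.0, 1.0
    for i, ki in enumerate(k):
        power *= t2          # theta^2, theta^4, ...
        d += (2 * i + 3) * ki * power
    return d

def invert_kb3_radius(r, k, iterations=20, tol=1e-12):
    """Solve r(theta) = r for theta with Newton's method."""
    theta = r  # initial guess: no distortion
    for _ in range(iterations):
        step = (kb3_radius(theta, k) - r) / kb3_radius_derivative(theta, k)
        theta -= step
        if abs(step) < tol:
            break
    return theta

# Hypothetical distortion coefficients, for illustration only.
k = (0.01, -0.002, 0.0005, -0.0001)
r = kb3_radius(0.9, k)
print(invert_kb3_radius(r, k))
```

Starting from `theta = r` is a natural initial guess when the distortion is small, because the undistorted model has $r(\theta) = \theta$.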
+
+## The Fisheye62 model
+
+The Fisheye62 model adds tangential distortion on top of the KB3 model, parametrized by two new coefficients: $p_0, p_1$.
+$$
+    u = f_x \cdot (u_r + t_x(u_r, v_r))  + c_x, \\
+    v = f_y \cdot (v_r + t_y(u_r, v_r))  + c_y.
+$$
+where
+$$
+    u_r = r(\theta) \cos(\varphi), \\
+    v_r = r(\theta) \sin(\varphi).
+$$
+and
+$$
+    t_x(u_r, v_r)  = p_0(2 u_r^2 + r(\theta)^2) + 2p_1u_rv_r, \\
+    t_y(u_r, v_r)  = p_1(2 v_r^2 + r(\theta)^2) + 2p_0u_rv_r.
+$$
+
+To unproject from the camera pixel $(u, v)$ to the polar coordinates $(\theta, \varphi)$ of the world point, we first use Newton's method to compute $u_r$ and $v_r$ from $(u - c_x)/f_x$ and $(v - c_y)/f_y$, and then compute $(\theta, \varphi)$ using the KB3 unprojection method above.
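The first unprojection step, recovering $(u_r, v_r)$ from the tangentially distorted coordinates, can be sketched with a two-dimensional Newton iteration. This is a minimal illustration under assumptions: the function names, iteration cap, tolerance, and coefficient values are hypothetical, and the sketch uses $r(\theta)^2 = u_r^2 + v_r^2$, which follows from the definitions of $u_r$ and $v_r$.

```python
def tangential(ur, vr, p0, p1):
    """Tangential distortion terms (t_x, t_y)."""
    r2 = ur * ur + vr * vr  # equals r(theta)^2
    tx = p0 * (2 * ur * ur + r2) + 2 * p1 * ur * vr
    ty = p1 * (2 * vr * vr + r2) + 2 * p0 * ur * vr
    return tx, ty

def distort(ur, vr, p0, p1):
    """Return (u_r + t_x, v_r + t_y), i.e. ((u - c_x)/f_x, (v - c_y)/f_y)."""
    tx, ty = tangential(ur, vr, p0, p1)
    return ur + tx, vr + ty

def undistort(ud, vd, p0, p1, iterations=20, tol=1e-12):
    """Invert distort() with Newton's method, starting from the distorted point."""
    ur, vr = ud, vd
    for _ in range(iterations):
        fu, fv = distort(ur, vr, p0, p1)
        eu, ev = fu - ud, fv - vd
        # Jacobian of distort() with respect to (ur, vr).
        j00 = 1 + 6 * p0 * ur + 2 * p1 * vr
        j01 = 2 * p0 * vr + 2 * p1 * ur
        j10 = 2 * p1 * ur + 2 * p0 * vr
        j11 = 1 + 6 * p1 * vr + 2 * p0 * ur
        det = j00 * j11 - j01 * j10
        du = (j11 * eu - j01 * ev) / det
        dv = (-j10 * eu + j00 * ev) / det
        ur, vr = ur - du, vr - dv
        if abs(du) + abs(dv) < tol:
            break
    return ur, vr

# Hypothetical tangential coefficients, for illustration only.
p0, p1 = 0.001, -0.002
ud, vd = distort(0.3, -0.4, p0, p1)
print(undistort(ud, vd, p0, p1))
```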
+
+## The FisheyeRadTanThinPrism (Fisheye624) model
+
+The FisheyeRadTanThinPrism model (also called Fisheye624 in files and in the codebase) adds thin-prism distortion (denoted $tp$) on top of the Fisheye62 model above.
+Its parametrization contains four additional coefficients: $s_0, s_1, s_2, s_3$. The projection function is:
+$$
+    u = f_x \cdot (u_r + t_x(u_r, v_r) + tp_x(u_r, v_r))  + c_x, \\
+    v = f_y \cdot (v_r + t_y(u_r, v_r) + tp_y(u_r, v_r))  + c_y.
+$$
+$u_r$, $v_r$, $t_x$, and $t_y$ are defined as in the Fisheye62 model, while $tp_x$ and $tp_y$ are defined as:
+$$
+   tp_x(u_r, v_r) = s_0 r(\theta)^2 + s_1  r(\theta)^4, \\
+   tp_y(u_r, v_r) = s_2 r(\theta)^2 + s_3  r(\theta)^4.
+$$
+
+To unproject from the camera pixel $(u, v)$ to the polar coordinates $(\theta, \varphi)$ of the world point, we first use Newton's method to compute $u_r$ and $v_r$ from $(u - c_x)/f_x$ and $(v - c_y)/f_y$, and then compute $(\theta, \varphi)$ using the KB3 unprojection method above.
+
+Note that in practice, our codebase and calibration files assume that $f_x$ and $f_y$ are equal.
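Putting the pieces together, the forward projection can be sketched as below. This is a minimal illustration and not the `projectaria_tools` implementation; the function name and argument layout are hypothetical. It assumes a single focal length `f`, as noted above, and its radial loop accepts any number of radial coefficients.

```python
import math

def project_fisheye624(point, f, cx, cy, k, p, s):
    """Project a camera-frame point (x, y, z) to a pixel (u, v).

    k: radial coefficients, p: (p0, p1) tangential, s: (s0, s1, s2, s3) thin prism.
    """
    x, y, z = point
    theta = math.atan2(math.hypot(x, y), z)
    phi = math.atan2(y, x)

    # Radial distortion: r = theta + k0*theta^3 + k1*theta^5 + ...
    t2 = theta * theta
    r, power = theta, theta
    for ki in k:
        power *= t2
        r += ki * power
    ur, vr = r * math.cos(phi), r * math.sin(phi)
    r2 = r * r

    # Tangential distortion.
    tx = p[0] * (2 * ur * ur + r2) + 2 * p[1] * ur * vr
    ty = p[1] * (2 * vr * vr + r2) + 2 * p[0] * ur * vr

    # Thin-prism distortion.
    tpx = s[0] * r2 + s[1] * r2 * r2
    tpy = s[2] * r2 + s[3] * r2 * r2

    return f * (ur + tx + tpx) + cx, f * (vr + ty + tpy) + cy

# With all distortion set to zero the model reduces to the spherical model.
print(project_fisheye624((1.0, 0.0, 1.0), 600.0, 700.0, 700.0,
                         [0.0] * 6, (0.0, 0.0), (0.0, 0.0, 0.0, 0.0)))
```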
@@ -0,0 +1,35 @@
+---
+sidebar_position: 2
+title: Sensor Measurement Model
+---
+
+# Sensor Measurement Models in Project Aria Devices
+
+This page provides an overview of how Project Aria device sensor measurements are modeled for IMU, magnetometer, barometer and audio.
+## IMUs
+
+For IMUs, we employ an affine model in which the raw readout of the accelerometer $s_a$ or gyroscope $s_g$ is compensated to obtain the "real" acceleration $a$ or angular velocity $\omega$ by
+$$
+a = M_a^{-1}(s_a - b_a) \qquad
+\omega = M_g^{-1}(s_g - b_g)
+$$
+$M_a$ and $M_g$ are assumed to be upper triangular so that there is no global rotation from the IMU body frame to the accelerometer frame.
+
+Inversely, we can simulate the sensor read-out from acceleration or angular velocity by
+$$
+s_a = M_a a + b_a \qquad
+s_g = M_g \omega + b_g
+$$
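The two directions of the affine model can be sketched in plain Python as follows. The function names and the matrix and bias values are hypothetical, chosen only for illustration. Because $M$ is assumed to be upper triangular, the rectification solves $M a = s - b$ by back-substitution instead of forming an explicit inverse.

```python
def simulate_readout(M, b, value):
    """s = M * value + b for a 3x3 matrix M and 3-vectors b, value."""
    return [sum(M[i][j] * value[j] for j in range(3)) + b[i] for i in range(3)]

def rectify(M, b, s):
    """Recover the value from a readout: solve M * value = s - b.

    Assumes M is upper triangular with a non-zero diagonal.
    """
    rhs = [s[i] - b[i] for i in range(3)]
    value = [0.0, 0.0, 0.0]
    for i in (2, 1, 0):  # back-substitution, last row first
        acc = rhs[i] - sum(M[i][j] * value[j] for j in range(i + 1, 3))
        value[i] = acc / M[i][i]
    return value

# Hypothetical accelerometer calibration, for illustration only.
M_a = [[1.01, 0.002, -0.001],
       [0.0, 0.99, 0.003],
       [0.0, 0.0, 1.02]]
b_a = [0.05, -0.02, 0.01]
a_true = [0.1, 9.8, -0.3]

s_a = simulate_readout(M_a, b_a, a_true)
print(rectify(M_a, b_a, s_a))
```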
+
+When the read-out signal exceeds a threshold, the signal saturates. Saturation limits are sensor dependent and are listed in the following table for the accelerometers and gyroscopes.
+
+||accel-left | accel-right| gyro-left | gyro-right|
+|--|--|--|--|--|
+|saturation|8g|16g|2000|2000|
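Saturation can be added to a simulated readout by clipping each axis. The sketch below covers the accelerometer limits only and rests on two assumptions that the table does not state: the limits are symmetric around zero, and `g` is converted with standard gravity (9.80665 m/s²). The names are hypothetical.

```python
G = 9.80665  # standard gravity in m/s^2 (assumed conversion for "g")

# Accelerometer saturation limits from the table, converted to m/s^2.
ACCEL_LIMITS = {"accel-left": 8 * G, "accel-right": 16 * G}

def saturate(values, limit):
    """Clip each component to [-limit, +limit] (symmetric clipping is assumed)."""
    return [max(-limit, min(limit, v)) for v in values]

clipped = saturate([100.0, -3.0, 0.5], ACCEL_LIMITS["accel-left"])
print(clipped)
```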
+
+
+## Magnetometer, barometer and audio
+
+Similar to the IMU rectification model, the sensor readouts for magnetometer, barometer, and audio data are modeled as linear in the real quantity $r$ (magnetic field, air pressure, and sound intensity, respectively).
+
+For audio specifically, the model is bias-only.