CV – 行李の底に収めたり[YuWd]

【NeRF】動画から点群・メッシュ・任意視点動画を生成してみる

📅 2024/1/18 · ☕ 4 min read

NeRFを使えば，点群・メッシュ・任意視点動画が作れるのでやってみた今回は愛飲するRedBullを被写体にしてみるヨ！任意視点動画 (GIF版) 任意視点動画 (動画版) Your browser does not support the video tag. 点群 NeRFとnerfstudioについて簡潔に説明ボリュームレンダリングある点

x

と方向

d

を入力として

(c, σ)

を出力 ...

How to create Matterport3D segmentation images?

📅 2022/11/24 · ☕ 1 min read

Intro The other day, one of my labmates needed to make a segmentation of Matterport3D. He asked for help, and I got involved in creating the segmentation. However, it turned out to be a real struggle. We were not used to 3D mesh models. After several weeks, we completed the code to create a semantic segmentation image for Matterport3D. How to create Matterport3D segmentation images Matterport3D provides access to 3D segmentation but does not give users an easy way to access 2D. Matterport3D data only provides point clouds and meshes labeled by ground truth, and the user must add color directly to the point clouds and meshes to create 2D segmentations. We, therefore, wrote code using Matterport3DSimulator to place a camera for a given scan_id and viewpoint_id and create a segmentation from the original ply file. When we run our code, we get the following image. (I concatenated the obtained images and converted to a gif) Matterport3DSimulator takes a total of 36 pictures: 12 at the top, 12 at the perimeter, and 12 at the bottom. ...

How to create Matterport3D segmentation images?

【論文メモ】Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from the Decision Boundary Perspective

📅 2022/9/24 · ☕ 4 min read

はじめに CVPR22 決定境界を描画し, 再現性と汎化性について定量的に考察した論文決定境界の描画 (領域の決定) 如何に決定境界を描画するかが重要になってくるその上でまず, データ多様体

M

の近傍(on-manifold)を取るのか,

M

から離れた領域(off-manifold)を取るのかを考 ...

【論文メモ】Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from the Decision Boundary Perspective

Vuforia: ARマーカーは非対称な配置が精度を上げる

📅 2022/8/16 · ☕ 1 min read

Vuforiaのマーカー検出精度はゴミ出来るだけARマーカーの検出精度を上げたい Vuforiaは何を見てマーカーを判断しているのか？コーナー検出っぽいのやって特徴量を抽出してる模様 https://library.vuforia.com/objects/best-practices-designing-and-developing-image-based-targets なるだけ特徴量をまばらに(=対称性を持たせない様式で)配置すると良いようだ下の図はopencvでもvuforiaでも簡単に使えるARマーカ ...

#CV
#post

PCA Color Augmentation (PCACA)

📅 2022/7/25 · ☕ 1 min read

AlexNetで使われたらしいData augumentation手法そんなに使われてるイメージはない. 古代の手法？？ Fancy PCA / PCACAとも言うらしい？(要出典) 画像中の色の分布を考慮したデータ拡張ができる例えば, 明るいところは明るく, 暗いところは暗く調節できる流れは簡単

C \times H \times W

をflattenして,

C \times H W

にする各チャネ ...

Perspective-n-Point問題

📅 2022/7/23 · ☕ 1 min read

世界座標系における3D点群と, それらに対応する2D画像が与えられた場合において, カメラのポーズ推定を行う問題カメラのポーズは平行移動と回転の6DOFで, Perspective-n-Point問題はPnPと略されることが多い P3Pは最低三点あれば解ける一般化されたPnPを解くアルゴリズムは様々ある EPnP SQPnP: A Consistently Fast and Globally Optimal Solution to the Perspective-n-Point ...

#CV
#post

SQPnP: A Consistently Fast and Globally Optimal Solution to the Perspective-n-Point Problem

📅 2022/7/23 · ☕ 1 min read

ECCV2020 ...

#CV
#post

Canny法

📅 2022/7/16 · ☕ 1 min read

かなり古い(1986年)エッジ検出手法授業で使ってるチームがいたので気になって調べてみた流れノイズをへらすため, ガウシアンフィルタソーベルフィルタを用いて画素値を微分し, 勾配と法線ベクトルを求める →ついでにこれで輪郭を抽出したことになる抽出した輪郭線を細くしていく → ある画素

x

に注目したとき, 法線方向に隣接する2点を ...

#CV
#post

MPI (Multiplane Image)

📅 2022/7/12 · ☕ 1 min read

画像を複数レイヤーの重なりとして分解する →MPI表現 ...

#CV
#post

【論文メモ】NeRF

📅 2022/7/11 · ☕ 1 min read

Positional Encoding 低周波成分である

(x, y, z, θ, ϕ)

から高周波成分である

(R, G, B, D)

を復元するため, 以下に示すPositional Encodingを通したあとにMLPに入力

γ (t) = (\sin (2^{0} t π), \cos (2^{0} t π), \dots, \sin (2^{L} t π), \cos (2^{L} t π))

この機構をNTKによって実験した論文→ Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains このPositional Encodingはフーリエ特 ...

#論文
#CV

epipolar

📅 2022/6/26 · ☕ 1 min read

「カメラ

O_{L}

から見て

X_{L}

に物体が写っている」とき、「カメラ

O_{R}

から見てどこに物体が写るか」の候補 https://qiita.com/Thought_Nibbler/items/9cb7c2637000eecc1a30 “平行な目"では, epipolar lineは平行になる https://gyazo.com/0357ac0708ea93a2036c5c88a388a321[https://gyazo.com/73088515f6591c4dc94313d88accc163] ...