論文 – 行李の底に収めたり[YuWd]

【論文メモ】Graph Transformer: A Generalization of Transformer Networks to Graphs

📅 2022/7/7 · ☕ 1 min read

任意のGraphに適応可能な, 汎用Transformer Positional Encodingがラプラシアン行列の固有値で表現されるラプラシアン行列の固有値 $\lambda$は頻度・周波数的な側面を持つ → グラフ上のフーリエ変換・畳み込みでは $\lambda$が使われる (いつかまとめる→todo) todo https://arxiv.org/pdf/2012.09699v2.pdf ...

#論文

【論文メモ】Graph Transformer: A Generalization of Transformer Networks to Graphs

【論文メモ】ViLD: Open-vocabulary Object Detection via Vision and Language Knowledge Distillation

📅 2022/7/7 · ☕ 1 min read

Open-Vocabulary (任意テキスト入力)な物体検出モデル classifierがCLIP特徴量になっている ...

#論文

【論文メモ】ViLD: Open-vocabulary Object Detection via Vision and Language Knowledge Distillation

【論文メモ】SwinIR: Image Restoration Using Swin Transformer

📅 2022/7/7 · ☕ 1 min read

残差接続が大量にあるの面白い多分だけど, 真っ黒から真っ黒への変換みたいな無意味な変換によって重みの学習を引っ張られたくないので, クソデカ残差を入れているのだと思う (オキモチ) SwinTransformerのおかげでパラメタ数はかなり減っている ...

#論文

【論文メモ】SwinIR: Image Restoration Using Swin Transformer

【論文メモ】LXMERT

📅 2022/7/7 · ☕ 1 min read

ViLBERTとの大きな違いは, ROIのみを入力とする点 ...

#論文

Impact Factor

📅 2022/7/6 · ☕ 1 min read

学術雑誌の影響力を測る指標らしい (そんなのあるんだ) 今年の被引用数を過去2年分のPublicationで割る $\displaystyle {\text{IF}}_{y}={\frac {{\text{Citations}}_{y}}{{\text{Publications}}_{y-1}+{\text{Publications}}_{y-2}}}.$ ...

【論文メモ】Do Transformer Modifications Transfer Across Implementations and Applications?

📅 2022/6/27 · ☕ 1 min read

Transformerの改善案は大量にあるが, 本当に有効なのはどれだけあるの？という論文結論 (有効な改善方法) 活性化関数: GLU+GeLU/Swish 正規化: RMS Norm パラメタ共有: デコーダの入出力における埋め込み表現を共有すると良いアーキテクチャ Mixture of Experts Transformer Synthesizer Product Key Memory ...

#論文

【論文メモ】CP-GAN

📅 2022/6/27 · ☕ 1 min read

todo ...

#論文

【論文メモ】CLIP

📅 2022/6/27 · ☕ 1 min read

CLIPによって, image↔textの特徴量変換が容易になったと言える → ViLD: Open-vocabulary Object Detection via Vision and Language Knowledge Distillation ...

【論文メモ】HAMT - History Aware Multimodal Transformer for Vision-and-Language Navigation

📅 2022/6/26 · ☕ 1 min read

パラメタの更新にActor-Criticを使用強化学習と模倣学習の両方を組み込んでいる ...

#論文

【論文メモ】HAMT - History Aware Multimodal Transformer for Vision-and-Language Navigation

【論文メモ】SOHO - Seeing Out of tHe bOx : End-to-End Pre-training for Vision-Language Representation Learning

📅 2022/6/26 · ☕ 1 min read

クラスタリングの上位互換みたいなことをするパッチを特徴空間に飛ばすパッチに映る物体が同じ種類の物体なら, その特徴が同じクラスタidに含まれるように学習 ...

#論文

【論文メモ】SOHO - Seeing Out of tHe bOx : End-to-End Pre-training for Vision-Language Representation Learning

【論文メモ】REVERIE - Remote Embodied Visual Referring Expression in Real Indoor Environments

📅 2022/6/26 · ☕ 0 min read

...

【論文メモ】REVERIE - Remote Embodied Visual Referring Expression in Real Indoor Environments

【論文メモ】Maximum Classifier Discrepancy for Unsupervised Domain Adaptation

📅 2022/6/19 · ☕ 1 min read

Domain Adaptation 従来手法 : sourceとtargetとで分布が違うはずなのに, ドメイン同士の境界(赤線)を基準に近づけようとしている → 分布の違いを考慮しつつ決定境界を修正する必要がある → GAN GAN風に学習する２つのclassifierとそれらを生成するgenerator ...

#論文

【論文メモ】Maximum Classifier Discrepancy for Unsupervised Domain Adaptation

【論文メモ】Manifold Mixup: Better Representations by Interpolating Hidden States

📅 2022/6/15 · ☕ 1 min read

どういうの？無作為に選んだ層までは普通に計算して，その層の出力の複数をランダムに選んでMixup そのままその値を使って最終層まで計算＆lossを計算し, 逆伝播決定境界が滑らかになるらしい簡単に説明すると, まず特徴量空間上で特徴量がflattenな状態に収束していくらしい flatten=小さい部分空間で表現できるというこ ...

#論文