Tong Gao
March 2020
Lear Learning M ning Multi ulti-Moda Modal l Grounded Lingu - - PowerPoint PPT Presentation
March 2020 Lear Learning M ning Multi ulti-Moda Modal l Grounded Lingu Grounded Linguistic istic Semantics by Playing I Spy Tong Gao Introduction Early work on grounded language learning enabled a machine to map from
Tong Gao
March 2020
–
–
–
–
–
–
𝐷
–
–
–
efforts & joint positions, recorded for 6 joints at 15 Hz
frequency bins
–
𝐻𝑞 𝜆𝑑2 × 𝑁𝑑2 𝑌𝑗
𝑑2
𝜆𝑑1 × 𝑁𝑑1 𝑌𝑗
𝑑1
𝜆𝑑18 × 𝑁𝑑18 𝑌𝑗
𝑑18
… 𝜆𝑑 ∈ [0,1], Cohen’s Kappa, measuring the performance of 𝑁𝑑 on the ground truth labels # contexts: 18 𝑁𝑑(𝑌𝑗
𝑑) ∈ [−1,1], a quadratic-kernel
SVM For each language predicate 𝑞, a classifier 𝑯𝒒 ∈ [−1,1] is learned to decide whether objects possessed the attribute 𝑞:
– Correlated since the sign is positive – With confidence 0.137 = 0.137
– In 𝐻𝑔𝑏𝑢, we are confident on the decision made by classifier 𝑁𝑠𝑏𝑡𝑞,𝑏𝑣𝑒𝑗𝑢𝑝𝑠𝑧
–
–
–
–
–
– Tall - height (0.521) – Small - width (-0.665) – Water - weight (0.814) – Blue - weight (0.549, spurious)
while the object is not destroyed?
redundant - Ablation study
–
Classify new predicates by their distance to learned predicates in Wordnet or word embeddings?
–
If that’s all what they want, why not measure these properties with ruler & weight scale?
Should select some physical properties that can only be obtained by multi-modal system.