Command Palette
Search for a command to run...
Nihal V. Nayak Stephen H. Bach

Abstract
Zero-shot learning relies on semantic class representations such as hand-engineered attributes or learned embeddings to predict classes without any labeled examples. We propose to learn class representations by embedding nodes from common sense knowledge graphs in a vector space. Common sense knowledge graphs are an untapped source of explicit high-level knowledge that requires little human effort to apply to a range of tasks. To capture the knowledge in the graph, we introduce ZSL-KG, a general-purpose framework with a novel transformer graph convolutional network (TrGCN) for generating class representations. Our proposed TrGCN architecture computes non-linear combinations of node neighbourhoods. Our results show that ZSL-KG improves over existing WordNet-based methods on five out of six zero-shot benchmark datasets in language and vision.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| generalized-zero-shot-learning-on-apy-0-shot | ZSL-KG | Harmonic mean: 61.57 |
| generalized-zero-shot-learning-on-awa2 | ZSL-KG | Harmonic mean: 74.58 |
| generalized-zero-shot-learning-on-bbn-pronoun | ZSL-KG | F1: 26.69 |
| generalized-zero-shot-learning-on-ontonotes | ZSL-KG | F1: 45.21 |
| zero-shot-learning-on-apy-0-shot | ZSL-KG | Top-1: 60.54 |
| zero-shot-learning-on-awa2 | ZSL-KG | average top-1 classification accuracy: 78.08 |
| zero-shot-learning-on-snips | ZSL-KG | Accuracy: 88.98 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.