3 months ago

Enhancing Interpretable Clauses Semantically using Pretrained Word Representation

Rohan Kumar Yadav Lei Jiao Ole-Christoffer Granmo Morten Goodwin

Abstract

Tsetlin Machine (TM) is an interpretable pattern recognition algorithm based on propositional logic, which has demonstrated competitive performance in many Natural Language Processing (NLP) tasks, including sentiment analysis, text classification, and Word Sense Disambiguation. To obtain human-level interpretability, legacy TM employs Boolean input features such as bag-of-words (BOW). However, the BOW representation makes it difficult to use any pre-trained information, for instance, word2vec and GloVe word representations. This restriction has constrained the performance of TM compared to deep neural networks (DNNs) in NLP. To reduce the performance gap, in this paper, we propose a novel way of using pre-trained word representations for TM. The approach significantly enhances the performance and interpretability of TM. We achieve this by extracting semantically related words from pre-trained word representations as input features to the TM. Our experiments show that the accuracy of the proposed approach is significantly higher than the previous BOW-based TM, reaching the level of DNN-based models.

Code Repositories

cair/PyTsetlinMachineCUDA

Mentioned in GitHub

cair/pyTsetlinMachineParallel

Mentioned in GitHub

cair/TsetlinMachine

Mentioned in GitHub

cair/pyTsetlinMachine

Mentioned in GitHub

ckinateder/pytsetlinmachineparallel

Mentioned in GitHub

cair/pyTsetlinMachineMT

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
sentiment-analysis-on-mr	TM-Glove	Accuracy: 77.51
text-classification-on-r52	TM-Glove	Accuracy: 89.14
text-classification-on-r8	TM-Glove	Accuracy: 97.50
text-classification-on-trec-6	TM-Glove	Error: 9.96

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

Enhancing Interpretable Clauses Semantically using Pretrained Word Representation

Rohan Kumar Yadav Lei Jiao Ole-Christoffer Granmo Morten Goodwin

Abstract

Code Repositories

Benchmarks

Build AI with AI

Hyper Newsletters