Universal Sentence Encoder by Ray Kurzweil’s Team at Google

Universal Sentence Encoder by Ray Kurzweil’s Team at Google

  • March 31, 2018
Table of Contents

Universal Sentence Encoder by Ray Kurzweil’s Team at Google

We present models for encoding sentences into embedding vectors that specifically target transfer learning to other NLP tasks. The models are efficient and result in accurate performance on diverse transfer tasks. Two variants of the encoding models allow for trade-offs between accuracy and compute resources.

For both variants, we investigate and report the relationship between model complexity, resource consumption, the availability of transfer task training data, and task performance. Comparisons are made with baselines that use word level transfer learning via pretrained word embeddings as well as baselines do not use any transfer learning. We find that transfer learning using sentence embeddings tends to outperform word level transfer.

With transfer learning via sentence embeddings, we observe surprisingly good performance with minimal amounts of supervised training data for a transfer task. We obtain encouraging results on Word Embedding Association Tests (WEAT) targeted at detecting model bias. Our pre-trained sentence encoding models are made freely available for download and on TF Hub.

Source: arxiv.org

Tags :
Share :
comments powered by Disqus

Related Posts

AWS Public Datasets

AWS Public Datasets

AWS hosts a variety of public datasets that anyone can access for free. Previously, large datasets such as satellite imagery or genomic data have required hours or days to locate, download, customize, and analyze. When data is made publicly available on AWS, anyone can analyze any volume of data without needing to download or store it themselves.

Read More
AI Cardiologist Aces Its First Medical Exam

AI Cardiologist Aces Its First Medical Exam

When both the AI and expert cardiologists were asked to classify the images, the AI achieved an accuracy of 92 percent. The humans got only 79 percent correct.

Read More