MIDS Capstone Project Summer 2022

eXpressMe! AI Driven Intent-Predicting Speech Assistant

Team members

Our project combines cutting-edge Kinetics-trained computer vision models with GPT-2 natural language sentence generation. We are reimagining speech assistants for teenagers with expressive communication difficulties. The app takes an 8-second video clip, applies a Kinetics-trained action recognition model to it, and infers the likely actions. Given those actions, we supply a subject (I, we, they) and let the speech API generate simple expressive sentences (for example, "Can I eat a hot dog?" or "Shall we eat donuts?").
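The step between action inference and sentence generation can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the score dictionary, the function names `top_actions` and `build_seeds`, and the seed-text format are all assumptions, and the calls to the vision model and to the GPT-2 sentence generator are left out.

```python
from itertools import product

# Subjects named in the project description.
SUBJECTS = ("I", "we", "they")


def top_actions(scores, k=2):
    """Return the k action labels with the highest scores.

    `scores` is a hypothetical mapping of action label -> confidence,
    standing in for the output of a Kinetics-trained classifier.
    """
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return [label for label, _ in ranked[:k]]


def build_seeds(actions, subjects=SUBJECTS):
    """Pair every action with every subject to form seed texts.

    The "<subject> <action>" format is an assumption; a language model
    would be expected to turn each seed into a fluent sentence.
    """
    return [f"{subject} {action}" for action, subject in product(actions, subjects)]


if __name__ == "__main__":
    # Made-up scores for a single 8-second clip.
    clip_scores = {
        "eating hotdog": 0.61,
        "eating doughnuts": 0.27,
        "juggling balls": 0.02,
    }
    for seed in build_seeds(top_actions(clip_scores)):
        print(seed)
```

Each seed pairs one inferred action with one subject, so a generator has a small, fixed set of candidates to rephrase as questions or requests.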

Course

Data Science 210. Capstone, Summer 2022

Class Project Gallery

Eating Donuts!

Video

Last updated: August 17, 2022