GitHub - aws-samples/demo-dancing-robot-using-generative-ai

Dancing Robot using Generative AI

We show a demo of Generative AI using Robot where the project name of demo is Robo-SoulMate.

The overall architecture is as follows:

The Voice Interpreter and IoT Controller are installed as components on AWS IoT Greengrass, and the AI Controller is configured as a JavaScript on the robot's Chrome browser.
The visitor's voice is converted to text using the Voice Interpreter. The Voice Interpreter uses Amazon Transcribe to convert speech to text to minimize latency.
The text is then delivered to the AI Controller via a pubsub method using CloudFront - API Gateway - Lambda (redis) - Amazon ElastiCache.
Gestures are analyzed locally, and the results are delivered to the AI Controller.
For image analysis other than gestures, the AI Controller captures images and interprets them using Bedrock's Claude3 (Sonnet) via CloudFront - API Gateway - Lambda (gesture).
The visitor's voice is converted to text and displayed on the screen in a chatbot format, allowing the user to easily understand how the AI Robot operates.
The converted text is processed using Bedrock's Claude3 (Haiku) via WebSocket through API Gateway - Lambda (chat). Prompt Engineering is used to handle various user intents.
The final result is delivered to the user using Amazon Polly.
The Dancing Robot performs various actions using the IoT Controller.
User access and related metrics are collected through DynamoDB and displayed on QuickSight.
IoT-related information can be viewed by visitors on a separate dashboard along with various IoT information using IoT SiteWise, etc.

These robots have personalities based on the 8 MBTI types and have names derived from AWS's representative services such as Lambda, SageMaker, and S3.

It is a demo vodeo for SoulMate.

robo-soulmate.mp4

Controller

The operation of the Robot Controller according to the user's Reaction is described in robot-controller.md.

Photo Booth

In Photo Booth, Stable Diffusion images are generated. For more details, please refer to photo-booth.md.

Security

See CONTRIBUTING for more information.

License

This library is licensed under the MIT-0 License. See the LICENSE file.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
lambda-chat-ws		lambda-chat-ws
lambda-controller		lambda-controller
lambda-delete-items		lambda-delete-items
lambda-emotion		lambda-emotion
lambda-gesture		lambda-gesture
lambda-gethistory		lambda-gethistory
lambda-greeting		lambda-greeting
lambda-photo		lambda-photo
lambda-polly		lambda-polly
lambda-provisioning		lambda-provisioning
lambda-query		lambda-query
lambda-reading		lambda-reading
lambda-redis		lambda-redis
lambda-score-chat		lambda-score-chat
lambda-score-gesture		lambda-score-gesture
lambda-score		lambda-score
lambda-translation		lambda-translation
lambda-upload		lambda-upload
lambda-voice-provisioning		lambda-voice-provisioning
lambda-voice-ws		lambda-voice-ws
pictures		pictures
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
photo-booth.md		photo-booth.md
robot-controller.md		robot-controller.md

License

aws-samples/demo-dancing-robot-using-generative-ai

Folders and files

Latest commit

History

Repository files navigation

Dancing Robot using Generative AI

Controller

Photo Booth

Security

License

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Languages