← return to transmissions
> TRANSMISSION 0x0040 // 2024.02.04 // CLASSIFICATION: LEARNING

Open Source AI Creates Personalized Travel Content on Consumer Hardware

1 MIN/
learningpersonalopensourcegenaitravel

πŸ”“ Showcasing the potential of open source GenAI for crafting personalized travel content.

🌏 Destination: Tokyo! Our fictional traveler, Max, embarks on his vacation to Japan. He chooses fish from the flight on-board menu. He also watched sports and played video games on the onboard entertainment system.

πŸ“œ Used the OpenHermes Mistral 7B LLM via the text-generation-webui to bring Max's Tokyo experience to life with a vivid and personalized set of key frame prompts like: "150": "digital art, a 30 year old man buying fish at Tsukiji Fish Market , Fresh Seafood Vibes, Busy Traders, Unique Ingredients, vivid colors, (high-resolution:1.2), ultra realistic"

🎬 Used Stable Diffusion and deforum extension to create the animation. Provided the prompts from the LLM, and added settings for camera movement. Deforum takes care of the tweening (β€œinbetween”), or creating, the images from one keyframe to the next which are described by the prompts.

🎀 Tortoise TTS for text-to-speech voice over intro welcoming Max to Tokyo.

All tools self-hosted on consumer hardware (RTX3090), using less than 10Gb VRAM, video rendered in 1 hour.

I'm curious how the above could be stitched together in an automated pipeline to scale presenting the output in-flight to passengers nearing arrival time.

Watch on YouTube