Layla v7 alpha opens mini-app development to the public
The first Layla v7 alpha build is available for alpha testers: Download APK
Layla 7 opens mini-app development to the public via the Layla SDK, designed to integrate with Layla's LLM, characters, image generation, agents, memory, and more.
This first alpha includes LLM, character, and image generation APIs, with more SDK capabilities planned across the next alpha versions.
- Published the Layla SDK for building custom mini-apps that run inside Layla.
- The SDK exports an agent skill compatible with Codex, Claude Code, and OpenCode for AI-assisted mini-app creation.
- Added support for custom mini-apps as a single HTML file, a zip folder containing assets, or a self-hosted website.
- Published a Layla mini-app template for more complex projects.
- Added search features to the Your Mini-apps screen and Browse Mini-apps screen.
- Mini-apps can now be organized into folders, similar to character folders.
The Roleplay mini-app has been completely redesigned
The Roleplay mini-app has been completely redesigned: https://www.layla-network.ai/post/roleplay-mini-app-redesign
- Choose from 50+ templates to start your roleplay sessions: romance, adventure, isekai, comedy, interview practice, language learning, and more!
- Roleplay now supports local models (GGUF), connecting to your PC/OpenAI, Layla Server, or Layla Cloud.
- Edit/create your own scenarios with an all new editor - create temporary characters directly within the Roleplay mini-app, or attach your existing characters to the scenario. (Your temporary characters will be automatically put into a folder with the name of your Roleplay in the character list)
- Visual Novel mode: centres the character image as they speak/act.
- OOC Instructions: you can add out-of-character instructions at any time during the roleplay, useful to keep the AI on track.
- TTS: text-to-speech for individual characters in their own voices (configuration in the character edit)
- Roleplay supports custom agents and image generation.
- Improved long-term memory summarisation speed.
- Added Gemma4 (thinking) preset prompt.
- Redesigned the Saved Inference Engine section to easier filter by attached characters.
- Improved TTS consistency by removing unnecessary stopping mid-sentences.
- Updated Local Dream to add image editing features.
- Added new schedulers (Kerras versions) in Local Dream.
- Fixed a bug where the model selector disappears after searching for available models when configuring OpenAI API.
- Fixed a bug where chat bubble styles stops showing when performance counter is enabled.
- Fixed a bug where changing the prompt but not the seed in image generation does not generate a new image.
- Fixed a bug where "Speak Responses" in chat actions cannot be toggled off.
- Fixed a bug where "Push to talk" had stopped working.
Layla supports local voice cloning with PocketTTS!
Layla supports local voice cloning with PocketTTS! https://www.layla-network.ai/post/layla-v6-11-0-has-been-published
Create your own voices by recording or uploading an audio sample. Layla will create a custom voice out of it for which you can use for your characters! You can also set your custom voice as the the global TTS voice for Layla, or configure it individually for each character.
Record audio right in Layla to create custom voices: you can record an audio snippet and use it as a base to clone a voice.
Voice Cloning tips:
PocketTTS is very sensitive to background noise. Record your audio in a quiet room or use an online editor to remove background noise to get better results.
Only supports 16-bit PCM wav files, use an online converter if you need to convert mp3's etc.
Try not to have blanks/silences in the sample, PocketTTS will try to learn that and generate lots of silences.
Exaggerated voices work better.
- Companion mode now supports full screen chat; this gives a similar experience as the normal chat page in Layla. You can open the full screen chat by double tapping on the companion floating icon, or long-pressing on the companion floating icon, or tapping the single message bubble.
- Layla as your phone's default assistant now supports Agents.
- Companion mode now supports Agents.
- Added ability to configure chat-bubble colours in UI settings.
- Gallery of generated images in Layla are now backed up.
- Flipped the user/AI position of chat bubble styles to match the standard for other social media/chat apps.
- Image gallery now blocks other apps from taking a screenshot or recording of it's contents, and will also automatically hide its contents when Layla goes into the background.
- Image gallery in Layla allows multi-selection for deleting images.
- Improved TTS latency by implementing audio streaming.
- You can re-order characters inside folders and drag them outside to remove them from the folder.
- Improved character impression generation by limiting the max output and cutting off incomplete sentences.
- Stable Diffusion prompts are saved in generated images gallery.
- Fixed a bug where Layla Cloud image generation fails to read the generated images after a few attempts.
- Fixed a few crashing issues when switching between Live2D characters quickly.
- Fixed a bug where default Automatic1111 json body stopped working.
- Fixed a bug where setting a max response length does not trim incomplete sentences.
- Fixed a bug where previewing native TTS using Google/iOS was not using the selected voice properly in other languages.
- Fixed a bug where sometimes asterisks were not being skipped in TTS when skip asterisk setting on.
- Fixed a bug where you cannot use a generated image as the input of the next img2img generation.
Layla x Local-Dream!
Layla x Local-Dream! https://www.layla-network.ai/post/layla-v6-10-0-has-been-published
The creator of Local-Dream (https://github.com/xororz/local-dream) has graciously allowed direct integration with Layla.
- Image generation can now be done via the new Local Dream mini-app. This adds a host of new features to Layla's image generation process:
- SDXL NPU support.
- Upscaling + different resolution selection (from 256 -> 1024)
- Image to image generation.
- Different sampler support (Euler, LCM, and DPM)
- Denoising config.
- Dreams now allow using a custom Inference Engine to generate new messages.
- Improved custom prompt handling when using a custom Inference Engine to process LLM summaries.
- Improved Agents mini-app screen with search and filtering.
- Added a File Manager mini-app so you can see exactly what's using storage in Layla.
- Added confirmation popup when deleting mini-apps.
- Added an extra setting that allows automatic use of your character's profile image with img2img generation when sending you images in chat.
- Fixed a crash in LTM ingestion process with prompts that are too long.
- Fixed a bug where it's hard to scroll if you have multiple saved prompts.
- Fixed a bug where selecting from multiple inference engines when starting a chat does not scroll.
- Fixed a bug where LiteRT models were not showing thinking process.
Introducing "Companion Mode" in Layla. (Android only.)
Introducing "Companion Mode" in Layla. (Android only.)
Read more here: https://www.layla-network.ai/post/layla-v6-9-0-has-been-published
- Layla's companion mode allows you to select a character to run alongside you as you use your phone! Your companion will appear on top of other apps.
- Your companion runs your selected LLM and voice option, with a vision model that is capable of looking at your phone screen. They can even play games with you!
- Tapping your companion will trigger a screenshot, where your companion will then listen to your voice and look at the screenshot automatically to give you tips, ideas, or simply chit-chat! Your companion supports animated Live2D models, making them react to what they see.
- Layla's Companion Mode supports all Inference Settings in Layla: GGUF, LiteRT, connecting to your PC, or Cloud.
- Demo: watch Layla playing Plague Inc. with you: https://youtu.be/gK9sdkjU-vM
- Advanced settings allows adjusting LLM and VLM hardware acceleration separately, allowing you to put the LLM on the CPU while the VLM on the GPU.
- Allow adjusting running model on CPU/GPU for LiteRT models.
- Context length settings are now applied to LiteRT models.
- Supports Gemma-4 tool calling agents in both GGUF and LiteRT.
- Allow setting additional llama.cpp command-line arguments in Advanced Settings.
- Additional llama.cpp cmd arguments can be saved as Advanced Settings presets.
- Fixed a few "Failed to eval" bugs on long context usage.
- Fixed a few UI issues that happens when the screen is horizontal.
- Fixed a bug where viewing a memory that is too long will soft-lock you without any way to exit the chat.
- Fixed a bug where OpenCL crashes on older GPUs.
- LTM now removes think tags from Gemma 4 if used as summariser.
This update bring Gemma-4 support, improves agent functionality, with a new Choose Your Own Adventure agent, and...
This update bring Gemma-4 support, improves agent functionality, with a new Choose Your Own Adventure agent, and improves chatting UI with more customisations. Read more here: https://www.layla-network.ai/post/layla-v6-8-0-has-been-published
- Supports the latest Gemma-4 model family (GGUFs and LiteRT-LM)
- Added new Choose Your Own Adventure characters.
- Added German translations.
- Added ability to send progress messages during python execution, you can view this progress in the Agent execution card to give you more insight into what the agent is doing.
- Layla Python can now generate files.
- New Python agents will automatically install needed dependencies when added for the first time.
- You can configure long-term memory to use a saved inference engine as summariser.
- Added UI setting to display chats in a bubble-like interface instead (similar to Facebook Messenger or WhatApp etc.)
- Preset characters are now organised into folders.
- Fixed a bug where sometimes character expressions will not be saved.
- Fixed a bug where some memories cannot be processed.
- Fixed a bug where CPU SD models cannot be run with LLM on the NPU.
- Fixed a bug where attached inference settings are not applied when using a custom character as your phone's default assistant.
This update fleshes out Layla's Agent Framework.
This update fleshes out Layla's Agent Framework.
- Agent System Prompts.
- Agents now have their own system prompts. These will update your character's system prompt when they are activated. For example, you don't need to update both your system prompt and the agent for status cards; the system prompt to guide the model into generating status cards will automatically be injected when you attach the corresponding agent.
- Introducing Short-term Memory.
- Why do we need a short-term memory when there is the LLM context itself? The main difference lies in the fact that Layla's short-term memory is structured. Both the LLM context and LTM is unstructured, and lookup is done purely via heuristics (via the Attention layer in LLMs, and embeddings in LTM). The short-term memory is structured a layer that lives between the LLM's context and Layla's LTM.
- Information flows from the LLM's context -> short-term memory -> LTM (as demonstrated by one of the example agents below). This can be updated by both the LLM and Layla's Agents. Because short-term memory is structured, you can use code and logic to manipulate the contents, not just via prompts. This opens up the possibility to program information flow. An example could be writing an agent that updates a character's Hit-Points or inventory based on triggering conditions.
- This layer will also serve as a sort of RAM for all agents. Any agent can read/write into Layla's short-term memory, allowing a horizontal layer of communication between agents (work-in-progress). This is now possible because...
- Layla supports Python.
- Added setting to disable system prompt in Inference Settings -> Custom Prompts: this will disable all character and user information. This is useful if you want to chat with the "raw model" without any instructions. Note this also stops most apps in Layla from working properly.
- Redesigned the model selection area, now with the ability to export your imported models.
- Claude API now allows sending images in chat.
- Added Japanese and Vietnamese translations.
- Added ability to organise characters into folders.
- You can tap on "memories" in your character card to immediately view their memories in the LTM app.
- Added small UI sounds to notify you when Layla is listening or finished speaking.
- Allow adding custom JSON fields to OpenAI endpoints.
- Fixed a bug where long prompts will crash LTM ingestion process with a custom LLM.
- Fixed bug where Layla as default assistant on phone does not work with multiple inference engines attached to the character.
- Fixed bug where setting default character was not working with using Layla as default assistant on phone.
- Fixed a bug where custom SD models cannot be imported via the "Already downloaded" link.
- Read more here: https://www.layla-network.ai/post/layla-v6-7-0-has-been-published
- Our agent-showcase channel contains new agents that work with Layla's Python support!
- Layla v6.7.0 changelogs: Python support, new agentic framework upgrades, various UI improvements and bug fixes.
This update adds support for LiteRT-LM models, improves model loading times, and contains numerous improvements and...
This update adds support for LiteRT-LM models, improves model loading times, and contains numerous improvements and bug fixes. Read more here: https://www.layla-network.ai/post/layla-v6-6-0-has-been-published
- Adds experimental support for LiteRT-LM models.
- LiteRT-LM (.literlm) models can be added in the Inference Settings the same way as you would a GGUF model, Layla will use the LiteRT-LM engine automatically.
- LiteRT support is experimental, most features are not implemented yet.
- Improved download stability of new model downloads in the welcome screen and stable diffusion model downloads.
- Added TTS setting for a global default voice.
- Added the ability to given your character/LLM a custom instruction during chatting in Chat Actions.
- Added UI setting to control text area expansion in voice chat.
- <think> content is removed when reloading the chat.
- TTS now skips speaking unpronounceable characters (such as ASCII art, tables etc.)
- Fixed a bug where Dreams keep scheduling messages for the same character more than once.
- Fixed a bug where different model options in Inference Settings cannot be scrolled sometimes.
- Fixed a bug where "Speak Responses" chat actions were not working.
- Fixed a bug where searching chat history does not search through all chat histories.
- Fixed a bug where GPT-OSS model was not generating responses.
- Fixed a bug where if you cancel voice chat before it initialises the UI dark cover is gone.
- Fixed a bug where edited memories are still using old embeddings, causing recalled content to be different to queried content.
- Fixed a bug where reloading chat sometimes do not read from cached session and instead loads everything from scratch.
- Fixed prompt format for GPT-OSS.
- Fixed a bug where Layla as your phone's default assistant was not working properly.
This update redesigns the character selection screen, giving you detailed information on their statistics such as...
This update redesigns the character selection screen, giving you detailed information on their statistics such as number of messages and past memories. Read more here: https://www.layla-network.ai/post/layla-v6-5-0-has-been-published
- Redesigned character selection page to give you detailed information of your character before starting the chat.
- Added ability to save custom sampler settings in the Advanced Settings screen.
- Added ability to attach saved sampler presets as part of your custom Inference Engine.
- Supports sending images to OpenAI compatible API inference settings.
- Fixed a bug where some app data was not deleted when removing a mini-app from Layla.
- Fixed a bug where LTM sometimes crashes when handling very very long messages.
- Fixed a bug where images generated during chat were not being backed-up.
- Fixed a bug where tags from TavernPNG were not being imported.
- Fixed a bug where backup cannot copy references with spaces or special characters in their name.
- Fixed a bug where automatically converting JPGs when saving as TavernPNG was giving an error.
- Fixed a bug where personas were not being saved as part of your Inference Engine.
- Fixed a bug where upscaling anime images caused a crash.
- Fixed a bug where chat with LTM crashes on the latest Snapdragon Gen5 CPU.
- Changelog for Layla v6.5.0: new character selection design, ability to save custom sampler settings, and general bug fixes and improvements.
This update redesigns the long-term memory graph. It is now in 3D! If you'd like to learn more about the...
This update redesigns the long-term memory graph. It is now in 3D! If you'd like to learn more about the visualisation of your memories, read here: https://www.layla-network.ai/post/layla-v6-4-0-has-been-published
- Visualisation of temporal relationships in Layla's long-term memory.
- Significant improvements to the performance of chatting, especially for very long conversations spanning hundreds/thousands of messages.
- Long-term memory will now recall earlier memories from the same conversation that has been cycled out through context shifting.
- Fixed a few conditions in triggering new search agent.
- Supports the new Ministral-3 family [THINK] tags.
- Fixed a bug where invalid filename in References mini-app causes Layla to crash.
- Fixed a bug where characters list will sometimes show nothing when updating a filter.
- Fixed a bug where character list resets to the first page after edit/saving a character.
- Fixed a bug where regenerate messages leaves a large amount of empty space when the length of the regenerated message varies largely.
- Fixed a bug where automatically searching for models in OpenAI compatible API endpoints was not working.
- Fixed a bug where Layla does not stop the server from generating messages even after pressing "stop".
- Fixed a bug where voice chat does not speak aloud responses.
- Fixed a bug where audio is only played through speakers and not AirPods.
Layla supports Qwen3-VL family models: images are automatically down-sized before sending to the LLM to keep...
- Layla supports Qwen3-VL family models: images are automatically down-sized before sending to the LLM to keep processing times reasonable on mobile, you can control how much down-sizing occurs in Advanced Settings.
- Layla supports animated PNGs and GIFs for your character cards.
- Added setting in Reference documents to control the threshold of how long a document needs to be before it is summarised instead of injected wholly into the context.
- Prompt templates are now case insensitive.
- Display character full name in chat instead of just the first name.
- Automatically convert JPGs to PNGs when exporting your characters as a TavernPNG.
- You can share text with Layla to start a chat with that text.
- Fixed a bug where the document name was not detected correctly when adding a document from chat.
- Fixed a bug where character sorting changes back to the default after exiting chat.
- Fixed a bug where tool calls are included in long-term memory messages.
- Fixed a bug where Layla crashes on importing a huge TavernPNG card.
- Read more here: https://www.layla-network.ai/post/layla-v6-3-0-has-been-published
Roleplay app gives much more flexibility - choose who talks next, speak as any of the participants.
- Roleplay app gives much more flexibility - choose who talks next, speak as any of the participants.
- Roleplay now supports speaking out loud responses with the individual participants voice.
- Roleplay app now supports using AI to generate your scenarios and images.
- View publicly submitted data for Mobile Model Benchmarks.
- You can submit your own benchmarks by running models through the Model Benchmark mini-app.
- In the future, this data will be used to recommend the best models for your phone.
- (All data submission is voluntary, NOT automatically collected).
- Added ability to search for supported models in your OpenAI API endpoint inference settings.
- Diagnostics now show a breakdown of storage used by Layla.
- Added ability to stop LTM from auto-collecting memories, allowing more fine-grained control.
- Fixed a few bugs in Dreams scheduling and improved display of already scheduled Dreams.
- Fixed a bug where context prefix inference settings were not getting saved.
- Fixed a bug where character search in character selection switching back to the first page.
- Fixed a bug where character expressions cannot be set.
- Fixed a bug where enter key does not clear the sent message.
- Read more here: https://www.layla-network.ai/post/layla-v6-2-0-has-been-published
- Demo: https://www.youtube.com/watch?v=H9wHBHbZZ0U
- Demo of group chat feature in Layla.
Layla can now replace Google Gemini as your default assistant! You can do numerous tasks on your phone such as...
Layla can now replace Google Gemini as your default assistant! You can do numerous tasks on your phone such as drafting emails, creating reminders etc. without opening the main app by simply holding the power button.
Configure this by going to Android System Settings -> Digital Assistant -> choose Layla.
- Layla can replace Google Gemini as your phone's default assistant! (Go to Settings -> Default Digital Assistant -> configure it to use Layla)
- Press and hold the power button to bring up Layla!
- Combined with the new agentic framework in v6, Layla can complete various tasks without opening the main app.
- As I expand the agentic framework, Layla will be able to complete more tasks on your phone.
- Your Inference Settings are brought over to Layla Assistant; all models supported by Layla will work, whether it be local GGUF, connected to your PC, or Layla Cloud.
- Personalised assistant (hint: set a default character in Layla, the assistant will bring up "Ask [your character]" instead of Layla)
- Layla now supports the stable diffusion model format by local-dream app (you can use both Layla and local-dream converted models interchangeably)
- Improved set alarm agent.
- Fixed small bugs in the calendar agent.
- Layla agents support RSS feeds.
- You can long-press on images generated during chat to save them.
- You can preview images generated in stable diffusion mini-app full-screen.
- MobileVLM is updated to v2, improving image recognition capabilities.
- Fixed a bug where idle time settings were not being saved in Voice Chat.
- Fixed a bug where agents stop on the first LLM tool call.
- Fixed a few bugs related to edit and regen messages.
- Read more here: https://www.layla-network.ai/post/layla-v6-1-0-has-been-published
Layla 6 supports Agents!
Layla 6 supports Agents!
- Agents are fully configurable, self-contained workflows that can be triggered and executed by Layla during chats. Their functionalities can range from injecting simple context, to fully automated workflows such as reading a webpage and saving it as a reference document. Each agent is also modular: you can attach them to your own characters; they will still complete their respective tasks, but under the personality of your attached character.
- An agent is comprised of two parts: triggers and tools. An agent can have multiple triggers and call multiple tools. You are free to mix and match them in any combination when creating your own agents!
- This version also includes a brand new long-term memory implementation that is much faster than before!
- You can read more about the updates here: https://www.layla-network.ai/post/layla-v6-0-0-has-been-published (this article includes links to understand how Agents work and getting started on creating your own Agents)
- This updates also includes numerous bug fixes and improvements, in particular performance improvements to chatting and UI consistency.
- I will be adding more tools and expanding the capabilities of agents over the next updates and would love to hear your feedback and ideas on new agents/tools to add to Layla!
Read more about the updates here
Read more about the updates here: https://www.layla-network.ai/post/layla-v5-6-0-has-been-published
- Layla supports Live2D models! Take your characters to the next level with customisable animated models! (Android only)
- Smooth animations.
- Responds to your phone (shake your phone to see how your character reacts!)
- Links expressions with Layla's sentiment system to change animations based on your conversations.
- Lip sync with output audio.
- Added a few user requested QNN Stable Diffusion models.
- Added support for Kroko ASR (speech-to-text model, this is now the default STT model in Layla)
- Ability to configure KV Cache quantisation settings in Advanced Settings.
- Set "show nsfw" option now hides any characters tagged with nsfw.
- Added French, Portuguese, and Spanish UI translations.
- Fixed bug where the correct number of characters in Roleplay was not showing.
- Fixed bug where chat images cannot be expanded when tapped on.
- Fixed bug where slightly moving your finger during hold-to-speak mode stops Layla from listening.
- Fixed bug where sometimes phone native voices are not being shown or not working in text-to-speech.
Read more about the updates here
Read more about the updates here: https://www.layla-network.ai/post/layla-5-5-0-has-been-published
- App UI has been translated to multiple languages! Currently supports: English, Chinese, Korean, Polish, Russian.
- Offline Transcription: transcribe audio such as lectures, audiobooks, or videos in real time.
- All preset characters now have animated backgrounds!
- Allow configuring minimum silence length and other settings for speech-to-text.
- Made stop generating slightly more responsive.
- Improved UI performance when generating responses, should reduce speed inconsistencies where response times sometimes start slow and speeds up at the end of the message.
- Added TopK in advanced settings.
- Fixed bug where DuckDuckGo news search was not working.
- Fixed bug where multiple regens make the model increasing incoherent.
- Fixed bug where text-to-speech sometimes pauses unnecessarily after the first sentence.
- Delayed on Google Play.
- Layla 5.5.0 has been published!
Read more about the updates here
Read more about the updates here: https://www.layla-network.ai/post/layla-v5-4-0-has-been-published
- Gemma3 family models support!
- NPU image generation now supports the latest Snapdragon Elite CPU.
- Added new NPU stable diffusion models.
- Added new CPU stable diffusion models.
- Layla supports displaying LaTeX formulae in chat messages.
- Re-compiled several stable diffusion models with their custom VAE decoders, which will produce better image quality.
- Backup data will now backup your custom prompts.
- Turning nsfw option off in settings will also hide nsfw characters in your character selection screen.
- Added ability to copy generated code in chat messages.
- Added support for MultiTTS voices.
- Added ability to continue generating a message from the LLM.
- Fixed a bug where phones without access to Google cannot access Layla Cloud subscriptions.
- Fixed bug where GPU inference was not working with Background Inference.
Details on the updates are here
Details on the updates are here: https://www.layla-network.ai/post/layla-v5-3-0-has-been-published
- Added a new animated character.
- Added ability to duplicate characters (including preset characters and animated characters, which will allow you to use the animation for your own characters)
- Added XTC samplers in Advanced Settings.
- Added support for Phi-4 family models.
- Added feature to generate image prompt using your selected LLM in SD mini-app.
- Improved animated character lips movement; lips will start moving when sound is played instead of on LLM output.
- Fixed bug where saving backup data directly to file system does not work for very large backups.
- Fixed bug where character info is not updated after editing until a reload.
- Fixed bug where deleting a character does not refresh the characters list.
- Fixed bug where TTS voices were not playing in phone silent mode.
- Fixed bug where TTS voices do not play after stopping it once.
- Fixed bug in image generation resulting in "workspace alignment" issues.
- Fixed bug where character tags are not showing in the character creator.
- Fixed bug where deleted characters were still sending proactive messages.
Read in detail the new features here
Read in detail the new features here: https://www.layla-network.ai/post/layla-v5-2-0-has-been-published
- Added SD Dreamshaper, Chilloutmix NPU model.
- Added SD Hassaku CPU model.
- Added iniVerse Mix Layla Cloud model.
- Added Cohere API support.
- Added support for importing webpages as Lorebooks.
- Added support for Kokoro TTS model.
- Optimised Character Selection page: added sorting options, optimised loading a large number of characters.
- Fixed bug where {{instruction}} tags were being force injected for some characters.
Read in detail the new features here
Read in detail the new features here: https://www.layla-network.ai/post/layla-v5-1-0-has-been-published
- Layla supports GPU inference! Supports Vulkan and OpenCL backends.
- Layla supports NPU inference for Stable Diffusion!
- Layla supports reasoning models Deepseek R1 family!
- Redesigned Lorebook UI to handle lots of documents better.
- Improved UI of model import.
- Added timestamps to Long-term Memory table view.
- Backup data now directly allows you to choose a folder to save to.
- Added a Download Manager app to give the ability to view/cancel download tasks in case they get stuck.
- Added Whisper Base and Whisper Base (English) models.
- Added ability to configure the language Whisper models listen in.
- Q4_0 quants are now automatically converted on the fly to support your current architecture.
- Allows saving TavernPNG directly to file system in character creation.
- Supports sherpa-onnx TTS engine APK.
- Redesigned chat message quick actions (copy button is now always visible, tap & hold the message to bring up a context menu with more action)
- Create Character (AI) image generation now uses the default negative prompt configured in the SD mini-app.
- Fixed bug when importing chat history.
- Fixed bug in Layla Cloud when handling very long conversation histories.
- Fixed bug where an error in one memory will stop ingestion of all LTM memories.
- Fixed bug where too many quick actions take up all your screen in chat.
- Fixed bug where chat accent colour was not being applied to character responses.
- Fixed bug in default character image generation fallback phrase.
- We're excited to announce significant updates to Layla, bringing powerful new capabilities and improvements across the board. This release focuses on expanding hardware support, enhancing the user interface, and fixing several important issues to provide a more robust experience.Important change in this versionARM quants have now been consolidat...
Read in detail the new features here
Read in detail the new features here: https://www.layla-network.ai/post/layla-v5-0-0-has-been-published
- Layla supports local image generation via Stable Diffusion!
- Layla Cloud supports image generation.
- Characters can now generate images during chat!
- Model Benchmarks app - benchmark your local LLMs on your phone.
- AI Character Creation.
- Re-wrote Long-term memory with significant algorithmic improvements.
- Welcome Screen redesign.
- All mini-apps can now be downloaded in the background.
- Added configurable Lorebook injection prefix and suffix.
- Redesigned Personality Hub, Character Selection screens.
- Added setting to toggle display of NSFW content in Layla.
- Added ability to load GGUF models from external SD card.
- Added advanced setting "max response length".
- Added import/export chat history feature.
- Added UI settings for character text colour.
- Added ability to select multiple messages in chat.
- Selecting the "Default" preset now resets all important settings in the Advanced Settings to their defaults (not just sampler settings)
- Added setting to toggle whether to "tap and hold message to copy".
- Redesigned character tag filtering in Character Selection and Personality Hub.
- Added ability to share debug logs of individual chats.
- Added a special "Cloud" prompt format.
- Fixed bug where greeting was not being imported from TavernPNG.
- Fixed bug where sometimes cloud models do not respond with anything.
- Fixed bug where regen was removing an extra token, causing the regenerated response to be unpredictable.
- Fixed a bug where a chat was "locked" into tool calling mode after the first tool call.
- Layla v5 changelog - local image generation, AI character creation, long term memory overhaul, and more!
Tasker Integration! Layla will show up as a Tasker Plugin
- Tasker Integration! Layla will show up as a Tasker Plugin: https://www.layla-network.ai/post/layla-integrations-with-tasker
- Brave Search API integration (new web search provider)
- Support for OLMoE 1B models (https://huggingface.co/bartowski/OLMoE-1B-7B-0924-Instruct-GGUF)
- Added progress display when fixing ltm corruptions.
- Added estimated time remaining when running long tasks.
- Added option to skip asterisk content during voice chat TTS.
- Improved system prompt handling in cloud APIs to adhere to character better.
- Added ability to mass-import TavernPNG characters.
- Synced llama.cpp to support latest models such as Llama3.2.
- Allow selection of character greetings before starting the chat.
- Added "basic characters" roleplay mode, where you can add short descriptions without creating a separate character.
- Added support for SmartVoice TTS: https://poretsky.github.io/android/smartvoice/index.html
- Roleplay now supports Background Inference.
- Voice Chat app now allows you to configure different Whisper models for speech-to-text.
- Improved "stop generating" button to be able to stop even while processing input.
- Fixed bug when exporting tavern png for custom characters.
- Fixed bug where characters repeat the same sentences during voice chat.
- Fixed bug where characters start listening before finished speaking when using cloud APIs.
- Fixed bug where LTM LLM summary prompt settings were not updated.
- Fixed bug where context length setting was getting reset every time you switch models.
- Fixed bug where chat name was not being saved.
- Fixed bug where things get messed up if you exit a chat before waiting for it to load.
- Layla integrations with Tasker.
- Layla supports Tasker integrations and exposes tasks/events to be called from the Tasker application.
- Bartowski/OLMoE-1B-7B-0924-Instruct-GGUF · Hugging Face.
- Igor B. Poretsky.
- SmartVoice speech synthesizer.
- Multilingual speech synthesizer for Android with automatic language recognition and switching facility.
This update adds SillyTavern Lorebook support!
This update adds SillyTavern Lorebook support! https://www.layla-network.ai/post/layla-v4-16-0-has-been-published
- Layla now supports SillyTavern Lorebooks! You can import them as a document in the Layla Lorebook app.
- SillyTavern Lorebooks work differently than normal documents - instead of using similarity embeddings, SillyTavern Lorebooks are activated via keywords (for more info please read the SillyTavern lorebook documentation on their website)
- Layla's implementation retains this functionality, but adds similarity embeddings in addition to the regular SillyTavern activation logic.
- You can create your own lorebook in SillyTavern on the PC, or download lores other users shared here: https://chub.ai/search
- Added "Push to talk" mode.
- Llama.cpp updated to support Nvidia's distilled Llama3.1 models (https://huggingface.co/nvidia/Llama-3.1-Minitron-4B-Width-Base)
- Added setting to turn off the dark image overlay when showing NSFW images in the Personalities Hub.
- Roleplay app now supports attaching Lorebooks!
- Allow using camera to send images in chats.
- Allow adding Lorebooks to chat on the fly.
- Added ability to import character expressions as zip files.
- Added "keep scroll at bottom during chats" setting.
- Fixed bug where ePub files were not importing correctly with spaces in the document name.
- Fixed bug where long-term memories were not being deleted for a particular character.
- Fixed bug where a long first message does not scroll.
- Fixed bug where Knowledge Graph construction was not respecting the LTM settings.
- Fixed bug when selecting character expressions.
- Nvidia/Llama-3.1-Minitron-4B-Width-Base · Hugging Face.
This update brings support for the i8mm CPU feature. This makes LLMs run much much faster on latest CPUs! Note this...
This update brings support for the i8mm CPU feature. This makes LLMs run much much faster on latest CPUs! Note this requires a special quant called "048". You can read more about how to get them and if your phone supports them here: https://www.layla-network.ai/post/layla-supports-i8mm-hardware-for-running-llm-models
- Layla supports the new Q4_0_4_8 quant which offers increased performance on mobile devices that support i8mm.
- Added biometric authentication in settings (you can lock Layla with a PIN or biometrics)
- Improved performance of Chat page with very long conversation history.
- Added settings in Voice Chat to configure what language native speech-to-text listens for.
- Improved prompt for all Roleplay scenarios to adhere to character better.
- Added ability to choose from multiple engines when starting a chat if you have multiple engines attached to a single character.
- Allow attaching custom inference engines to more than one character.
- Added chat message action "TTS".
- Added translate text chat message action button.
- Phi3 mini models are now supported again.
- Fixed bug where exporting custom characters as TavernPNG was not working.
- App Store released delayed until next version.
- Layla supports i8mm hardware for running LLM models.
- What is i8mm and how to check if your phone supports it.
Details here
- Roleplay app has been updated to v2: introducing Group Chats!
- Create scenarios and add your characters as participants.
- Choose between different levels of interactivity:
- None: don't interact with the group chat, your characters will continue the conversation/roleplay on their own, while you sit back and watch them.
- User: join the roleplay as a regular participant, you will be prompted for your input at regular intervals.
- Narrator: join the roleplay as an omniscient narrator, your input will be wrapped with special [scene] tags in order to advance the scenario, you can choose between allowing characters to continue the scene or narrating changes after every message.
- Character selection screen grid view now shows recently chatted characters.
- Executorch now supports Long-term memory knowledge graph construction, character impressions, Dream, and Layla's Toolkit.
- Synced llama.cpp with latest, merges in changes to support Llama3.1 and Gemma models.
- Lorebooks are now properly backed-up.
- Added viewable chat information for each chat.
- Added Apple OpenELM and Gemma2 2B prompt settings.
- Offline Translator text-to-speech now speaks the translated text instead of the original English LLM output.
- Fixed bug during TavernPNG import with special characters.
- Fixed bug where editing personas were overwriting the default persona.
- Fixed bug where chat history was not opening in character selection screen.
- Fixed bug where the UI gets confused when swiping in memory cluster view and opens the chat history.
- Fixed bug where quick actions were not showing.
- Fixed bug with Lorebook app processing very large PDFs.
- Fixed bug where small app icon layout was bugged.
Details here
- Layla now supports Personas!
- Personas are descriptions of "yourself" that characters know before starting a conversation.
- Create/edit different personas when chatting with your characters, easily switch between them in Inference Settings.
- Integrated with Custom Engines, attach personas to characters to automatically activate them when starting a chat.
- Attached personas will also affect their Long-Term Memories and Dreams.
- Download individual characters as TavernPNG to share privately with your friends.
- Improved chat history display, it is now always accessible by swiping from the left similar to other apps like ChatGPT.
- Fixed bug where some icons are showing up as question marks.
- Fixed bug where Dream messages were disappearing after reading.
This update contains a significant UX improvement when switching models
This update contains a significant UX improvement when switching models: https://www.layla-network.ai/post/layla-v4-12-0-has-been-published
- Layla now supports native multi-modal models!
- Use your custom MMProj image embeddings with models that recognise images.
- Layla's core inference engine has been re-written.
- Raw Model Instructions, OpenAI API, Claude API apps have been removed, new app: Inference Engine combines all model related settings into one app.
- New offline translation models: "English <-> Russian", "English -> Polish".
- With the Inference Engine app, you can configure:
- Models: can be a local file, OpenAI endpoint, Claude API, or Layla Cloud.
- Vision encoders: optionally attach vision encoders to your model: mmproj files for supported GGUF models, MobileVLM for RAG based image recognition for Executorch/Cloud models.
- Prompt settings: fully customisable prompts just like before.
- You can save any combination of the above as a "Custom Engine". This engine can be attached to a character, so you can configure different models/prompts for different characters. This includes being able to configure some characters with local LLMs, while other characters uses Layla Cloud or OpenAI API. Switching is seamless.
- Added table view for Long-term memories.
- Added ability to configure number of injected prompts for LTM and Lorebook.
- Adjusted scroll issues on some phones in the App page and Offline Translator language picker.
The highlight for this update is Layla's Toolkit upgrade to v2
The highlight for this update is Layla's Toolkit upgrade to v2: https://www.layla-network.ai/post/layla-v4-11-0-has-been-published
- Offline translator now supports "English <-> Spanish" and "English <-> Chinese".
- ElevenLabs text-to-speech support!
- Layla's Toolkit has been upgraded to v2! much more accurate tool determination, much faster tool determination (5s -> 0.5s)
- Device calendar integration for scheduling events (uses your default calendar set in your system settings, can be Google Calendar, Samsung Calendar, Apple Calendar etc.)
- Todo app is integrated into Layla's toolkit as a calendar provider.
- Added setting to stop running background tasks when phone is idle.
- Create multiple alternative greetings for a character (greetings are now optional, you can choose to not have a greeting for a character)
- Added character names in the LTM app when viewing memories.
- Added progress bar in Android persistent notification when executing tasks using background inference.
- Added ability to reset character impressions.
- Fixed invalid Executorch models download link.
- Fixed bug where preset characters were not hidden in the chat history screen.
- Fixed bug where roleplay images were not saving.
- Fixed bug where some messages were deleted after editing or regenerating messages.
- Fixed bug where Dream messages revert back to the character's original greeting.
- Fixed bug where Layla Cloud sometimes does not register purchased subscriptions.
This version contains the Lorebook feature, where you can attach multiple documents to your characters to act as...
This version contains the Lorebook feature, where you can attach multiple documents to your characters to act as background information during your chats: https://www.layla-network.ai/post/layla-v4-10-0-has-been-published
- PDF Digest has been redesigned -> it is now called Lorebook!
- Process documents and attach them to characters to provide background information during your chats.
- Added support for the open source search engine: Alexandria.
- Improved performance of task manager when showing a large amount of pending tasks.
- Added configuration to off-load embedding service to the CPU to save RAM.
- Voice chat supports third-party TTS libraries RHVoice and AcapelaTTS.
- Added support for native phone speech-to-text.
- Added ability to copy translated text in Offline Translator.
- Added config to set minimum number of characters to generate before starting TTS.
- Added button to "hold to continue speaking" in voice chat mode.
- Improved UI of Offline Translator (show downloaded models at the top)
- Allow deletion of downloaded translator models.
- Offline Translator now remembers your last selected language.
- Added ability to configure whether to use mmap in advanced settings.
- Fixed bug where tasks are not deleted after deleting a Lorebook PDF.
- Fixed bug where sometimes new messages are duplicated as they are received.
- Fixed bug where re-entering voice chat mode from within the same chat does not start listening.
This version contains numerous new features including a completely redesigned voice chat!
This version contains numerous new features including a completely redesigned voice chat!
- Offline translator: similar to Google Translate but works completely offline!
- Long-term memory update: your characters will now form an impression of you after chatting for a while, you can view this in your Profile page.
- LTM updated to v2! Improves conversation summary processing times significantly (2min -> 5s)
- Redesigned Hands-free Mode: now moved to a dedicated app called "Voice Chat".
- Added high quality TTS models: Piper High.
- Supports the latest Rope scaling implementation of Phi-3 from llama.cpp.
- Layla now supports custom animated character backgrounds and expressions!
- Added ability to view injected memories for your character during chatting.
- Layla will now self-diagnose and fix memory corruptions in Diagnostics mode.
- Added actions to delete all unread messages or mark all messages as read.
- Improved LTM ingestion for Llama3 models (pre-built Layla models still work much better because they are trained to take into account of memories)
- Added ability to exclude certain characters from LTM ingestion.
- Added ability to choose BART or LLM ingestion for long-term memory.
- Executorch models now supports editing messages!
- Executorch now supports background inference!
- Added setting to toggle whether or not your characters recall their last memory when starting a new chat.
- Clear chat history now shows a warning before proceeding.
- Added ability edit memories during chatting.
- Added ability to search voices when selecting.
- Added better support for Native TTS voices.
- Numerous other improvements and bug fixes listed here: https://www.layla-network.ai/post/layla-v4-9-0-has-been-published
Read the detailed updates here
Read the detailed updates here: https://www.layla-network.ai/post/layla-v4-8-0-has-been-published
- Character Expressions app: choose an image for 28 unique emotions your character feels during chatting!
- Quick Actions app: allows you to customise instructions for Siri shortcut and Android OS integration.
- Executorch now uses RMI app to manage models.
- Streamlined task manager task management.
- Task manager now schedules tasks much more consistently.
- Task manager now supports OpenAI API backends for long term memory ingestion, dream scheduling, and pdf digest.
- Improved handling of stop button during chat.
- Advanced settings default preset now resets everything.
- Fixed bug where task manage queued tasks were not displaying properly.
- Fixed bug where background inference was not running properly.
Read more about all the different features here
Read more about all the different features here: https://www.layla-network.ai/post/layla-v4-7-0-has-been-published
- Deeper integration into Android OS! You can select any text in your phone and ask Layla to remember it, explain it, summarise it, or remind you of it!
- Siri shortcut support! Send any text to Layla and chat with her about it, more are coming soon!
- Removed need of floating widget in Background Inference.
- Background Inference now uses Android Foreground tasks, providing consistent notifications.
- Welcome screen now uses your default character images and name.
- Added more configurable options for repetition control.
- Added ability to share conversation history.
- Added ability to edit & regen last message from loaded conversations.
- Added ability to delete chat messages.
- Added pagination and sorting in Personalities Hub.
- Redesigned RMI app to streamline model management.
- Character import now supports TavernPNG v1 cards.
- App screen now blocks you from installing incompatible apps (such as OpenAI + Claude)
- Executorch now supports emojis.
- Fixed bug where Preset boxes were not displaying correctly in Context Extend app.
- Fixed bug where LTM is creating duplicate memories of the same conversation.
- Fixed bug where voice configs were not showing if you only have GB voices installed.
- Fixed bug where todos from imported data are not marked as completed.
- Fixed bug where LTM was not rescheduling deleted tasks.
Read more about all the updates here
Read more about all the updates here: https://www.layla-network.ai/post/layla-v4-6-0-has-been-published
- Phi-3 support!
- PDF Digest app: chat with your PDF documents!
- Executorch Inference Engine - highly experimental app for very high-end phones (improves Llama3 response times by 5x on high end phones such as S23+ Ultra)
- Added ability to create/save custom prompts in RMI app.
- Prompts in RMI app now represents the raw prompts given to the LLM, so you have full control over the prompt format.
- LTM now injects the latest memory into new chats with a character.
- Implementation of DRY sampler: contrasted with MiroStat, which uses a completely different sampling methodology in an effort to reduce repetitions, this sampler targets reducing repetition. Don't use with MiroStat. (Configure DRY in Advanced settings)
- Added new section in settings page to deal with model repetitions, allowing you to control parameters such as repetition penalty, range, DRY, etc.
- Unread messages are not read after opening them.
- Fixed bug in backup feature.
- Fixed bug where the delete button in chat history is hard to click.
- Fixed bug where chat name was not updated after edit.
- Fixed bug "failed to eval" when carrying out long conversations.
- Phi3 GGUFs tested to work with Layla (remember to set the prompt format in the RMI app!): https://huggingface.co/l3utterfly/phi-3-mini-128k-instruct-gguf
- L3utterfly/phi-3-mini-128k-instruct-gguf · Hugging Face.
Llama3 family models support!
- Llama3 family models support!
- Experimental flash attention support (turn on in Advanced settings), ~10% improvement in loading and response times.
- Added ability to re-order apps.
- Added UI settings to show small app icons.
- Keep screen awake during chatting, PDF processing, and other long-running tasks.
- Diagnostics feature now fixes common errors encountered during LTM ingestion automatically.
- Long running processes such as LTM, Tasks Manager, and PDF Digest now has a special interface which automatically dims your screen for processing during the night.
- OpenAI and Claude APIs now support long term memory, PDF digest, and the new edit & regen method.
- Added a fallback phrase for Todo app.
- If you have background inference enabled, task manager tasks will execute in the background as a widget when you click "execute all".
- Added button in the Background Inference app that kills off any floating widgets to handle rare cases were a widget gets stuck and would not exit until your phone is restarted.
- Added ability to execute a group of tasks with one button.
- Fixed bug where chat history sometimes duplicates your messages.
- Fixed bug where toolkit was not working iOS.
- Fixed bug where not all events were being displayed in the Todo app.
- Fixed bug where edited messages were not saving.
- Fixed bug where TTS was crashing the app.
Dream: characters have thoughts outside of chatting, and will initiate conversations with you.
- Dream: characters have thoughts outside of chatting, and will initiate conversations with you.
- Added tag filter in Personalities Hub.
- Adjusted long-term memory build-up for assistant characters to focus on the user.
- Chat screen automatically saves any unsent messages for the character.
- Added ability set character as default (will start chat when clicking the butterfly logo)
- Redesigned "edit & regen" UI.
- Adjusted default character selection background.
- Added ability to edit any past message in conversation.
- Redesigned internal workings of To-do app to provide more intelligent task scheduling and reminders.
- All characters will now use Layla's toolkit if you specifically type the fallback phrase, only characters with toolkit enabled will try to auto-determine which tool to use.
- Fixed bug where menu was appearing on top of hands-free mode.
- Fixed bug where the Writer cannot use Layla's toolbox.
- Fixed bug where to-do notifications were not directing to the to-do app.
- This version contains the new app: Dream!Dream is an app that allows your custom characters to send you messages proactively! They will try to strike up random topics with you based on their personality and/or any long-term memories they have with you. You can configure the rate they message you in the Dream app screen:Additionally, this update...
Added Layla Tiny base model for low-end phones.
- Added Layla Tiny base model for low-end phones.
- Redesigned To-do app to be a list.
- Redesigned the Personality Hub.
- Added warning where filters are applied so you don't think your characters have disappeared.
- Chat history now supports unread messages.
- Re-structured logic for executing background tasks, so hopefully the OS can pick up the task automatically to execute in the background better.
- Added a button in task manager to execute all queued task now.
- Grouped tasks in task manager.
- Chat history screen now uses "infinite loading" (like Facebook feeds) for lots of messages.
- Fixed bug where character selection filters are not being saved in some cases.
- Fixed bug where deleting chat history does not work.
- Fixed bug where roleplay header was not displaying in some cases.
- Fixed bug where character greeting was not applied to OpenAI and Claude APIs.
- Fixed bug where backing up data was stuck with no effect.
- Fixed bug where long-term memory shows duplicate edges.
Full release notes are here
Full release notes are here: https://www.layla-network.ai/post/layla-v4-2-0-has-been-released
- Task Manager app - manage your long term memory ingestion tasks.
- MLC Inference - highly experimental app that replaces Layla's inference engine with MLC.
- Improved ingestion time for long-term memory.
- Long-term memory ingestion will now run continuously during the night when your phone is charging.
- Improved model download to show better progress so it doesn't look like it's hanging.
- Adjusted small grid view to not crop the images too much.
- Added filter by tags in character selection screen.
- Added setting to configure "enter key sends message instead of newline".
- Added ability to configure "n batch" in advanced settings.
- Fixed bug where the download screen would appear briefly before entering the main screen.
- Fixed bug where character selection header does not return to normal when exiting from the create/edit character screen.
- Fixed bug where roleplay templates cannot be edited.
- Fixed bug where some text were getting cutoff on the right side of apps.
- Fixed bug where proactive chat notifications were not opening the chat screen properly.
- Fixed bug where MobileVLM crashes with non-jpg or non-png files.
- Fixed bug where Wallpaper app was conflicting with Character Selection Layout settings.
- Fixed bug where "n threads" in advanced setting was not being applied correctly.
- Fixed bug where context length setting was limited for custom models.
- Layla v4.2.0 has been released!
- This update continues on the path outlined in the last dev diary: Dream - https://www.layla-network.ai/post/dream-layla-s-dev-diary-march-17th-2024In particular there are two features we wish to highlight:Task ManagerThis new app gives you more visibility on what Long-term memory is doing behind the scenes. With this app, you are able to see wha...
This update brings various quality-of-life improvements that are requested by the community.
This update brings various quality-of-life improvements that are requested by the community.
In particular, you can now add/edit/delete long-term memories for individual characters! A new "small grid" display option is added for the character selection screen, with the ability to drag & drop to re-order characters.
- Added ability to add/edit/delete long-term memory clusters.
- Added ability sort characters in character selection screen by long pressing + drag & drop.
- Added "small grid" layout for character selection screen.
- Allow closing character card via back button in Personality Hub.
- Fixed bug where "select character" button in message history page does not work.
- Fixed some app crashes by implementing better exception handling in native code.
This update adds Claude API support! With Claude 3's release, the general consensus is that it is better than...
This update adds Claude API support! With Claude 3's release, the general consensus is that it is better than ChatGPT overall! (See the Chatbot Arena Human Eval).
Additionally, this update focuses on continuing to provide stability updates and resolves some of the issues with image recognition reported here.
- New app: Claude API support (I've heard it's great for roleplay 😉 😉)
- Backup & restore now includes long-term memories.
- OpenAI and Claude APIs now supports long-term memory and image recognition.
- Added warning message on change name with Long Term Memory.
- Hitting "back" on your phone asks if you want to exit the app on Welcome screen.
- Fixed memory leak when using MobileVLM.
- Streamlined MobileVLM image recognition code in an effort to reduce crashes on edge cases.
This is a stability update that addresses numerous issues reported after the v4 update.
This is a stability update that addresses numerous issues reported after the v4 update.
- Improved zero-shot question answers in web search (including answering real-time questions such as stock prices)
- Show size of conversation history next to each entry.
- Fixed bug for file size display.
- Fixed bug where TTS was speaking the "end token".
- Fixed bug where TTS with specific characters crashes the app.
- Fixed bug where model was loading infinitely when Background Inference is enabled without required permissions.
- In particular it fixes a few crashes with specific app configurations. Apologies for the instability during a major version upgrade. If you find any issues, please let me know and I will fix them as soon as possible!
Read more about the update here
Read more about the update here: https://www.layla-network.ai/post/layla-v4-0-0-has-been-published
- Feature details have been extensively posted previously, so the post below are some tips for using the latest version:
- Retrained Base Models.
- Layla Lite and Layla Full has been completely retrained. From v4 onwards, Layla supports the ChatML format for base models by default. The recommended model is now always Layla Lite. I have been testing and using Lite almost exclusively. Yes it has some hallucinations, but nothing that can't be fixed with a quick edit/regen. The increase in speed is very much worth it.
- Context Extension.
- Use with caution, on my S23, it goes up to x4. Any higher sees performance degradation.
- Long-term Memory.
- The Ingestion process happens in the background. However, mobile OS limits the performance of background tasks, so ingestion may be slow. A good way to handle this is leaving Layla in the foreground when your phone is charging (leave Layla on the Welcome screen), this increases the speed of ingestion significantly. Alternatively, you can manually ingest by clicking the button in the LTM app, perhaps before you go to sleep or when you don't need your phone for a while.
In this update, we added a hilarious new app: Anarchy Chess!
In this update, we added a hilarious new app: Anarchy Chess!
We all know LLMs are terrible at chess, you can now experience this first hand by playing with all your favourite characters.
But the main strength of the LLM lies in the fact that it's a language model. Which means it will trash-talk and gaslight you throughout the game 💀.
- New app: Anarchy Chess - play a game of chess with your favourite characters!
- Displays multiple characters at once when Layla is typing instead of one character at a time (this slightly improves typing speed)
- Skip splash screen if model is already downloaded.
- Fixed bug where character backgrounds are being overwritten when downloading from the Personality Hub.
- Fixed bug where profile image was not saving.
- P.S.: I am aware of an issue on android that crashes when TTS is enabled. The fix has been released. Please make sure you've updated to v3.13.1.
- Can you STOP spawning queens FOR 5 SECONDS?!?! Playing a chess game...
Read more on our blog here
Read more on our blog here: https://www.layla-network.ai/post/layla-v3-12-0-has-been-published
- Significantly improved character loading and long message processing times by integrating OpenBLAS, a CPU parallel processing library.
- New app: Piper Text to Voice - English (Great Britain) accent!
- Advanced settings screen now saves and validates settings on exit, instead of as you type.
- Added advanced settings in character creation.
- In this update, we have significantly improved character loading times by integrating a parallel processing library OpenBLAS. OpenBLAS is an optimized BLAS (Basic Linear Algebra Subprograms) library based on GotoBLAS2 1.13 BSD version. See: https://github.com/OpenMathLib/OpenBLAS Since neural networks are ALL linear algebra, by utilising optimis...
Read the details on our blog
Read the details on our blog: https://www.layla-network.ai/post/layla-v3-11-0-has-been-published
Demo here: https://youtu.be/FWpl_W7TOhA
- New app: Background Inference (Android Only)
- Android only: Chat logic has been rewritten to run as a floating widget! This means you can freely switch apps while waiting for Layla to respond! (This is not a default feature; it is enabled by installing the "Background Inference" app in Layla due to some manual configuration that's required)
- Note: doing other tasks while you wait for the response will slow down the response time, as there is no getting around the fact that Layla's thinking is very resource intensive.
- Clicking outside the chat menu closes the menu properly.
- Fixed bug where selecting an image before the model has loaded reverts Layla back to the start screen.
- Fixed bug where "New Chat From Here" does not work when creating from a new chat.
- Fixed bug where new messages are being shown as from the user after force quitting.
- Skipped publish on the App Store (this version's features do not support iOS, improvements and bugfixes will be bundled in the next release)
- In this update, we a huge quality-of-life upgrade for Android users. Using Android's floating widget feature, we are able to move Layla's processing to a background thread. This means you are free to switch apps while waiting for Layla to think! This feature also opens up some unique possibilities of long running resource intensive processes tha...
Read the details on our blog
Read the details on our blog: https://www.layla-network.ai/post/layla-v3-10-0-has-been-published
Demo here: https://youtu.be/nijMsEsAjfs
- New app: MobileVLM - Layla can now recognise images on device, no more sending your images to the server!
- If Layla exits unexpectedly during chat, your last message is automatically set into the text box when you load the conversation history.
- Backend stability and performance improvements.
- Added ability to duplicate chat sessions.
- Fixed bug where if you quit chat unexpectedly, the last message you sent is displayed, causing you to be unable to edit or regenerate last response.
- Fixed bug where context shifting was not adjusting the session properly as conversation approaches the model's context length.
- P.S.: a quick note on the development of long term memory feature - general.
- In this update, we introduce an important feature: local image recognition! Previously, image recognition was done on the server, it has now been moved to your phone! Using the latest MobileVLM model, which stands for Mobile Vision Language Model, you are able to chat with your images. Characters in Layla also understands images and can chat wit...
Read the details on our blog
Read the details on our blog: https://www.layla-network.ai/post/layla-v3-9-0-has-been-published
- New app: OpenAI API app allows Layla to connect to any OpenAI-compatible endpoint!
- Adjusted all preset characters to use TavernPNG format.
- New characters are uploaded to the Personality Hub!
- Video tutorials on how to configure API endpoints for.
- ChatGPT: https://www.youtube.com/watch?v=pn8aMPU7HR4&ab_channel=Layla
- Oobabooga TextGen GUI: https://www.youtube.com/watch?v=np6FcGJvS0E&ab_channel=Layla
- Theorectically, any program which exposes an OpenAI-compatible endpoint should work, such as KoboldCPP, SillyTavern, etc. If you have any issues, please leave a message in the technical channel, I am happy to help!
- In this update, we have added a new app called "OpenAI API". This app allows changing the inference engine behind Layla to any OpenAI-compatible API. You can configure your own API endpoint, API key, and OpenAI model to use. Once this app is installed, instead of running the inference calculations locally on your device, it will call the configu...
- How to connect Layla to ChatGPT.
- How to connect to Oobabooga Textgen GUI from your phone with Layla.
Read the details on our blog
Read the details on our blog: https://www.layla-network.ai/post/layla-v3-8-0-has-been-published
- Added new app: Layla's Toolkit! - this app contains various functions that Layla can use.
- Added new app: DuckDuckGo Web Search.
- Moved MiroStat sampler settings to its own app.
- Added TTS mode: user types, Layla speaks.
- Redesigned the "Welcome" page.
- Changed Layla's chat background to be more neutral.
- Redesigned character loading screen to not hide the chat during load.
- Added ability to quickly switch models in the RMI app.
- Moved Hands Free Mode to the dropdown menu, voice mode next to chat input now activates TTS Mode.
- Wallpaper app now affects the character selection screen and welcome screen.
- Added ability to close voice selection popup without selecting a voice.
- General backend stability and performance improvements.
- Added a small percentage showing Layla's thinking progress.
- Adjusted prompt cache loading strategy to be less sensitive to small changes in prompt.
- Layla will now always use the Precise setting as she is just an AI assistant (other characters respect your settings)
- Fixed bug where Hands Free Mode is not working for iPhones.
- Fixed bug where voice selection input is not shown unless you have VITS VCTK enabled.
- Fixed bug where some operation combos will cause chat history to load very slowly even if you haven't cleared the cache.
- We are excited to announce a new version of Layla! This is a pretty hefty update and includes a lot of things we've been working hard on. In this post, let's go over some highlights. Introducing Layla's Toolkit The highlight of this update is the new app Layla's Toolkit. With this, we introduce a unified framework for Layla to use tools and func...
Read in detail about this patch on our blog
Read in detail about this patch on our blog: https://wix.to/kmjRmij
Demo video of KIP with text to speech (sound on): https://youtube.com/shorts/1S0KiqflBtA?feature=share
- New app: Piper text-to-speech! - medium quality, larger size, 10+ voices.
- New app: VCTK text-to-speech! - lower quality, smaller size, lots of voices (100+!)
- More voice packs are coming soon!
- Goldfish model has been re-trained with significant improvements in cognitive ability.
- Support tablet landscape mode.
- Greeting text in create character screen now supports multi-line.
- Allow creation of characters with empty personality.
- Character background is now set automatically when importing from TavernPNG.
- Added ability to cancel downloading models upon app start.
- Fixed bug where custom character greetings were applied twice (this should improve your character's cognition significantly at the start of the convo)
- Fixed bug where edit chat names was not working on iOS.
- Note: To-do app has been disabled temporarily due to an incompatibility with TTS.
- This version adds over 100+ different local voices for all your characters! Local voices for characters are generated by using the latest text-to-speech technology: VITS. VITS (Variational Inference with adversarial learning for end-to-end Text-to-Speech) represents a cutting-edge approach in the field of speech synthesis, combining several adva...
Read in detail about all the updates in this patch and how they tie into the overall roadmap on our blog
Read in detail about all the updates in this patch and how they tie into the overall roadmap on our blog: https://www.layla-network.ai/post/layla-v3-6-0-has-been-published
Demo video of Annie (new character): https://youtube.com/shorts/qci-66GJeC4?si=XGpv9PfjgWQNTY8T
- Meet the new character Annie!
- 2.5D animated background.
- Sentiment analysis enables her to react in response to your messages.
- New wallpaper app to customise your chat backgrounds.
- Now allows you to choose a base model instead of automatically downloading the last model setting after you reset base models.
- Added ability to regenerate Layla's last response.
- Fixed a bug where character backgrounds were not being shown in grid view.
- Fixed bug where warning on reload chat page was showing after editing chat message.
- This version takes the first step towards animated character models! Meet Annie! A miniature artist with a heart full of dreams. Her room is always cluttered with sketches, pencils, and colouring books. She loves the wild outdoors where she roams with Jasper, her loyal dog. Annie features a custom-made animated background created with a mixture...
- Meet Annie - animated character with in-build sentiment analysis.
Highlights for this update are improvements to the RMI app (Raw Model Instructions)!
Highlights for this update are improvements to the RMI app (Raw Model Instructions)!
You are now able to adjust prompt templates used for your custom models. ChatML and Vicuna are given as examples, but the configuration should really support any format.
Also in the RMI app, you can manage storage and delete the custom models copied to your app's cache. This should help those of you who like to rapidly switch between lots of different models.
In answer to all your feedback regarding proactive messaging, "Proactive Chat" has been moved to its own app. This gives you fine-grained control over how you want it to work. You can select the frequency as usual, in addition, you can also choose which characters are allowed to proactively chat with you. Removing the "Proactive Chat" app stops all future notifications from being sent to you; adding it back re-enables the notifications. I hope this addresses the majority of use-cases!
Note: In this update proactive chat has been turned off for everyone. If you would like to re-enable it, please add the app to Layla in the App Screen.
Lastly, this version continues to lay the groundwork for long term memory, including a locally implemented vector database and embedding model with sub-1-second performance.
- Added "Proactive Chat" app.
- Added "show performance counters" in advanced settings: this will show CPU, RAM usage and other system information during chatting.
- Added ability to manage model storage space in Raw Model Instructions app.
- Added details screen for each app, with a longer description on what the app does and instructions on how to use.
- Added ability define custom prompt formats in RMI.
- Fixed bug in advanced settings where you were unable to turn off dynamic temperature sampling.
- Fixed bug where Precise settings were not turning off dynamic temperature.
- Fixed bug where scheduled proactive messages were not cancelled even though proactive messaging is turned off.
As the number of features ("apps") in Layla increases, a new way to manage them is introduced! "Features" in Layla...
As the number of features ("apps") in Layla increases, a new way to manage them is introduced! "Features" in Layla now works like a "mini app store": you can select which apps you want to include and remove them if you don't need them.
Adding an app enables the corresponding feature, and removing it disables it. For example, if you add the "To-do" app, Layla will pick up scheduling instructions, and once you remove the app, Layla will ignore your scheduling instructions. The "mini app store" replaces the individual "enable/disable" toggle of apps.
Note: with this update, all your apps are disabled by default, you can add them back using the browse app screen. There's no additional download for each app.
- Added ability to add/remove apps.
- Laid out foundation framework for "long term memory".
- Guard against accidental back button press or swipe in chat screen (confirmation popup won't trigger if you manually click the back arrow)
- Added the ability to edit chat names so you can find them better in the chat history screen.
- Chat input now supports multiple lines.
- Streamlined proactive message logic: character message notifications should be scheduled much more intuitively now.
Changed the default sampling method for Layla to Dynamic Temperature Sampling. See the paper
Changed the default sampling method for Layla to Dynamic Temperature Sampling. See the paper: https://arxiv.org/abs/2309.02772
This sampling method is reported to provide more organic responses from models by various users in the local AI community. It has already been implemented in other popular frontends such as Oobabooga, KoboldCpp, etc., and is scheduled to be added to SillyTavern as well, so it as seen a fair number of real-world use-cases.
You can turn this feature off by setting "Dynamic Temperature Range" setting in the Advanced Settings page to 0.
- Added "dynamic temperature sampling".
- Added "min P" config to be used with dynamic temperature sampling.
- Added better estimation of character loading times.
- You are now able to adjust the frequency of proactive messaging.
- Fixed a bug where character backgrounds were not shown correctly in the character selection screen on wider phones.
- Fixed a bug where your astrologer in My Horoscopes app sometimes outputs the {{user}} template.
- Fixed a bug where second time loading was not reading the cache in some cases so was loading slowly.
Added a new app: "My Horoscopes".
Added a new app: "My Horoscopes".
This app gives you the latest horoscope information for each day. You can then chat with Layla about your horoscopes. Layla will give you tailored advice taking into consideration your horoscope for the day. You can also optionally enable notifications each morning to send you your horoscope information. As always, please don't take advice generated by an AI seriously.
Your personal information (such as your zodiac sign or birthday) is not leaked. This app works by downloading information for ALL horoscopes first, then you can choose which one is yours locally on your device.
Horoscopes may seem a frivolous little feature at first glance, but this is a proof-of-concept for two important operations:
Real-time information retrieval using the internet.
Augmented generation using relevant info (also known as RAG)
This is laying the foundation for a very important update later: the ability for Layla to search and verify information on the internet.
- "My Horoscopes" app.
- Automatically determine the optimal number of model layers to offload to GPU (slight performance boost)
- Fixes bug where chat does not load until you clear your cache.
This update contains various quality-of-life improvements.
This update contains various quality-of-life improvements.
- Added "favourite character" feature.
- You can now filter by "favourite" characters (your filter is saved)
- Added quick buttons to edit and delete character in selection screen.
- Use scrollbar instead of cutting off character description.
- Added email support icon in settings page.
- Updated backend with stability improvements.
- Fixed bug where filtering was not working in grid view.
- Fixed bug where preset characters are not showing in grid view.
- Fixed bug where opening proactive chat notifications causes chat to not work.
Layla now supports any GGUF models on the internet!
Layla now supports any GGUF models on the internet!
You can download models from your own sources and load them into Layla. A new app "Raw Model Instructions" is added where you can configure the settings for your own GGUF models. While the inbuilt "Layla" models are still recommended for general use, any sufficiently intelligent LLM model should be able to pick-up the prompt style of Layla relatively quickly after a few conversations and may achieve better results in specific areas such as storytelling, roleplay, etc. This is an advanced feature; use at your own discretion. Loading a model too large for your phone memory will crash without warning.
- Added ability to choose any GGUF model as base.
- Added new model: Goldfish. A tiny model about as intelligent as the world's smartest goldfish. This model runs on devices with less than 4GB of memory and is mainly here to get Apple off my back about not supporting the ancient devices they use to test with during the review process.
- You can now select your base model on first-time launch of Layla.
- Model selection screen now gives you visual cues recommending the best model for your device.
- Backend stability improvements.
- To-do app is now enabled by default and limited to "Layla" character only.
- Fixed bug where auto-update feature was not pushing out updates to everyone properly.
- Fixed bug where "hands-free" mode was not working altogether.
Fixed a bug where text-to-speech in Hands Free Mode was not working at all.
- Fixed a bug where text-to-speech in Hands Free Mode was not working at all.
Upload custom characters to the Personalities Hub!
- Upload custom characters to the Personalities Hub!
- Backend stability improvements.
- Fixed bug where long character descriptions were covering the "Start Chat" button.
- Fixed bug where filters in character selection screen were not working.
- Fixed crash sometimes when app returns from the background.
Added a new app: Personalities Hub.
- Added a new app: Personalities Hub.
- You can download latest characters shared by other users in the Hub.
- Uploading/sharing your own characters feature coming soon!
- Like/rating characters feature coming soon!
- Todo app now only triggers when talking to Layla to avoid false-positives during conversation (characters are not designed to provide assistant functionalities anyway)
- Fixed bug where very long conversation histories will not load.
- Fixed bug where Todo app was stopping Layla from responding.
Fixed bug where Todo app was stopping Layla from responding (temporarily disabled Todo app until more permanent fix)
- Fixed bug where Todo app was stopping Layla from responding (temporarily disabled Todo app until more permanent fix)
This update comes with completely new base models for both the Lite, Q2, and Full versions! After updating, the app...
This update comes with completely new base models for both the Lite, Q2, and Full versions! After updating, the app will start a new 2-4GB download! (Legacy models will not be deleted. You can switch back to them in the settings page if you wish.)
- Complete newly trained models for Lite, Q2, and Full.
- Drastically improved reasoning, reduced hallucination for all models.
- Roughly speaking, the new Lite version achieves the intelligence of the previous Q2, new Q2 is comparable to the old Full, and the new Full model is in a league of its own.
- This intelligence upgrade is done without any penalty to performance (i.e. Lite is as fast as before, AND much smarter as well)
- Upgraded backend to latest version, supporting the latest version of Android.
- Adjusted general character prompts to get a better experience when using the "mirostat" sampler.
- Fixed bug where some advanced settings were showing "NaN".
- Fixed bug where copying messages was not working.
- Fixed bug where proactive messages were not being scheduled and increased frequency of proactive messaging.
SillyTavern Character PNG support.
- SillyTavern Character PNG support.
- Custom characters now understand Tavern Character V2 format (e.g. placeholders such as {{char}} and {{user}})
- Character creation now supports "scenario" instructions to better direct the conversation.
- Ability to import Tavern PNGs from sites such as chub.ai etc.
- General backend stability improvements.
Added ability to edit Layla's last response.
- Added ability to edit Layla's last response.
- Added advanced setting: "mirostat": this enables the experimental mirostat sampler.
- Added advanced setting: "n threads": sets the number of CPU threads used by Layla.
- Set Layla's inference thread to always be "high priority" to nudge the OS into scheduling Layla's processes on the high-performance CPUs.
- Updated backend with stability and bug fixes.
- Fixed bug where chat history scroll bar was being cutoff on some phone sizes.
- Fixed bug where custom greetings are not shown in custom characters.
- Fixed bug in custom characters where they will refer themselves as an "AI" sometimes.
- Fixed bug where roleplay sessions are not being shown in chat history.
- Fixed bug where context length was set too high after switching models.
Fixed issue where roleplays were not loading correctly after last update.
- Fixed issue where roleplays were not loading correctly after last update.
Added "Q2" intermediary base model. This is a 7B model, smarter than the Lite version, and faster than the Full...
- Added "Q2" intermediary base model. This is a 7B model, smarter than the Lite version, and faster than the Full version. For use in devices with moderately powered hardware. You can switch to this model in the settings page.
- Added "context length", "n batch", and "LORAs" advanced settings.
- Backend stability improvements.
- Fixed a bug where clearing your conversation history does not delete temporary image files in chat history.
- "hands free mode" now continues to listen indefinitely until you cancel instead of stopping if you haven't started speaking in 2 seconds.
- Removed "New Chat" button since it's not compatible with numerous latest features such as "roleplay" etc.
Added the ability for Layla to look at and understand images! (Important: this feature is non-local)
- Added the ability for Layla to look at and understand images! (Important: this feature is non-local)
- Improved formatting for user info so Layla understands it better.
- Added "Layla is thinking..." message during chat.
- Fixed bug where roleplay scenes were always using the template description for "you".
- Demo: https://www.youtube.com/shorts/vfuZaTRO1iE
Autonomous Roleplay: set "non-interactive" setting to true when creating a new Roleplay scene to allow Layla to...
- Autonomous Roleplay: set "non-interactive" setting to true when creating a new Roleplay scene to allow Layla to continue the story without your input.
- Improved Layla processing times on Apple devices by ~10%.
- Minor backend stability and performance improvements.
- Added file size display for all caches and models in settings page.
- Added ability to copy message to clipboard by long-pressing on any message.
- Increased roleplay scene description limit to 1000 characters.
- Roleplay scenes and characters are now included in backup export and import.
Added advanced settings page, allows users to set custom temperature and top-p.
- Added advanced settings page, allows users to set custom temperature and top-p.
- Updated backend with stability fixes.
- Improved performance of first time loading a character by ~10%.
- Fixes issue where images are scrambled around on some Android phones after update.
- Fixed various issues during text-to-voice such as Layla stopping speech on new lines, comma's, etc.
- Fixed issue with To-do app not working.
Fixed bug where preset characters were not hiding in scroll view.
- Fixed bug where preset characters were not hiding in scroll view.
- Fixed bug where creating a new roleplay scene from scratch doesn't work.
- Fixed bug where create character plus button is not showing in grid layout.
Added Roleplay app; you can now roleplay different scenes with multiple characters; start from a template, or create...
- Added Roleplay app; you can now roleplay different scenes with multiple characters; start from a template, or create your own scene from scratch!
- Demo: https://youtu.be/YdnZY2UCPMc
Characters will now (with your permission) start chats with you proactively by sending you notifications.
- Characters will now (with your permission) start chats with you proactively by sending you notifications.
- Fixed an issue where characters was ending the conversation without replying anything.
Added "Report a bug" menu item in the chat page (this will share a technical log of your latest chat for bug...
- Added "Report a bug" menu item in the chat page (this will share a technical log of your latest chat for bug reporting. Please send it to info@layla-network.ai and I will look into it!)
- Updated design for base model selection in settings page.
- Updated backend to prepare for multilingual base models.
- Fixed bug where some characters were crashing on phones with only 8GB of memory.
Significant improvement for response times on iOS (now utilises Apple Metal GPUs)
- Significant improvement for response times on iOS (now utilises Apple Metal GPUs)
- Moderate improvement for response times on Android by offloading data to GPUs if available.
- Fixed character search feature.
- Fixed issue where keyboard was sometimes hiding conversation when typing.
Fine-tuned all personalities to make them stay in character better, with better proactive conversation leading.
- Fine-tuned all personalities to make them stay in character better, with better proactive conversation leading.
- Added setting to enable/disable To-do app.
- Improved scheduler to remove more false positives in scheduling instruction detection.
- Improved speed of loading characters (after the first time) significantly.
- Do not automatically delete old model when switching between Layla Full and Lite.
- Fixed an issue where Layla will respond extremely slowly after several pages of conversation as the context is exhausted and Layla re-computes the conversation.
- Added "diagnostics" function in settings page to quickly identify issues with Layla.
- Upgraded various dependencies to improve performance and stability.
- Fixed a bug where Layla will lose the conversation after loading from chat history.
- Fixed bug where custom character chat histories were not being shown.
- Fixed bug where Layla Lite was crashing during chat.
Upgraded backend to latest version with numerous stability and speed improvements.
- Upgraded backend to latest version with numerous stability and speed improvements.
- Added ability to change the layout of character selection screen; choose from Grid and Scroll views.
- Added option to hide default characters.
- Added a new video for the splash screen.
- Fixed bug where it was under-detecting device RAM and switching to Layla Lite unnecessarily.
- Fixed a bug where the "Use Full Layla anyway" button was not showing.
- Fixed a bug where Layla will not stop talking after exiting hands-free mode.
Significantly improved the speed of Layla in scheduling tasks and reminders. Demo
- Significantly improved the speed of Layla in scheduling tasks and reminders. Demo: https://youtube.com/shorts/F07Qwkx5o3s?feature=share
- Layla Feature Showcase - scheduling instructions.
Added "hands-free" mode. You can talk to Layla like a normal conversation with completely voice. Layla will respond...
- Added "hands-free" mode. You can talk to Layla like a normal conversation with completely voice. Layla will respond with voice as well.
- Significantly improved Layla Lite's cognitive abilities.
Better formatting for messages and response: actions are now italic, code is now formatted properly, lists are now...
- Better formatting for messages and response: actions are now italic, code is now formatted properly, lists are now formatted properly.
- Display current version number in settings page.
- Added upgrade notice popup when app first starts.
- Fixed a bug where user created characters were not responding.