Will they add Vocal Modes to other voices?

claire · 2022 年 5 月 29 日午後 5:51

In order to train the AI to sing in different ways, it’s very likely they need to tell the AI which learning materials (ie the recorded samples it analyzes) are in which style, so that the AI can build associations.

In other words, by default the AI’s learning algorithm simply analyzes songs, but it has no concept of “soft”, “whisper”, “belting”, etc. Vocal modes are most likely created by providing the AI additional context for the things it has learned so it can observe a new type of pattern.

(please note this is speculation based on what I know of machine learning in general, and the actuality may be different since Dreamtonics is quite secretive about their processes)

The exact modes for each voice and the words used to describe them would likely be dependent on:

The recorded samples available - how many different songs did they get the voice provider to record, and how much stylistic variety is there within that set (ie, “in which ways can the learning materials be categorized?”)
The voice character/avatar/mascot - what type of voice is the product marketed as, what kind of focus does the company want for their specific character, and which modes/styles support that image

shiwei_migi · 2022 年 5 月 29 日午後 7:06

Going by what you’re saying with the second point there, you can very clearly see this with ANRI. It seems the image in the tweet hints to what she is getting: Emotive, Chill, Light, and the fourth seems to be “Charm” as written on her top. Very matching in her character if this is true.

shiwei_migi · 2022 年 6 月 26 日午後 1:52

Not too much news from today’s TOKYO6 livestream about the Vocal Modes, but they’ve mentioned that Karin and Rikka are progressing. Chifuyu will also have Vocal Mode (5 variations) added in and they hope to have it completed by the time she releases (October), but I won’t further share much more about it since she has her own dedicated threads and they’re aiming to have it done for her final release (Synthesizer V AI Hanakuma Chifuyu (花隈千冬): New Japanese Female Voice from TOKYO6 ENTERTAINMENT and Synthesizer V「花隈千冬」即将众筹). If for whatever reason she ends up NOT getting them within her first release version, then I’ll mention her more here.

Source: 第7回TOKYO6公式生放送【花隈千冬クラウドファンディングを番組中にスタート！】 - YouTube

shiwei_migi · 2022 年 7 月 2 日午前 7:40

Another SOLARIA Vocal Mode demo

“Erode” uses Airy, Power, and Soft

She is getting 7 in total: Soft, Solid, Airy, Power, Clear, Passionate, and Light

Her Vocal Mode update is expected to come within the next 2 weeks. (Source: https://twitter.com/eclipsedsounds/status/1543101202748936193)

shroomy-p · 2022 年 7 月 2 日午後 3:34

solaria’s power vocal mode ATE

claire · 2022 年 7 月 6 日午後 2:24

Solaria’s vocal modes are now available:

twitter.com

Eclipsed Sounds

@eclipsedsounds

SOLARIA’s vocal modes update is now available in her ver. 103 update! To update SOLARIA, navigate to the License and Updates panel on the right-hand toolbar, click "Check for Updates", select SOLARIA, and click Update! Check out this thread for a description of each mode. twitter.com/eclipsedsounds…

Eclipsed Sounds @eclipsedsounds

"Erode" by @unit0music - a new progressive rock track created to showcase 3 of Synthesizer V AI SOLARIA's 7 upcoming vocal modes: Soft, Airy, & Power! 🌌 Listen to the full short version here: https://t.co/pNtuD9BLp8 https://t.co/dAYdzXHzus

2:17 PM - 6 Jul 2022 14 11

From the thread:

Mode 1: Clear
Clear provides a bright voice that helps SOLARIA pierce through the mix of any song to be heard clearly! When mixed with other modes or her default voice, it can be used to naturally raise the brightness of her tone without relying on effects in mixing.

Mode 2: Soft
Soft gives SOLARIA a gentle and mellow tone, stripping out the natural power in her voice to give the user discrete control over her expression, with added breathiness and darkness to the voice.

Mode 3: Airy
A whispery mode that provides a unique tone. While similar to Soft, it removes the support from SOLARIA’s voice, leaving only a light and gentle quality. This can create unique effects as backing vocals, or add a new layer of expression for more atmospheric vocals.

Mode 4: Power
Power brings out SOLARIA’s belting ability directly, lending a strong, emotional voice to the user even more easily even just after install.

Mode 5: Passionate
Passionate is the closest to SOLARIA’s default tone, and it is based on the most emotional lines of each of the songs in SOLARIA’s data, increasing her natural emotional expression.

Mode 6: Solid
This mode packs energy into every syllable, helping to bring out a different side of SOLARIA’s strong tones. When mixed with other modes or her default voice, it can add a base of support to add punch to the vocals.

Mode 7: Light
A mode based only on the upper part of her range, typically described as “head voice,” light can provide a unique tone across the range, with bright high notes and more solid low notes. It is best used for songs in a higher register.

synthvoice · 2022 年 7 月 6 日午後 6:22

Is anyone having problems with the volume in certain vocal modes? Airy and Power sound good, but I can barely hear the voice sometimes.

Run_djp_Run · 2022 年 7 月 7 日午前 10:32

Yes, me too, I don’t understand why

synthvoice · 2022 年 7 月 7 日午後 6:55

Some people have complained on Twitter.

synthvoice · 2022 年 7 月 7 日午後 7:01

Seems like many people are having the same issue…

shiwei_migi · 2022 年 7 月 12 日午後 3:48

SOLARIA v104 is here! This should correct the sound quality issues and improve the tone consistency. There will still be some improvements to be made in the future, but at least we got an update rather quickly.

Run_djp_Run · 2022 年 7 月 12 日午後 8:59

Just perfect

Only Cangqiong AI and Muxin AI are missing and I’ll be fine

claire · 2022 年 7 月 12 日午後 9:51

Do keep in mind that Cangqiong and Muxin are being repackaged as AI banks using their original Standard voices as a basis, rather than by recording new materials. There is no guarantee that this method will allow for vocal mode support.

While I certainly do hope the Quadimension AI ports turn out well, I’m hesitant to get my hopes up since they are not being made in the same manner as existing AI voices.

shiwei_migi · 2022 年 8 月 17 日午後 9:58

Eleanor Forte is expected to get Vocal Mode options in the future.

synthvoice · 2022 年 8 月 18 日午後 9:48

I hope they don’t do the same thing they did to Anri…her vocal modes all sound the same and have no impact on the voice…

shiwei_migi · 2022 年 11 月 10 日午前 9:27

Synthesizer V Studio got updated to 1.8.0b1. Along with the beta voice databases, Rikka ended up getting her Vocal Modes along with it: Kawaii, Soft, Pops, Emotional, and Ballade.

Eleanor ended up getting them too, but VOLOR announced that they still need further testing. We are also expecting demos for the Eleanor Vocal Modes

shiwei_migi · 2022 年 11 月 11 日午後 6:10

Eleanor Vocal Mode demos dropped! She’s getting 8 in total: “Solid”, “Tender”, “Bold”, “Melancholic”, “Powerful”, “Dark”, “Warm”, and “Clear”. The 108b1 installer that’s compatible with version 1.8.0b1 will be coming out early next week. .w./

shiwei_migi · 2022 年 11 月 14 日午前 10:18

Eleanor is here now!

shiwei_migi · 2023 年 4 月 1 日午後 4:55

Long time no see on this thread. We have a new update confirmed for one of the older AI voices who released pre-vocal mode. According to JUN’s newest demo, he has a mode known as “Tsubaki” which was noted to have data that was designed specifically to improve Japanese Cross-lingual Singing Synthesis.

ANRI was noted to have a similar update in the future. Figured to probably add this info here.

Explanation for the Vocal Mode name: