diff --git a/LICENSE.md b/LICENSE.md new file mode 100644 index 0000000..c0104ed --- /dev/null +++ b/LICENSE.md @@ -0,0 +1,21 @@ +ANTI-CAPITALIST SOFTWARE LICENSE (v 1.4) + +Copyright © 2022 Trysdyn Black + +This is anti-capitalist software, released for free use by individuals and organizations that do not operate by capitalist principles. + +Permission is hereby granted, free of charge, to any person or organization (the "User") obtaining a copy of this software and associated documentation files (the "Software"), to use, copy, modify, merge, distribute, and/or sell copies of the Software, subject to the following conditions: + +1. The above copyright notice and this permission notice shall be included in all copies or modified versions of the Software. + +2. The User is one of the following: +a. An individual person, laboring for themselves +b. A non-profit organization +c. An educational institution +d. An organization that seeks shared profit for all of its members, and allows non-members to set the cost of their labor + +3. If the User is an organization with owners, then all owners are workers and all workers are owners with equal equity and/or equal vote. + +4. If the User is an organization, then the User is not law enforcement or military, or working for or under either. + +THE SOFTWARE IS PROVIDED "AS IS", WITHOUT EXPRESS OR IMPLIED WARRANTY OF ANY KIND, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE. diff --git a/README.md b/README.md index cd5fba3..a9320c9 100644 --- a/README.md +++ b/README.md @@ -1,3 +1,89 @@ -# pyngtube +# Pyngtube +A simplistic Python application for two-state animation PNGtubing, using only flatfiles and images for configuration. -Simplistic flat-file based no-frills pngtuber application written in Python \ No newline at end of file + +## Why Another VTuber App? +Several applications exist in this space, but most of them are either non-free or complicated and use strange marshalled data formats while providing excessive, confusing features that are not necessary for a simple approach. Pyngtube aims to provide a simple set of features in an accessible format using only flat images and YAML files in a way that can be understood and put into use in minutes. + +Pyngtube targets a simple "two-state animation" approach. It doesn't do any kind of facial detection and instead monitors microphone amplitude. The application displays a "closed mouth" image when the microphone is quiet and an "open mouth" image when it's not. That's all there is to it. + +A couple of other fun features exist such as minor movement animations when the user is talking and randomized blinking, but these are wholly optional and default to off if not configured at all. They can be ignored if all you want is the core experience. + + +## Requirements and Installation +pyngtube is built for Python3. Python2 is long since EOLed; no support will be provided. Python 3.6 should be the minimum required version but is testing on 3.9 and 3.10. + +pyngtube requires tkinter, Pygame, PyYAML, and PyAudio. Fairly hefty requirements for what it does. You can get a simplified rundown in `requirements.txt`. + +Installation should be fairly simple. The most direct approach is to install the pre-requisites using Pip like so. + +``` +pip install -r requirements.txt +``` + +However good housekeeping requires creating a Python Virtualenv for the pre-reqs. Virtualenvs are a good thing to know if you use Python code in any capacity, but a short rundown of how to do this: + +### Linux + +``` +python -m venv pyngtube-env +source pyngtube-env/bin/activate +pip install -r requirements.txt +``` + +### Windows + +``` +python -m venv pyngtube-env +pyngtube-env\Scripts\activate +pip install -r requirements.txt +``` + +If you utilized a virtualenv, you will need to activate it before launching pyngtube to make sure you're using the environment properly. + +If you run into build errors at the `pip install` step you're likely on an OS that requires building python libs from scratch. Hopefully this means you're on Linux and have some inkling of how to proceed from here. It might just require installing your distro's build tools. + + +## Usage +Basic launching can be accomplished with `python main.pyw` in the pyngtube dir, or double-clicing the main.pyw file. This should launch a default experience with a very crappy hand-drawn default avatar. + +There is no user interface. You can scroll the mousewheel inside the window to change the microphone threshold and can right-click in the window to open a dialog to select a new profile to load. See Configuration below for avatar configuration specs. + + +## Configuration +The main configuration file is ptv.yaml, in the same directory as main.pyw. You may also manually provide a different config file location as an argument to main.pyw. + +This file is a standard yaml file of configuration directives. It is recommended that this main config file only include the bare necessities of configuring the program: microphone settings and window size if you need it. Then the avatar settings should be imported via a profile declared with the `profile` directive. + +A profile is a directory that consists of a second configuration yaml file, and all the images needed to make the avatar work. The `profile` directive should be simply the name of the directory and the yaml file must be named profile.yaml. Pyngtube will attempt to find this profile in the pyngtube installation directory. profile.yaml must declare the images to be used for the avatar. + + +## Troubleshooting +If the app crashes, closes suddenly, or doesn't open at all, the first step is to launch the app from a command line and see if any errors are printed. The only real cause for a launch failure should be a broken config file, which will print an error explaining the problem. + +Configuration errors are fatal (causing a forced exit) by design, including invalid keys in the configuration file. + +If the terminal outputs errors about your system not being built to support tcl/tk, tkinter, etc, you're likely missing a tcl/tk package. This should only happen under Linux; Windows installs of Python should always support tk. + + +## Known Issues +First and foremost, this is a personal tool I'm making available as an (arguably bad) example and starting point for someone else's personal tool. This means assumptions are made about my environment that may not work in your environment. Certain bugs I may not care about that you do. Here's a list of what I can think of that's a little "off" right now: + +* The application is not hardware accelerated, so CPU usage can get a little high (for my taste anyway, 3-5% on my dev box) if you're scaling a small image to a massive one or vise-versa. This is an easy fix with better buffer handling and is on The List +* Due to design praxis of not drawing anything over the capture region to "ruin the magic" of the avatar, there is no visible feedback on threshold changes. You have to trust mousewheeling in the window changes threshold +* For that matter, there's no way to change threshold if you're using a pointing device without a wheel short of editing the config file +- What little debugging and error raising there is prints to the console. This needs to be reworked to raise actual GUI errors +- pyngtube makes the awkward assumption that you have one audio input device. It should work fine even if you don't, but some configuration around this should be provided +- I don't like the dependency on tkinter because it's undisclosed and cannot be put into `requirements.txt` but the greater evil prior was a depdency on wxPython which has a massive build and install process. Most py installations come with tkinter bundled + + +## Contributing +My Gitea instance is closed to new registrations so contributions aren't really accepted at the moment. If you must submit a code change, let me know and I'll arrange something. + +Code is formatted with [python-black](https://github.com/psf/black) before commit. + + +## Why The Weird License? +The [ACSL](https://anticapitalist.software/) most closely aligns with how I want things I create to be used. In a better world I'd license things MIT or even some variant of CC-BY, but I've seen and experienced time and time again that fully open software tends to filter up, benefitting the largest corporations more than the common user. This results in megacorps benefitting while providing nothing in return and, in the worst cases, even demanding FOSS devs perform free labor for them to perform their security audits, investigate their change requests for their specific use cases, and the like. + +I hold no illusion that anything I make will ever be important, but I refuse to even put myself in the position where that could occur. Is the ACSL enforceable? That's a matter of test-by-trial. It says what I want to say with my work; that's what matters. diff --git a/config.py b/config.py new file mode 100644 index 0000000..585e048 --- /dev/null +++ b/config.py @@ -0,0 +1,306 @@ +import os +import sys +import yaml + + +class Config: + def __init__(self): + # Load initial config file + if len(sys.argv) > 1: + config_file = sys.argv[1] + else: + config_file = os.path.join(os.path.dirname(__file__), "ptv.yaml") + self.load_config(config_file) + + # Load the profile explicitly at the end of initiailization so that + # profile settings take precedence over global config settings + self.load_config(os.path.join(self.profile, "profile.yaml")) + + def load_config(self, filename): + with open(filename, "r") as infile: + self._image_closed = None + self._image_open = None + self._image_blink_closed = None + self._image_blink_open = None + + for k, v in yaml.load(infile, Loader=yaml.Loader).items(): + # This ugly thing checks if a config parameter named in the + # config file maps to a class property. If not, we throw an + # error and exit because the config file is broken. + # FIXME: Find a better way to do this + if not isinstance(getattr(type(self), k, None), property): + print(f"Invalid config paramter: {k}") + sys.exit(1) + setattr(self, k, v) + + @property + def profile(self): + try: + return self._profile + except AttributeError: + print("profile is a required parameter") + sys.exit(1) + + @profile.setter + def profile(self, profile): + # Check filename is a string + if not isinstance(profile, str): + print("Config paramter profile must be a valid filename") + sys.exit(1) + + # Look for file in working dir and script dir + targets = [os.path.join(os.path.dirname(__file__), profile), profile] + + for target in targets: + if os.path.exists(target): + if not os.path.exists(os.path.join(target, "profile.yaml")): + print(f"Profile path {profile} does not contain a profile.yaml file") + sys.exit(1) + self._profile = target + self.load_config(os.path.join(target, "profile.yaml")) + return + + # Can't find the profile file, fail + print(f"Cannot find file {profile}") + sys.exit(1) + + @property + def image_closed(self): + try: + return self._image_closed + except AttributeError: + print("image_closed is a required paramter") + sys.exit(1) + + @image_closed.setter + def image_closed(self, filename): + # Check filename is a string + if not isinstance(filename, str): + print("Config parameter image_closed must be a valid filename") + + # Look for the file in profile dir, script dir, and working dir + targets = [os.path.join(self.profile, filename), os.path.join(os.path.dirname(__file__), filename), filename] + + for target in targets: + if os.path.exists(target): + self._image_closed = target + return + + # Can't find the image, fail + print(f"Cannot find file {filename}") + sys.exit(1) + + @property + def image_open(self): + try: + return self._image_open + except AttributeError: + return None + + @image_open.setter + def image_open(self, filename): + # Check filename is a string + if not isinstance(filename, str): + print("Config parameter image_open must be a valid filename") + + # Look for the file in profile dir, script dir, and working dir + targets = [os.path.join(self.profile, filename), os.path.join(os.path.dirname(__file__), filename), filename] + + for target in targets: + if os.path.exists(target): + self._image_open = target + return + + # Can't find the image, fail + print(f"Cannot find file {filename}") + sys.exit(1) + + @property + def image_blink_open(self): + try: + return self._image_blink_open + except AttributeError: + return None + + @image_blink_open.setter + def image_blink_open(self, filename): + # Check filename is a string + if not isinstance(filename, str): + print("Config parameter image_blink_open must be a valid filename") + + # Look for the file in profile dir, script dir, and working dir + targets = [os.path.join(self.profile, filename), os.path.join(os.path.dirname(__file__), filename), filename] + + for target in targets: + if os.path.exists(target): + self._image_blink_open = target + return + + # Can't find the image, fail + print(f"Cannot find file {filename}") + sys.exit(1) + + @property + def image_blink_closed(self): + try: + return self._image_blink_closed + except AttributeError: + return None + + @image_blink_closed.setter + def image_blink_closed(self, filename): + # Check filename is a string + if not isinstance(filename, str): + print("Config parameter image_blink_closed must be a valid filename") + + # Look for the file in profile dir, script dir, and working dir + targets = [os.path.join(self.profile, filename), os.path.join(os.path.dirname(__file__), filename), filename] + + for target in targets: + if os.path.exists(target): + self._image_blink_closed = target + return + + # Can't find the image, fail + print(f"Cannot find file {filename}") + sys.exit(1) + + @property + def bg_color(self): + try: + return self._bg_color + except AttributeError: + return (0, 0, 0) + + @bg_color.setter + def bg_color(self, color): + if not isinstance(color, (list, tuple)) or len(color) != 3: + print("Config parameter bg_color must be a three-item list") + sys.exit(1) + self._bg_color = color + + @property + def audio_checks(self): + try: + return self._audio_checks + except AttributeError: + return 60 + + @audio_checks.setter + def audio_checks(self, checks): + if not isinstance(checks, int): + print("Config parameter audio_checks must be an integer") + sys.exit(1) + self._audio_checks = checks + + @property + def threshold(self): + try: + return self._threshold + except AttributeError: + return 6000 + + @threshold.setter + def threshold(self, threshold): + if not isinstance(threshold, int): + print("Config parameter threshold must be an integer") + sys.exit(1) + self._threshold = threshold + + @property + def smoothing(self): + try: + return self._smoothing + except AttributeError: + return 3 + + @smoothing.setter + def smoothing(self, smoothing): + if not isinstance(smoothing, int): + print("Config paramter smoothing must be an integer") + sys.exit(1) + self._smoothing = smoothing + + @property + def mic_rate(self): + try: + return self._mic_rate + except AttributeError: + return 44100 + + @mic_rate.setter + def mic_rate(self, rate): + if not isinstance(rate, int): + print("Config paramter mic_rate must be an integer") + sys.exit(1) + self._mic_rate = rate + + @property + def mic_stereo(self): + try: + return self._mic_stereo + except AttributeError: + return True + + @mic_stereo.setter + def mic_stereo(self, is_stereo): + if not isinstance(is_stereo, bool): + print("Config paramter mic_stereo must be a boolean") + sys.exit(1) + self._mic_stereo = True + + @property + def blink_frames(self): + try: + return self._blink_frames + except AttributeError: + return 0 + + @blink_frames.setter + def blink_frames(self, frames): + if not isinstance(frames, int): + print("Config parameter blink_frames must be an integer") + sys.exit(1) + self._blink_frames = frames + + @property + def blink_chance(self): + try: + return self._blink_chance + except AttributeError: + return 0 + + @blink_chance.setter + def blink_chance(self, chance): + if not isinstance(chance, float) and not isinstance(chance, int): + print("Config parameter blink_chance must be an integer or float") + sys.exit(1) + self._blink_chance = chance + + @property + def shake_delay(self): + try: + return self._shake_delay + except AttributeError: + return 0 + + @shake_delay.setter + def shake_delay(self, delay): + if not isinstance(delay, int): + print("Config parameter shake_delay must be an integer") + sys.exit(1) + self._shake_delay = delay + + @property + def shake_intensity(self): + try: + return self._shake_intensity + except AttributeError: + return 0 + + @shake_intensity.setter + def shake_intensity(self, intensity): + if not isinstance(intensity, int): + print("Config parameter shake_intensity must be an integer") + sys.exit(1) + self._shake_intensity = intensity diff --git a/default/default_blink_closed.png b/default/default_blink_closed.png new file mode 100644 index 0000000..a6184aa Binary files /dev/null and b/default/default_blink_closed.png differ diff --git a/default/default_blink_open.png b/default/default_blink_open.png new file mode 100644 index 0000000..74dc48a Binary files /dev/null and b/default/default_blink_open.png differ diff --git a/default/default_closed.png b/default/default_closed.png new file mode 100644 index 0000000..a12bb31 Binary files /dev/null and b/default/default_closed.png differ diff --git a/default/default_open.png b/default/default_open.png new file mode 100644 index 0000000..f02c128 Binary files /dev/null and b/default/default_open.png differ diff --git a/default/profile.yaml b/default/profile.yaml new file mode 100644 index 0000000..383570c --- /dev/null +++ b/default/profile.yaml @@ -0,0 +1,17 @@ +--- +# Image set. Four images for eyes and mouth close/open +image_closed: default_closed.png +image_open: default_open.png +image_blink_closed: default_blink_closed.png +image_blink_open: default_blink_open.png + +# Color to render the background of the window. Useful for color key capturing +bg_color: [0, 255, 0] + +# Overriding shake settings from the main yaml file +# You can override most settings +shake_delay: 2 +shake_intensity: 5 + +# Override blink chance from main yaml file +blink_chance: 0.05 diff --git a/main.pyw b/main.pyw new file mode 100644 index 0000000..c9921e4 --- /dev/null +++ b/main.pyw @@ -0,0 +1,354 @@ +import array +import config +import os +import pyaudio +import random +import typing +import tkinter +from tkinter import filedialog + + +os.environ["PYGAME_HIDE_SUPPORT_PROMPT"] = "1" +import pygame +from pygame.locals import * + + +__author__ = "Trysdyn Black" +__license__ = "ACSL v1.4" +__version__ = "1.0.0" +__copyright__ = "Copyright 2022, Trysdyn Black" + + +class Audio: + """Audio subsystem that monitors a microphone and determines avatar state. + + Methods + ------- + get_state() -> bool + Evaluate buffered audio data and return true if the average amplitude is over the configured threshold. + Indicates the mic is hot. Also drains the audio buffer. + """ + + def __init__(self, channels: int, rate: int, frames: int, threshold: int = 5000, smoothing: int = 0) -> None: + """ + Parameters + ---------- + channels : int + Number of audio channels in the input device, usually 1 or 2 for mono/stereo + rate : int + Baud rate of the audio device, typically 44100 or 48000 as configured in Windows + frames : int + Number of times a second to check the audio buffer. The buffer is fully drained each check + threshold : int + Audio amplitude at which we consider the mic hot + smoothing : int + Number of frames we continue to consider the mic hot after it stops being hot + """ + p = pyaudio.PyAudio() + self.frames = frames + self.stream = p.open( + format=pyaudio.paInt16, + channels=channels, + rate=rate, + input=True, + frames_per_buffer=frames, + ) + + self.threshold = threshold + self.smoothing = smoothing + self.this_smooth = 0 + + def get_state(self) -> bool: + """Check audio buffer and indicate if the avatar's mouth should be open this frame. + + Returns + ------- + bool + Returns true if the avatar's mouth should be considered open this frame + """ + # Grab a window of mic data into integer representations + data = self.stream.read(self.frames, exception_on_overflow=False) + a_data = array.array("h", data) + + # Return true if we're over threshold or it's been fewer than smooth + # checks since the last time we were over threshold. + if max(a_data) > self.threshold: + self.this_smooth = self.smoothing + return True + elif self.this_smooth: + self.this_smooth -= 1 + return True + + return False + + +class Tube: + """Encapsulates avatar and windowing logic, handles drawing. + + Methods + ------- + should_blink() -> bool + Evaluate all available settings and data and return true if the avatar should be blinking this frame. + update() + Perform one frame of checks, logic, and prep and draw the avatar. + """ + + def __init__( + self, + image_closed: str, + image_open: typing.Optional[str] = None, + image_blink_closed: typing.Optional[str] = None, + image_blink_open: typing.Optional[str] = None, + blink_chance: int = 0, + blink_frames: int = 0, + shake_intensity: int = 0, + shake_delay: int = 0, + win_size: typing.Optional[tuple] = None, + bg_color: tuple = (0, 255, 0), + ) -> None: + """ + Parameters + ---------- + image_closed : string + Filename for the image to load for a opened eyes/closed mouth avatar state. + Mandatory. + image_open : string or None + Filename for the image to load for a opened eyes/opened mouth avatar state. + Defaults to None. + image_blink_closed : string or None + Filename for the image to load for a closed eyes/closed mouth avatar state. + Defaults to None. + image_blink_open : string + Filename for the image to load for a closed eyes/opened mouth avatar state. + Defaults to None. + blink_chance : float + A chance, out of 1.0, to activate blinking each frame of rendering. + Defaults to 0 to disable blinking. + blink_frames : int + How many frames to hold the eyes-closed state after activating a blink. + Defaults to 0 to disable blinking. + shake_intensity : int + How far in pixels to limit movement on each axis when shaking. + Defaults to 0 to disable shaking. + shake_delay : int + How many frames to prevent shaking after completing a prior shake. + Defaults to 0. + win_size : tuple(int, int) or None + Window size as a tuple of ints (width, height). + Defaults to None, which uses the image size of image_closed instead. + bg_color : tuple(int, int, int) + Background color as a tuple of ints (red, green, blue). + Defaults to (0, 255, 0) for greenscreen green. + """ + self.open_frames = 0 + self.blinked_frames = 0 + + # These display settings are temporary. We can't load images without it + self.display = pygame.display.set_mode((400, 300)) + pygame.display.set_caption("pyngtube") + + # Closed mouth image is mandatory + self.image_closed = pygame.image.load(image_closed).convert_alpha() + + # Open mouth image is optional + if image_open: + self.image_open = pygame.image.load(image_open).convert_alpha() + else: + self.image_open = None + + # Blinking images are optional + if image_blink_closed: + self.image_blink_closed = pygame.image.load(image_blink_closed).convert_alpha() + else: + self.image_blink_closed = None + + if image_blink_open: + self.image_blink_open = pygame.image.load(image_blink_open).convert_alpha() + else: + self.image_blink_open = None + + # Establish our blink settings + self.blink_chance = blink_chance + self.blink_frames = blink_frames + + # If we only have a closed-mouth image (why?) use it for open-mouth too + using_image_open = self.image_open if self.image_open else self.image_closed + + # State map for which image to use. (Mouth Opened?, Blinked?) + # If we don't have blinked images, we use the non-blinked images for those slots + self.state_map = { + (False, False): self.image_closed, + (True, False): using_image_open, + (False, True): self.image_blink_closed if self.image_blink_closed else self.image_closed, + (True, True): self.image_blink_open if self.image_blink_open else using_image_open, + } + + # If we specified a window size, use it. If not use closed image size + # Then we pad by 10px to allow for shaking + if not win_size: + win_size = self.image_closed.get_size() + + self.win_size = ( + win_size[0] + (shake_intensity * 2), + win_size[1] + (shake_intensity * 2), + ) + + # Background color to blank with, likely a greenscreen color + self.bg_color = bg_color + + # Shaking intensity. Numer of frames to wait before shaking + self.shake_intensity = shake_intensity + self.shake_delay = shake_delay + + # Create our drawing buffer and initialize true display size + self.buf = pygame.surface.Surface(self.image_closed.get_size()) + self.display = pygame.display.set_mode(self.win_size, pygame.RESIZABLE) + + def should_blink(self) -> bool: + """Check if we're currently blinking and check blink_chance to indicate if we should be blinking this frame. + + Returns + ------- + bool + Returns true if the avatar's eyes should be considered closed this frame. + """ + # If we're blinking and haven't reached the configured blink duration yet + # extend blinking by a frame to keep the eyes shut. + # Otherwise roll blink chance and set blink if the roll wins. + if self.blinked_frames and self.blinked_frames < self.blink_frames: + self.blinked_frames += 1 + return True + else: + if random.random() < self.blink_chance: + self.blinked_frames += 1 + return True + + self.blinked_frames = 0 + return False + + def update(self, opened: bool, blinked: bool) -> None: + """Perform updates and drawing for one frame. This should be called once a frame and fed avatar state info on + open mouth and blinking. This handles all the other placement, style, and drawing functions from there. + + Parameters + ---------- + opened : bool + true if the avatar's mouth should be open this frame. + blinked : bool + true if the avatar's eyes should be closed this frame. + """ + # Blank with bg_color + self.display.fill(self.bg_color) + self.buf.fill(self.bg_color) + + # Jitter tracks our offset from origin for shaking + # It's applied as a transformation to image location each frame + jitter = (0, 0) + i = self.shake_intensity + + # Figure out which image we're drawing this frame using our state map + this_frame_image = self.state_map[(opened, blinked)] + + # If the mic state is opened and an open mouth image exists, blit it + # to the drawing buffer. Otherwise use the closed mouth image. + if opened and self.image_open: + self.open_frames += 1 + + # Randomize our jitter if we're currently shaking + if self.shake_delay and not self.open_frames % self.shake_delay: + jitter = (random.randint(-i, i), random.randint(-i, i)) + else: + self.open_frames = 0 + jitter = (0, 0) + + # Draw our chosen image to a drawing buffer + self.buf.blit(this_frame_image, (0, 0)) + + # Resize the drawing buffer to the window size if necessary + # Scale on the vertical axis to maximum window size. + # If we're configured to shake the sprite on open mic, randomly + # jitter the sprite around the frame a bit while mic is open. + display_size = self.display.get_size() + shrunk_size = (display_size[0] - (i * 2), display_size[1] - (i * 2)) + if self.buf.get_size() != shrunk_size: + new_buf = pygame.transform.scale(self.buf, shrunk_size) + self.display.blit(new_buf, (i + jitter[0], i + jitter[1])) + else: + self.display.blit(self.buf, (i + jitter[0], i + jitter[1])) + + pygame.display.flip() + + +if __name__ == "__main__": + # Hide Pygame "Hello world" stuff. Why does this require an envvar :/ + + # I use some not-great extreme shorthand in the main loop here: + # c = config object + # a = audio subsystem object + # t = pngtube object, the window and avatar object + quit = False + rehash = True + root = tkinter.Tk() + root.withdraw() + pygame.init() + + # Load config + c = config.Config() + + # Initialize audio system + a = Audio( + channels=2 if c.mic_stereo else 1, + rate=c.mic_rate, + frames=int(c.mic_rate / c.audio_checks), + threshold=c.threshold, + smoothing=c.smoothing, + ) + + while not quit: + # Reload our avatar if we've been instructed to rehash settings + if rehash: + t = Tube( + image_closed=c.image_closed, + image_open=c.image_open, + image_blink_closed=c.image_blink_closed, + image_blink_open=c.image_blink_open, + blink_chance=c.blink_chance, + blink_frames=c.blink_frames, + bg_color=c.bg_color, + shake_delay=c.shake_delay, + shake_intensity=c.shake_intensity, + ) + rehash = False + for event in pygame.event.get(): + # Clean shutdown if user clicks the [X] + if event.type == QUIT: + pygame.quit() + a.stream.close() + quit = True + break + # If the window gets resized, handle redrawing the window in the new size + elif event.type == pygame.VIDEORESIZE: + # Scale only on vertical axis. Force horizontal axis to match + # the drawing buffer's aspect ratio. + win_size = (event.w, event.h) + buf_ratio = t.buf.get_width() / t.buf.get_height() + t.win_size = (win_size[1] * buf_ratio, win_size[1]) + t.display = pygame.display.set_mode(t.win_size, pygame.RESIZABLE) + # Mouse wheel up and down changes mic threshold. + # TODO: Provide some kind of user feedback + # But I don't want it visible in the capture region + elif event.type == pygame.MOUSEWHEEL: + a.threshold += event.y * 100 + print(a.threshold) + # RMB to open load file dialog for profile + elif event.type == pygame.MOUSEBUTTONDOWN: + if event.button == 3: + file_types = [("yaml files", "profile.yaml")] + file_path = filedialog.askopenfilename(title="Select a profile", filetypes=file_types) + if file_path: + c.profile = os.path.dirname(file_path) + rehash = True + + # Check mic, update image + if not quit: + t.update(a.get_state(), t.should_blink()) diff --git a/ptv.yaml b/ptv.yaml new file mode 100644 index 0000000..66fd551 --- /dev/null +++ b/ptv.yaml @@ -0,0 +1,36 @@ +--- +# Which profile to use by default +# This must be a directory that contains a profile.yaml and all +# the avatar images +profile: 'default' + +# How many times per second audio is checked +# This also impacts FPS of the avatar +audio_checks: 60 + +# Mic amplitude level we consider "Talking" +threshold: 5000 + +# Once the avatar's mouth is open it remains open for at minimum +# this many frames +smoothing: 3 + +# Microphone audio rate. This should match the inpute rate configured +# in Windows. In 99.9% of cases 44100 is the right answer. +mic_rate: 44100 + +# If the mic is stereo or not. true or false +mic_stereo: true + +# How many frames we wait between shakes if avatar shaking is configured +shake_delay: 6 + +# How many pixels in a random direction we shake the avatar each shake +# Setting this to 0 disables shaking +shake_intensity: 0 + +# Chance per frame to have the avatar blink. 0 disables this +blink_chance: 0.003 + +# How many frames, once we blink, we display the eyes closed images +blink_frames: 10 diff --git a/pyproject.toml b/pyproject.toml new file mode 100644 index 0000000..d29030f --- /dev/null +++ b/pyproject.toml @@ -0,0 +1,4 @@ +[tool.black] +line-length = 120 +target-version = ['py310'] +include = '\.pyw?$' diff --git a/requirements.txt b/requirements.txt new file mode 100644 index 0000000..b3c95cd --- /dev/null +++ b/requirements.txt @@ -0,0 +1,6 @@ +numpy==1.23.4 +Pillow==9.3.0 +PyAudio==0.2.12 +pygame==2.1.2 +PyYAML==6.0 +six==1.16.0