Video Production with Streamtasks

Claudio Ctin1 month ago14 mins

Since the inception of video production, we have faced a significant challenge: processing vast amounts of data. Initially, computers were too slow, leading to the emergence of an industry focused on building specialized hardware boxes for various tasks such as encoding, decoding, transmitting, receiving, switching, and displaying video. These boxes became indispensable, making video production an expensive endeavor.

Today, however, we have powerful general-purpose hardware, such as GPUs and CPUs, capable of handling the video and audio data we need to process. But the industry is still behind on this. Why do video encoder boxes still exist when GPUs can perform these tasks?

So what is the problem?

Integration and Software. We have little bits and pieces of software that can do some things, but not others. For simple recording tasks, tools like OBS suffice. For more complex switching, tools like vMix are used. All those tools have one thing in common: They are not as flexible as plugging in some wires wherever you want. You are stuck with what is handed to you.

The Solution

We make boxes and wires in software. Instead of buying encoder boxes, you encode your video with little programs and connect these little programs with virtual wires. This it the idea of streamtasks. You can even change what codec you want, what resolution, what pixel formats, whatever this little piece of software („Task“) allows you to change. To get a better understanding of how the software works I recommend watching the overview video.

Streamtasks allows you to connect topics, which are displayed as colored circles, with eachother. During operation, the task will process the input data in realtime. For example, a video encoder will encode the raw video it receives and then output the encoded video on the output topic.

What you can do with streamtasks depends on two things:

the tasks available
your imagination

While I cannot improve your imagination, I can make tasks and simplify the process of creating new ones.
Right now, there are over 50 different tasks integrated into streamtasks. These range from very simple to more complex and powerful tasks, many of which are documented here.

The general idea is to give the user as much control as possible. If a task can be split into two simpler ones, it is usually a good idea.

If you can write software, you can write your own tasks. The simplest task can be written in 4 lines of code (even less, if you really try). If you are interested in making your own tasks, I refer you to the documentation: https://leopf.github.io/streamtasks/custom-task.html

AI

One of the greatest advancements of our time is AI, which has not yet been widely integrated into video production. Streamtasks provides a strong framework for integrating AI and other kinds of software into your production pipelines. You can implement AI into you workflows today and build things like:

Live Audio Transcription
Live Face Detection
Face Tracking
Text To Speech

You can watch this Demo of a chatbot with TTS to get an Idea of what is possible.

Distributing Data and Workloads

While our computers are really fast today, there is a limit to their performance. In order to build larger production pipelines that process more data, you will have to distribute data and workload accross multiple machines.

This is possible with streamtasks today.

You can have mutliple instances of streamtasks running on different machines and connect them by either using the connection manager UI, or specifying a connect/serve url on the command line.

 # the main instance (started with -C) accepting tcp connections on 9002 
 streamtasks -C –serve tcp://0.0.0.0:9002
 # a sub instance connecting to the main system (in this case running on the same machine)
 streamtasks –connect tcp://127.0.0.1:9002

Thats it. Now you can create your pipelines with all tasks available on either machine. The only apparent difference between tasks running different machines is a dotted line connecting their topics.

Each task available on both machines will now be displayed twice in the task selection. Once for each node.

As a result you can now start tasks on different machines and connect them seemlessly.

In this example we have two different audio inputs (like microphones) one running on node/machine “demo1” and the other running on “demo2”. This audio is mixed and then played on an audio output on “demo1”. Be cautious! The audio data from the audio input on demo2 is being sent over TCP. If you send raw video over a TCP connection you will likely not get the result you wish for. 1080p 30fps RGB24 video is 1492Mb/s of data. If you do not have enough bandwidth, it won’t work.

Distributing Workloads

By having tasks run on different machines, you can distribute heavy workloads, like encoding video. This can be done while still extracting useful information like the amount of activity in a video (how many and how much pixels have changed), before the video is encoded.

In the following example we have two video inputs (like cameras). One on each node. The video activity is measured, in order to switch between the video feeds, and the video is encoded on each node. The encoded video data from demo2 is then sent over to demo1, where we switch between the video feeds based on their activity. The most active video is then displayed in a video viewer.

This allows us to use one machine as an encoder for each video feed. When using GPU Hardware encoders, each machine can usually encode much more than just one video feed. As we scale up, we will want to distribute the load across machines.

Distributing Data

Local internet connections are often very limited. Even if you are lucky to get 1Gb/s upload speed, it still does not compare to the kind of speed you can archive from a server running in a datacenter.

When streaming to multiple streaming services at the same time, you might want to first send your video to a server in a datacenter and from there distribute it to multiple streaming services.

In the following examples we have a video input running on our computer. The video data is encoded and then sent to 3 different output containers. These output containers will then each publish the video feed to a streaming service as a rtmp stream (see the live streaming demo).

The streamtasks network is smart enough to only send the video feed once to our server running in the datacenter, despite there being three tasks receiving the data. As a result we are only using a third of the upload bandwidth on our local machine, while streaming to three different services.

Building Dashboards

The more complex your production pipelines are, the more oversight you need in order to monitor them. Streamtasks allows you to create dashboards that consist of windows, each window is the “display” of a task. This can be the video from a video viewer, the text of a text display, the switch of a switch UI.

The following example is a kill switch. The timestamp updater moves the time of the video feed 5s into the future. When synchronized with the other video feeds this causes the actual feed to be pushed back 5s in time. That way you can preview the video in the live view and then switch the output to a static image 5s before it is displayed.

The video viewers each display a video, which can be arranged in the dashboard. The switch UI, used to turn on/off the video, displays a control element.

More Examples

Since you are free to combine any tasks as you wish, you can built awesome things with it. Here are some examples:

You can implement video switching by having multiple video streams and using a switch to switch between them. All you need to actually switch is a way of controlling the switch like radio buttons, a little UI that displays some options and lets you select one. The switch will always switches to the video feed that has the largest number as its control signal.

Reducing the volume of ambient sound (like stadium microphones) when someone is speaking.

Switching the video of podcasts by always switching to the loudest speaker

Playing sounds
Creating chatbots that speak

If you are interested in seeing more working examples, you can watch them on Youtube.

Please follow and like us:

Stiri similare

IaC Security Analysis: Checkov vs. tfsec vs. Terrascan – A Comparative Evaluation

Claudio Ctin1 hour ago56 mins ago

Traditional, manual security processes can’t keep up with the speed of modern development, which leaves systems vulnerable to attacks. That’s where Security as Code (SaC) comes in. SaC automates security checks and policies, making them an integral part of the development pipeline. This ensures that security is built into every step without slowing down progress….

Terraform – Overview

Claudio Ctin1 hour ago56 mins ago

What is Terraform? 🤔 Terraform is an open-source, cloud-agnostic, one of the most popular Infrastructure-as-code (IaC) tool developed by HashiCorp. It is used by DevOps teams to automate infrastructure tasks such as provisioning of your cloud resources. Terraform supported immutable infrastructure, a declarative language, a masterless and agentless architecture, and had a large community and…

Why is Nextjs so popular in startups?

Claudio Ctin2 hours ago56 mins ago

Nextjs is quite popular with startups. What are the main features of Next.js that make startups adopt it? Please follow and like us:

10 Essential Shadcn Components Every Developer Should Know About

Claudio Ctin2 hours ago56 mins ago

Living in the web development world, it may just feel like digging for treasure until you find your tools. If you’re just starting now, well, you’re in luck because Shadcn is packed full of awesome components that can make life so much easier. If you are a professional or a beginner, knowing these 10 essential…

Execute Syntax on Background Process

Claudio Ctin2 hours ago55 mins ago

Pada postingan ini kita akan belajar cara remot ssh dan menjalankan perintah di balik layar atau background proses dengan 2 tools. Tools yg pertama akan kita bahas adalah tmux dan yg kedua adalah screen. Kedua alat tersebut adalah terminal multiplexer yang digunakan untuk multi tasking di terminal Linux. Okeh kita bahas cara penggunaanya. Saya asumsikan…

Scala is one of the best ways to learn Haskell

Claudio Ctin2 hours ago55 mins ago

First, a reminder: You DON’T NEED TO learn Haskell — that’s not what I want to talk about. I genuinely want to talk about learning Haskell, and Haskell specifically — not functional programming. The following might interest you if you want to get into Haskell for personal reasons (if you think it’s cool, well-paid, or…

What is the Difference Between Radix UI and ShadCN?

Claudio Ctin2 hours ago55 mins ago

Be it a seasoned pro or a newcomer, UI library selection could make all the difference in any of your projects. Today, let’s take a deeper look at two of the top contenders: Radix UI and ShadCN. Both these libraries have something different to bring to the table, and by the end of this post,…

AuthorizationEndpoint vs TokenEndpoint

Claudio Ctin2 hours ago55 mins ago

In OAuth 2.0, the AuthorizationEndpoint and TokenEndpoint serve different roles in the process of obtaining access to resources on behalf of a user. Here’s a breakdown of the differences between them: 1. Authorization Endpoint Purpose: The AuthorizationEndpoint is responsible for obtaining authorization from the user to access their resources. This is where the user grants…

Custom Bootstrap 5 Breadcrumbs -Ver 2

Claudio Ctin2 hours ago55 mins ago

Custom Breadcrumbs for Bootstrap 5 framework Abstract: We are presenting code (CSS) for custom Bootstrap 5 breadcrumbs. This is an improved version of the previously published article. 1 The need for better Breadcrumbs Bootstrap 5 framework is coming with very basic Breadcrumbs implementation. I needed something much better, both visually and more functional. Over time,…

A Comprehensive Guide to Appium 2.0 Migration

Claudio Ctin2 hours ago54 mins ago

In mobile app development, keeping up with the latest advancements in testing tools is crucial. A significant upgrade in automation testing is the release of Appium 2.0. With its myriad enhancements and new features, migrating to Appium 2 is essential for teams aiming to maintain a competitive edge. This Appium 2.0 Migration blog will guide…

FluxCD – A lightweight GitOps CD tool: Day 44 of 50 days DevOps Tools Series

Claudio Ctin2 hours ago54 mins ago

Welcome to Day 44 of our “50 DevOps Tools in 50 Days” series! Today, we are exploring FluxCD, one of the most popular tools in the GitOps ecosystem. FluxCD automates the process of deploying applications to Kubernetes, allowing for seamless continuous delivery and progressive deployments. It uses Git as the single source of truth, ensuring…

🌐 SSL Certificates and How to Implement Them in Your Website 🔐

Claudio Ctin2 hours ago54 mins ago

In today’s web development world, ensuring security is critical. As developers, safeguarding user data and building trust with our audience are essential. One powerful way to achieve this is by implementing an SSL (Secure Sockets Layer) certificate on our websites. 🔒 In this post, we’ll dive into: What SSL is 🤔 Why it’s important 🌟…

Spring Boot RestTemplate getForEntity method

Claudio Ctin3 hours ago3 hours ago

Producer application Dependencies Spring Web and Lombok pom.xml <?xml version=”1.0″ encoding=”UTF-8″?> <project xmlns=”http://maven.apache.org/POM/4.0.0″ xmlns:xsi=”http://www.w3.org/2001/XMLSchema-instance” xsi:schemaLocation=”http://maven.apache.org/POM/4.0.0 https://maven.apache.org/xsd/maven-4.0.0.xsd”> <modelVersion>4.0.0</modelVersion> <parent> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-parent</artifactId> <version>3.2.5</version> <relativePath/> <!– lookup parent from repository –> </parent> <groupId>com.example</groupId> <artifactId>ProducerApp</artifactId> <version>0.0.1-SNAPSHOT</version> <name>ProducerApp</name> <description>Demo project for Spring Boot</description> <properties> <java.version>17</java.version> </properties> <dependencies> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-web</artifactId> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-test</artifactId> <scope>test</scope> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter</artifactId> </dependency>…

Mastering Terraform: How Loops and Conditionals Unlock Smarter Infrastructure Automation

Claudio Ctin4 hours ago3 hours ago

In the ever-growing world of cloud infrastructure, managing and automating deployments has become increasingly complex. This is where Terraform comes in — as one of the most powerful tools in Infrastructure as Code (IaC), Terraform allows engineers to declaratively define and provision infrastructure with ease. However, as configurations grow, keeping them concise and manageable is…

The Power of Machine Learning in Angular: A Developer’s Guide

Claudio Ctin4 hours ago3 hours ago

The intersection of Machine Learning (ML) and Artificial Intelligence (AI) with Angular is a game-changer for modern web applications. This article delves into how you can integrate AI and ML into your Angular projects, breaking it down with practical examples, real-world use cases, and helpful tips to guide you along the way. please subscribe to…

Introducing HTPX: A Lightweight and Versatile HTTP Client for JavaScript and Node.js

Claudio Ctin4 hours ago3 hours ago

As developers, we often need a reliable and efficient HTTP client for our web applications, whether we’re building with JavaScript in the browser or Node.js on the server side. That’s why I created HTPX — a powerful, lightweight solution designed to simplify HTTP requests while offering a range of features for modern development. In this…

Advanced Terraform Module Usage: Versioning, Nesting, and Reuse Across Environments

Claudio Ctin4 hours ago3 hours ago

Terraform, HashiCorp’s popular Infrastructure as Code (IaC) tool, has transformed how infrastructure is provisioned and managed. One of its most powerful features is the ability to create and use modules — reusable configurations that encapsulate sets of resources. For teams looking to scale and manage their infrastructure efficiently, mastering advanced Terraform module usage — including…

10 sec for UX Design Principles.

Claudio Ctin4 hours ago3 hours ago

DFFH RULE 1.D- Doherty Threshold : Users somehow replicate the computer’s working speed to navigate the components in the design. Tend to keep the waiting time for users at less than 400 ms. The loading time can be utilized to show visual animations of loading or task-processing phenomena to build trust. 2.F- Fitt’s Law :…