Non-Functional Testing: Load and Stress Tests with K6

Claudio Ctin | 3 hours ago | 20 mins

Overview

In this article, you’ll learn what role non-functional testing plays in your software’s overall performance and reliability. We focus on load and stress testing and explain why these tests are essential to ensure your application can handle real-world traffic conditions and beyond. By the end, you’ll see how stress testing identifies breaking points and helps ensure robustness under peak loads.

What is Non-Functional Testing

Some Types of Non-Functional Testing

Performance Testing
Load Testing
Stress Testing
Reliability Testing
Security Testing

Which Metrics to Monitor on Your Server

What is k6

Getting Started with k6

The Project We Will Test
API Code
K6 Tests Using Javascript
Running Your k6 Test
System Metrics Before and During Load Test
K6 Native Dashboard Results
How to Analyze the K6 Dashboard Metrics
Understanding the Metrics

Where can I send my K6 Metrics and Logs

How to Use k6 Tests in Production

References

What is Non-Functional Testing

First, we need to understand what functional and non-functional requirements are. It’s important to recognize that both are relevant to an application.

Functional requirements: Define what the system must do, that is, the features and behaviors that fulfill business needs.

Send an email after user registration.
Show financial dashboards with charts in real time.

Non-functional requirements: Define the quality attributes of the system, such as performance, security, reliability, and scalability.

The web page should load in 1.8 seconds or less for any user, i.e. First Contentful Paint (FCP) <= 1.8s. (performance)
The API should return financial data updates to 1 million simultaneous users in less than 3 seconds. (scalability and performance)

As you can imagine, if the non-functional requirements are not met, users may become frustrated and eventually stop using the application. This is just one example: “Slow pages can increase bounces”.

Some Types of Non-Functional Testing

Performance Testing

Evaluates how well the system performs under various conditions.
Example: Measure response times of web pages under different user loads.

Load Testing

Tests the system’s behavior under expected load conditions.

Example: Simulate 10,000 concurrent users accessing an e-commerce website.

Stress Testing

Pushes the system beyond its normal operational limits to evaluate its stability.
Example: Increase database transactions exponentially to test system crash points.
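The idea of raising load exponentially until the system breaks can be sketched as a list of ramp steps. The snippet below is a minimal sketch in plain JavaScript: the helper name `exponentialStages` and all numbers are assumptions made for illustration. The `{ duration, target }` shape matches the `stages` option that k6 accepts, so an array like this could be assigned to `options.stages` in a k6 script.

```javascript
// Hypothetical helper: builds ramp steps whose target grows by `factor`
// at every step, so the simulated load rises exponentially.
function exponentialStages(startTarget, factor, steps, stepDuration) {
  const stages = [];
  let target = startTarget;
  for (let i = 0; i < steps; i++) {
    stages.push({ duration: stepDuration, target: Math.round(target) });
    target *= factor;
  }
  // Ramp back down to zero so recovery after the peak can be observed too.
  stages.push({ duration: stepDuration, target: 0 });
  return stages;
}

const stressStages = exponentialStages(10, 2, 5, '30s');
console.log(stressStages.map((stage) => stage.target).join(' -> '));
// prints "10 -> 20 -> 40 -> 80 -> 160 -> 0"
```

The step at which error rates or response times start to climb sharply gives a first estimate of the breaking point.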

Reliability Testing

Ensures the system functions correctly and reliably under different conditions.
Example: Verify that the system recovers gracefully from server failures.

Security Testing

Identifies vulnerabilities and ensures data protection within the system.
Example: Conduct penetration tests to uncover weaknesses in authentication.

Which metrics to monitor on your server

System Metrics:

CPU Usage: Identify if the server is under heavy load, which can lead to performance degradation or bottlenecks in processing tasks.

Memory Usage: Identify memory leaks or insufficient memory allocation, which can cause slowdowns or crashes when memory is exhausted.

Disk Usage: Identify when storage is nearing capacity, which can lead to application failures or data loss if not managed properly.

Application Metrics:

Request Rate: Monitor requests per minute (RPM) and requests per second (RPS).

Response Time: Evaluate the time it takes for your API to respond to requests. This helps identify performance issues.
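As a rough illustration of how request rate is derived, the sketch below counts requests inside a time window and converts the count to RPS and RPM. The function name `requestRate` and the sample timestamps are hypothetical; a real monitoring tool would compute this from your access logs or metrics pipeline.

```javascript
// Hypothetical helper: request rate over a window, from request timestamps (ms).
function requestRate(timestampsMs, windowStartMs, windowEndMs) {
  const windowSeconds = (windowEndMs - windowStartMs) / 1000;
  const count = timestampsMs.filter(
    (t) => t >= windowStartMs && t < windowEndMs
  ).length;
  const rps = count / windowSeconds;
  return { count, rps, rpm: rps * 60 };
}

// Six requests observed in a 2-second window (made-up timestamps).
const rate = requestRate([0, 300, 700, 1100, 1500, 1900], 0, 2000);
console.log(rate); // { count: 6, rps: 3, rpm: 180 }
```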

Error Metrics:

Error counts: 4xx, 5xx
Many 4xx errors may indicate poorly written documentation for your API consumers.
An increase in 5xx errors suggests server-side issues that must be addressed.
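Splitting errors into 4xx and 5xx counts could look like the sketch below. The function name `summarizeErrors` and the sample status codes are assumptions for illustration only.

```javascript
// Hypothetical helper: groups HTTP status codes into client and server errors.
function summarizeErrors(statusCodes) {
  const summary = { total: statusCodes.length, clientErrors: 0, serverErrors: 0 };
  for (const status of statusCodes) {
    if (status >= 400 && status < 500) summary.clientErrors += 1;
    else if (status >= 500 && status < 600) summary.serverErrors += 1;
  }
  const errors = summary.clientErrors + summary.serverErrors;
  summary.errorRate = summary.total === 0 ? 0 : errors / summary.total;
  return summary;
}

const errorSummary = summarizeErrors([200, 201, 404, 500, 200, 401, 503, 200]);
console.log(errorSummary);
// { total: 8, clientErrors: 2, serverErrors: 2, errorRate: 0.5 }
```

Tracking the two groups separately matters because they point to different owners: 4xx errors usually originate with the caller, 5xx errors with the server.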

Security Metrics:

Failed Login Attempts: Monitor the number of 401 HTTP status codes. This can indicate potential brute force attacks or unauthorized access attempts.

Database Metrics:

Query Performance: Identify queries that fetch more data than necessary, particularly those with excessive joins (SQL) or aggregates (NoSQL).

Active Connections: Reveal errors in your database connection logic and show whether you are closing connections correctly.

Uptime/Downtime: Monitor the availability of your server.

Tracking uptime helps ensure that your services are running smoothly.
Tracking downtime helps identify and address outages promptly.

Cache Hit Ratio: It’s important to understand whether your data is being served from the cache. If your cache hit ratio is low, it indicates that the cache is not being effectively utilized, and you may need to reconsider your caching strategy.
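The cache hit ratio is the share of lookups answered from the cache: hits / (hits + misses). A minimal sketch, with a hypothetical function name and made-up counts:

```javascript
// Cache hit ratio = hits / (hits + misses); returns 0 when there were no lookups.
function cacheHitRatio(hits, misses) {
  const lookups = hits + misses;
  return lookups === 0 ? 0 : hits / lookups;
}

console.log(cacheHitRatio(850, 150)); // 0.85, most reads are served from the cache
console.log(cacheHitRatio(120, 880)); // 0.12, the cache is barely used
```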

These are just some important metrics that you should monitor on your server; there are more.

What is K6

k6 is an open-source load testing tool for evaluating the performance and reliability of web applications and APIs.

Getting Started with k6:

The Project We Will Test

The project is a basic Node.js REST API designed to run in a containerized environment using Docker and Docker Compose.
In this implementation, all data is stored in memory for simplicity, while the resource limits for the application are specified in the docker-compose.yml file.

CPU usage is limited to 20% and memory is capped at 64MB. Limiting resources for local load/stress testing is a good idea if you want to evaluate how the application behaves under constrained conditions, identify performance bottlenecks, and check that it can handle unexpected spikes in traffic without crashing.

docker-compose.yml

version: '3.8'
services:
  app:
    build: .
    ports:
      - "3000:3000"
    deploy:
      resources:
        limits:
          cpus: '0.2' # Limit CPU usage to 20%
          memory: 64M # Limit memory usage to 64MB

API code

The API code is at the repository: https://github.com/godinhojoao/k6-load-stress-tests-api

K6 Tests Using Javascript

import http from 'k6/http';
import { sleep, group } from 'k6';
import { Trend, Counter } from 'k6/metrics';

// Define trends for response duration for each request type
// https://grafana.com/docs/k6/latest/javascript-api/k6-metrics/trend/
const reqDurationTimeGet = new Trend('req_duration_time_get', true); // true marks the values as time values (milliseconds)
const reqDurationTimePost = new Trend('req_duration_time_post', true);
const reqDurationTimeDelete = new Trend('req_duration_time_delete', true);

// Counters for counting event occurrences, like errors.
const getCounterErrors = new Counter('get_errors_counter');
const postCounterErrors = new Counter('post_errors_counter');
const deleteCounterErrors = new Counter('delete_errors_counter');

// By adjusting the options and sleep time, you can simulate either a load test or a stress test, depending on the configuration.
// Load test: expected usage.
// Stress test: test the system's breaking point.
export const options = {
  duration: '1m',

  // https://grafana.com/docs/k6/latest/misc/glossary/#virtual-user
  vus: 50, // Increase the number of Virtual Users to simulate more concurrent requests

  // https://grafana.com/docs/k6/latest/using-k6/thresholds/#fail-a-load-test-using-checks
  thresholds: { // thresholds are the fail/pass criteria
    'http_req_duration': ['p(95)<200'], // 95% of requests must finish within 200ms.
    'http_req_failed': ['rate<0.01'], // less than 1% of failed reqs
    'get_errors_counter': ['count<1'], // less than 1 error fetching
    'post_errors_counter': ['count<1'], // less than 1 error creating
    'delete_errors_counter': ['count<1'], // less than 1 error deleting
  },
};

export default function () {
  // https://grafana.com/docs/k6/latest/javascript-api/k6-http/set-response-callback/
  /* This is a broad check on the response status only.
     For more detailed checks (for example, on the response body), use the k6 `check` function on the specific request. */
  http.setResponseCallback(http.expectedStatuses({ min: 200, max: 299 }));

  // Use groups to organize https://grafana.com/docs/k6/latest/using-k6/tags-and-groups/
  group('Get Events', () => {
    const getEventsResponse = http.get('http://localhost:3000/events');

    // Timings of HTTP requests: https://grafana.com/docs/k6/latest/examples/get-timings-for-an-http-metric/
    const getResponseDuration = getEventsResponse.timings.duration;
    reqDurationTimeGet.add(getResponseDuration);

    if (getEventsResponse.status < 200 || getEventsResponse.status >= 300) {
      getCounterErrors.add(1);
    }
  });

  // NOTE: the opening of this group is missing from the published snippet.
  // The group name, the request payload, and the `id` field read from the
  // response are ASSUMPTIONS; check the API repository for the real contract.
  let createdEventId;
  group('Create Event', () => {
    const payload = JSON.stringify({ name: 'k6 test event' }); // assumed payload
    const postEventsResponse = http.post('http://localhost:3000/events', payload, {
      headers: { 'Content-Type': 'application/json' },
    });

    const postResponseTime = postEventsResponse.timings.duration;
    reqDurationTimePost.add(postResponseTime);

    if (postEventsResponse.status < 200 || postEventsResponse.status >= 300) {
      postCounterErrors.add(1);
    } else {
      createdEventId = postEventsResponse.json('id'); // assumed response field
    }
  });

  if (createdEventId) {
    group('Delete Event', () => {
      const deleteEventResponse = http.del(`http://localhost:3000/events/${createdEventId}`);

      const deleteResponseTime = deleteEventResponse.timings.duration;
      reqDurationTimeDelete.add(deleteResponseTime);

      if (deleteEventResponse.status < 200 || deleteEventResponse.status >= 300) {
        deleteCounterErrors.add(1);
      }
    });
  }

  sleep(1); // simulate realistic user behavior (wait 1 second after requests)
  // if you want a burst of requests use a short sleep -> sleep(0.1);
}

// There are many other resources to explore, for example:
// - Handling errors: https://grafana.com/docs/k6/latest/examples/error-handler/
// - Custom summary report: https://grafana.com/docs/k6/latest/results-output/end-of-test/custom-summary/
// And much more that you can find at: https://grafana.com/docs/k6/latest
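How the `vus` and `sleep` settings translate into load can be estimated with simple arithmetic. The sketch below is plain JavaScript, not part of the k6 script. It assumes each iteration issues three requests one after another (GET, POST, DELETE) and uses a made-up average response time of 50 ms; the function name `estimateRequestsPerSecond` is hypothetical.

```javascript
// Rough estimate: each virtual user runs its requests sequentially and then
// sleeps, so one iteration takes
//   sleepSeconds + requestsPerIteration * avgResponseSeconds.
function estimateRequestsPerSecond(vus, requestsPerIteration, sleepSeconds, avgResponseSeconds) {
  const iterationSeconds = sleepSeconds + requestsPerIteration * avgResponseSeconds;
  return (vus * requestsPerIteration) / iterationSeconds;
}

// 50 VUs, 3 requests per iteration, assumed 50 ms average response time.
const relaxed = estimateRequestsPerSecond(50, 3, 1, 0.05); // with sleep(1)
const burst = estimateRequestsPerSecond(50, 3, 0.1, 0.05); // with sleep(0.1)
console.log(relaxed.toFixed(1), burst.toFixed(1)); // prints "130.4 600.0"
```

Under these assumptions, shortening the sleep from 1 second to 0.1 seconds raises the load more than fourfold without changing the number of virtual users. Real throughput will be lower once the server slows down under load.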

Running Your k6 Test

First, install k6.

macOS: brew install k6

Write your tests
Execute your tests and export logs to a JSON file (this will also generate an HTML report and open a real-time dashboard):

K6_WEB_DASHBOARD=true K6_WEB_DASHBOARD_OPEN=true K6_WEB_DASHBOARD_EXPORT=html-report.html k6 run --http-debug api_k6_test.js --out json=k6-logs.json

Analyze Results:

To monitor real-time resource usage: run docker stats container_id (use docker ps to find the container_id).
To view k6 metrics: use the web dashboard, read logs, or integrate with monitoring tools like New Relic, Grafana, etc.
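The JSON file written by `--out json=...` contains one JSON object per line, and entries of type "Point" carry the individual metric samples. As a minimal sketch of reading those logs yourself, the snippet below averages `http_req_duration` over a few made-up lines. The sample values and the helper name `collectMetricValues` are assumptions for illustration.

```javascript
// Made-up sample lines in the shape k6 writes with --out json=...
const sampleLines = [
  '{"type":"Metric","metric":"http_req_duration","data":{"type":"trend"}}',
  '{"type":"Point","metric":"http_req_duration","data":{"time":"2024-10-01T10:00:00Z","value":12.5,"tags":{"status":"200"}}}',
  '{"type":"Point","metric":"http_req_duration","data":{"time":"2024-10-01T10:00:01Z","value":20.5,"tags":{"status":"200"}}}',
  '{"type":"Point","metric":"vus","data":{"time":"2024-10-01T10:00:01Z","value":50,"tags":null}}',
];

// Hypothetical helper: returns every sample value recorded for one metric.
function collectMetricValues(lines, metricName) {
  return lines
    .filter((line) => line.trim() !== '')
    .map((line) => JSON.parse(line))
    .filter((entry) => entry.type === 'Point' && entry.metric === metricName)
    .map((entry) => entry.data.value);
}

const durations = collectMetricValues(sampleLines, 'http_req_duration');
const average = durations.reduce((sum, value) => sum + value, 0) / durations.length;
console.log(durations.length, average); // prints "2 16.5"
```

For a real run, the lines could come from reading k6-logs.json with Node's `fs` module and splitting on newlines. Note that `--http-debug` makes the console output very verbose, so the JSON file is usually the easier source to process.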

System Metrics Before and During Load Test

You will notice that these metrics fluctuate. Additionally, docker stats is not the best approach to analyze your system metrics; I am using it here only because I am testing locally with limited resources.

[Image: docker stats output before running the k6 tests]

[Image: docker stats output during the k6 tests]

K6 Native Dashboard Results

[GIF: k6 web dashboard results, hosted on GitHub]

How to Analyze the K6 Dashboard Metrics

First, we need to understand the units of measurement.

Units:

1 second (s) = 1,000 milliseconds (ms)

1 millisecond (ms) = 1,000 microseconds (µs)

1 microsecond (µs) = 1,000 nanoseconds (ns)
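Since the dashboard mixes these units, a small conversion helper can make values comparable. This is a sketch with a hypothetical function name, not something k6 provides:

```javascript
// Each unit expressed in nanoseconds, following the table above.
const NANOSECONDS_PER_UNIT = { ns: 1, 'µs': 1e3, ms: 1e6, s: 1e9 };

// Hypothetical helper: converts a duration from one unit to another.
function convertDuration(value, fromUnit, toUnit) {
  return (value * NANOSECONDS_PER_UNIT[fromUnit]) / NANOSECONDS_PER_UNIT[toUnit];
}

console.log(convertDuration(1.5, 's', 'ms')); // 1500
console.log(convertDuration(250, 'µs', 'ms')); // 0.25
```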

Response Time Metrics

avg: Average response time of all requests.

med: Median response time; 50% of requests completed in this time or less.

max: Maximum response time observed across all requests.

min: Minimum response time observed across all requests.

p90: 90th percentile response time; 90% of requests completed in this time or less.

p95: 95th percentile response time; 95% of requests completed in this time or less.

p99: 99th percentile response time; 99% of requests completed in this time or less.

Analyzing both average (avg) and median (med) response times is crucial. A single slow response can distort the average, while the median reflects typical performance. Reviewing maximum (max) and minimum (min) times provides context for the best and worst scenarios.
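The effect of a single outlier on these statistics is easy to reproduce. The sketch below computes them for ten made-up response times using the nearest-rank percentile method; that method is an assumption chosen for simplicity, and k6 may compute its percentiles differently.

```javascript
// Nearest-rank percentile: the smallest sample such that at least p% of all
// samples are less than or equal to it. Expects an ascending sorted array.
function percentile(sortedValues, p) {
  const rank = Math.ceil((p * sortedValues.length) / 100);
  return sortedValues[Math.max(rank, 1) - 1];
}

function summarize(valuesMs) {
  const sorted = [...valuesMs].sort((a, b) => a - b);
  const avg = sorted.reduce((sum, value) => sum + value, 0) / sorted.length;
  return {
    avg,
    med: percentile(sorted, 50),
    min: sorted[0],
    max: sorted[sorted.length - 1],
    p90: percentile(sorted, 90),
    p95: percentile(sorted, 95),
  };
}

// Nine fast responses and one slow outlier (values in ms, made up).
const stats = summarize([10, 11, 12, 12, 13, 13, 14, 15, 16, 900]);
console.log(stats);
// { avg: 101.6, med: 13, min: 10, max: 900, p90: 16, p95: 900 }
```

The average (101.6 ms) suggests a slow API, while the median (13 ms) shows that a typical request is fast. The high percentiles are what reveal the outlier.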

Understanding the Metrics

[Image from the k6 documentation]

Where can I send my K6 Metrics and Logs

Grafana: Visualization tool for performance metrics.

New Relic: Cloud-based platform for application performance insights.

InfluxDB: Time-series database for storing time-stamped metrics.

Datadog: Real-time monitoring service for application metrics.

Prometheus: Open-source toolkit for monitoring and alerting.

AWS CloudWatch: Monitoring service for AWS resources.

Azure Monitor: Comprehensive monitoring service for Azure resources and applications.

Google Cloud Monitoring: Visibility tool for Google Cloud applications.

You can send alerts for metric thresholds and anomalies in most of these services, allowing you to take action when performance issues arise.

How to Use k6 Tests in Production

Integrating with Cloud Monitoring Tools: You can integrate k6 with various cloud monitoring services like New Relic, Grafana Cloud, and AWS CloudWatch. These tools help you visualize test results, track performance metrics, and monitor system health. To set up an integration, follow the documentation for each tool; it usually involves API keys and configuration settings. This allows you to send your k6 metrics and logs directly to these platforms for real-time analysis and alerts.

If your infrastructure is on AWS, Azure, or a similar cloud provider, you probably already have at least the system metrics you will need.

Avoid Overloading Your Server: Use a staging environment that mirrors production with the same resources and configurations. If testing directly in production, limit user load and monitor server performance to prevent real user impact.

Analyze Results: After running your tests, analyze the data to find areas for improvement.

Bonus: You can integrate k6 into your CI/CD pipelines to run performance tests automatically. This allows you to verify the stability and performance of your application with every code change or deployment.

References

https://www.thinkwithgoogle.com/marketing-strategies/app-and-mobile/mobile-page-speed-load-time/
https://www.geeksforgeeks.org/non-functional-requirements-in-software-engineering/
https://sematext.com/blog/api-monitoring/
https://www.blobr.io/post/key-api-metrics
https://apitoolkit.io/blog/the-most-important-metric/
https://www.geeksforgeeks.org/software-testing-non-functional-testing/
https://grafana.com/docs/k6/latest
https://grafana.com/blog/2023/04/11/how-to-visualize-load-testing-results/

Thanks for Reading!

Feel free to reach out if you have any questions, feedback, or suggestions. Your engagement is appreciated!

