Speeding Up Data on AWS: From Ingestion to Insights

In a production-scale cloud environment, data is scattered across various storage formats and locations, such as RDS databases, DynamoDB tables, time series databases, S3 files, and external systems. While Amazon QuickSight can connect directly to many data sources, direct connections are often not the preferred approach for reasons of architecture, cost, performance, and user experience. Instead, the best practice is to build a centralized data lake with tooling to consolidate and transform data for business intelligence. But how can you optimize the data pipeline from ingestion to insights to ensure processed data is ready for analysis as quickly as possible?

In this article, we use real-world open data sets on Helsinki region public transport, imported as DynamoDB tables. We showcase how to transform the data from its source into a data lake in S3, how to combine the datasets in QuickSight to create interesting and actionable insights, and eventually, how to speed up the data pipeline so the insights are always as up-to-date as possible.

Anatomy of a typical Serverless Data Pipeline on AWS

NordHero has implemented data pipelines for various customers on AWS utilizing our Data to Insights Jump Start offering. The solution uses

AWS Glue Jobs to extract data from the source systems, transform the data to be efficiently utilized with BI tools, and load the data in Parquet or ORC format to a data lake based on Amazon S3

AWS Glue Crawlers to determine the data lake schemas and to store the schemas in AWS Glue Data Catalog

Amazon Athena to provide a scalable and super-fast SQL interface to the data stored in the S3 data lake

Amazon QuickSight to analyze the data, build actionable insights on it, and deliver the insights to business users

AWS Glue is an AWS-managed service, meaning that AWS manages the needed compute instances, their software, and the scaling of the resources. You only pay for your data’s processing time. You can create and run several AWS Glue jobs to extract, transform, and load (ETL) data from various data sources into the data lake and build different curated datasets in the data lake for various data consumption needs.

Amazon S3 is an ideal service to be used as the storage foundation for a data lake, providing several benefits:

Scalability and Elasticity: Amazon S3 can scale massively to store virtually unlimited amounts of data, without the need for provisioning or managing storage infrastructure.

Data Lake Architecture: S3 enables a decoupled storage and compute architecture, allowing you to store data in its raw form and use various analytics services and tools to process and analyze the data without being tied to a specific compute engine. S3 integrates seamlessly with various AWS analytics services like Amazon Athena, AWS Glue, Amazon EMR, Amazon QuickSight, and AWS Lake Formation, enabling you to build end-to-end data processing and analytics pipelines.

Cost-Effective: Amazon S3 offers a cost-effective storage solution, with pricing based on the amount of data stored and accessed. You can also leverage different storage classes (e.g., S3 Glacier) for cost optimization based on data access patterns, as sketched in the example after this list.

Data Durability and Availability: Amazon S3 is designed for 99.999999999% durability and 99.99% availability, ensuring your data is safe and accessible when needed.

Data Lake Security and Compliance: Amazon S3 provides robust security features, including access control, encryption at rest and in transit, and integration with AWS Identity and Access Management (IAM) for granular permissions management.

Data Sharing and Collaboration: With Amazon S3, you can easily share data across teams, projects, or even with external parties, enabling collaboration and data monetization opportunities.

Centralized Data Repository: A data lake on Amazon S3 serves as a centralized repository for all your structured, semi-structured, and unstructured data, breaking down data silos and enabling data democratization within your organization.
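
As a concrete illustration of the cost-optimization point above, a lifecycle rule can automatically move older raw data to a colder storage class. Here’s a minimal sketch with boto3; the bucket name, prefix, and transition period are illustrative assumptions, not part of the solution described in this article.

import boto3

s3 = boto3.client("s3")

# Move objects under the raw/ prefix to S3 Glacier after 90 days.
# The bucket name, prefix, and 90-day threshold are placeholders.
s3.put_bucket_lifecycle_configuration(
    Bucket="my-data-lake-bucket",
    LifecycleConfiguration={
        "Rules": [
            {
                "ID": "archive-raw-data",
                "Status": "Enabled",
                "Filter": {"Prefix": "raw/"},
                "Transitions": [{"Days": 90, "StorageClass": "GLACIER"}],
            }
        ]
    },
)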

Here’s an example AWS Glue Job script, written in Python, that extracts passenger data from a DynamoDB table named hsl-passengers, transforms the column names from uppercase to lowercase, casts the passenger_count field from String to Integer, and finally writes the transformed data to an S3 bucket in Parquet format.

import sys
from datetime import datetime, date, timedelta

from awsglue.transforms import *
from awsglue.utils import getResolvedOptions
from pyspark.context import SparkContext
from awsglue.context import GlueContext
from awsglue.job import Job
from pyspark.sql.dataframe import DataFrame
from pyspark.sql.types import IntegerType
from pyspark.sql.functions import col

SPARK_CONTEXT = SparkContext.getOrCreate()
GLUE_CONTEXT = GlueContext(SPARK_CONTEXT)
spark = GLUE_CONTEXT.spark_session
logger = GLUE_CONTEXT.get_logger()

def read_dynamo_db(tablename: str):
    """Reads a DynamoDB table into a DynamicFrame"""

    dyf = GLUE_CONTEXT.create_dynamic_frame.from_options(
        connection_type="dynamodb",
        connection_options={
            "dynamodb.input.tableName": tablename,
            "dynamodb.throughput.read.percent": "0.5",
            "dynamodb.splits": "1",
        },
    )
    return dyf


def write_log(message: str):
    """Writes log to multiple outputs"""
    logger.warn(message)
    print(message)


def write_to_s3(s3_output_base_path: str, name: str, df: DataFrame):
    """Writes data to a specific folder in S3"""

    path = f"{s3_output_base_path}/{name}"
    if "processed" not in path:
        raise Exception(
            "Output folder must contain path element 'processed' to be valid"
        )
    write_log(f"Writing output to {path}")
    df.write.mode("overwrite").format("parquet").partitionBy("object_id").save(path)


def main():

    # @params: [JOB_NAME]
    args = getResolvedOptions(
        sys.argv,
        ["JOB_NAME", "s3_output_path"],
    )

    job = Job(GLUE_CONTEXT)
    job.init(args["JOB_NAME"], args)

    # Let's only overwrite partitions that have changed, even though we store all data
    spark.conf.set("spark.sql.sources.partitionOverwriteMode", "dynamic")

    s3_output_path = args["s3_output_path"]

    write_log(f"Parameter s3_output_path: {s3_output_path}")

    # Reading data from DynamoDB
    passengers_raw = read_dynamo_db("hsl-passengers").toDF()

    passengers = (
        passengers_raw.withColumnRenamed("OBJECTID", "object_id")
        .withColumnRenamed("SHORTID", "short_id")
        .withColumnRenamed("STOPNAME", "stop_name")
        .withColumn("passenger_count", col("PASSENGERCOUNT").cast(IntegerType()))
        .drop("PASSENGERCOUNT")
    )

    write_to_s3(s3_output_path, "passengers-data", passengers)

    job.commit()


if __name__ == "__main__":
    main()

When you trigger the AWS Glue Job, AWS Glue fires up the needed Apache Spark compute instances, manages the parallel job execution between cluster nodes, and ramps down the compute services after the job execution has finished.
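
If you want to trigger the job programmatically, for example from a script or a scheduler, you can start it through the AWS Glue API. Here’s a minimal sketch using boto3; the job name and the output path are illustrative placeholders, not values from the actual solution.

import boto3

glue = boto3.client("glue")

# Start the Glue job defined above and pass it the s3_output_path parameter
# that the script reads with getResolvedOptions. Job name and path are placeholders.
response = glue.start_job_run(
    JobName="hsl-passengers-to-datalake",
    Arguments={"--s3_output_path": "s3://my-data-lake-bucket/processed/hsl"},
)
print(f"Started job run {response['JobRunId']}")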

Optimizing the data delivery to the data lake with AWS Glue

You need to consider several things when optimizing the ETL process with AWS Glue, but ultimately it comes down to two criteria. The primary criterion is Time: the data lake is never 100% up-to-date with the source data, so the key question is how often the data should be updated. The secondary criterion is always Money: after setting the Time criterion, how can the costs of the ETL process be optimized?

To meet both criteria, you need to plan your data pipeline well. Here are a few techniques for beating the clock:

Scale cluster capacity: Adjust the number of Data Processing Units (DPUs) and worker types based on your workload requirements. AWS Glue allows you to scale resources up or down to match the demands of your ETL jobs.

Use the latest AWS Glue version: AWS regularly releases new versions of AWS Glue with performance improvements and new features. Upgrade to the latest version to take advantage of these enhancements.

Reduce data scan: Minimize the amount of data your jobs scan by using techniques like partitioning, caching, and filtering data early in the ETL process (see the sketch after this list).

Parallelize tasks: Divide your ETL tasks into smaller parts and process them concurrently to improve throughput. AWS Glue supports parallelization through features like repartitioning and coalesce operations.

Minimize planning overhead: Reduce the time spent on planning by optimizing your AWS Glue Data Catalog, using the correct data types, and avoiding unnecessary schema changes.

Optimize shuffles: Minimize the amount of data shuffled between tasks, as shuffles can be resource-intensive. Use techniques like repartitioning and coalescing to reduce shuffles.

Optimize user-defined functions (UDFs): If you’re using UDFs, ensure they are efficient and optimize their execution using vectorization and caching.

Use AWS Glue Auto Scaling: Enable AWS Glue Auto Scaling to adjust the number of workers based on your workload automatically, ensuring efficient resource utilization.

Monitor and tune: Use AWS Glue’s monitoring capabilities, such as the Spark UI and CloudWatch metrics, to identify bottlenecks and tune your jobs accordingly.

Leverage AWS Glue Workflow: Use AWS Glue Workflow to orchestrate and manage your ETL pipelines, ensuring efficient execution and resource utilization.

Optimize data formats: Use columnar data formats like Parquet or ORC, which are optimized for analytical workloads and can improve query performance.

Leverage AWS Glue Data Catalog: Use the AWS Glue Data Catalog to store and manage your data schemas, which can improve planning and reduce overhead.

Optimize data compression: Use appropriate compression techniques to reduce the amount of data transferred and stored, improving performance and reducing costs.

Avoid processing the same data multiple times: Use AWS Glue Job bookmarks to track the data already processed by the ETL job, and update only the changed partitions when loading data to the data lake.
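
To make the data-scan and bookmark points above more concrete, here’s a minimal sketch of what they could look like in a Glue job script like the one earlier in this article. The catalog database, table name, and predicate are illustrative assumptions; job bookmarks also require the job to be run with the --job-bookmark-option job-bookmark-enable argument.

# Illustrative sketch: read only the needed partitions from the Data Catalog and
# let job bookmarks track what has already been processed. The database, table,
# and predicate values are placeholders.
passengers_dyf = GLUE_CONTEXT.create_dynamic_frame.from_catalog(
    database="hsl_data_lake",
    table_name="passengers_data",
    # Only partitions matching the predicate are read from S3; the rest are skipped
    push_down_predicate="object_id >= '1000'",
    # A transformation_ctx is required for job bookmarks to track this read
    transformation_ctx="passengers_dyf",
)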

Here’s an example of an AWS Glue Workflow. The workflow processes Helsinki Region Transport (HSL) open data on passenger amounts and public transport stops and shifts. The workflow has a trigger named hsl-data-glue-workflow-trigger that is configured to start once per hour. The trigger fires up five parallel AWS Glue Jobs to process data related to shifts, passengers, stoptypes, network, and stops. When all of these Jobs reach the SUCCEEDED state, the hsl-data-glue-crawler-trigger starts an AWS Glue Crawler that updates the data schemas in the AWS Glue Data Catalog.

AWS Glue Workflows support three types of start triggers:

Schedule: The workflow is started according to a defined schedule (e.g., daily, weekly, monthly, or a custom cron expression).

On-demand: The workflow is started manually from the AWS Glue console, API, or AWS CLI.

EventBridge event: The workflow starts with a single Amazon EventBridge event or a batch of Amazon EventBridge events.
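
Triggers can be created in the AWS Glue console, with infrastructure-as-code tools, or directly through the Glue API. As a rough sketch, the hourly start trigger and the conditional crawler trigger described above could be created with boto3 along these lines; the workflow and job names are assumptions for illustration.

import boto3

glue = boto3.client("glue")

# Hypothetical names for the five parallel jobs in the workflow
JOB_NAMES = ["hsl-shifts", "hsl-passengers", "hsl-stoptypes", "hsl-network", "hsl-stops"]

# Hourly schedule trigger that starts the five parallel jobs
glue.create_trigger(
    Name="hsl-data-glue-workflow-trigger",
    WorkflowName="hsl-data-glue-workflow",  # hypothetical workflow name
    Type="SCHEDULED",
    Schedule="cron(0 * * * ? *)",  # once per hour
    Actions=[{"JobName": name} for name in JOB_NAMES],
    StartOnCreation=True,
)

# Conditional trigger that starts the crawler when all five jobs have succeeded
glue.create_trigger(
    Name="hsl-data-glue-crawler-trigger",
    WorkflowName="hsl-data-glue-workflow",
    Type="CONDITIONAL",
    Predicate={
        "Logical": "AND",
        "Conditions": [
            {"LogicalOperator": "EQUALS", "JobName": name, "State": "SUCCEEDED"}
            for name in JOB_NAMES
        ],
    },
    Actions=[{"CrawlerName": "hsl-data-glue-crawler"}],
    StartOnCreation=True,
)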

Optimizing the data insights experience with Amazon QuickSight

From a data consumption perspective, the key criterion is that the data be up-to-date and instantly available. If there is a lot of data in the data lake, updating an analysis or dashboard view in Amazon QuickSight can take tens of seconds. That makes for a poor business analytics experience and generates unnecessary costs.

Amazon QuickSight has solved this issue with a lightning-fast in-memory caching solution called SPICE (Super-fast, Parallel, In-memory Calculation Engine). When configuring QuickSight DataSets, you have the option to either query the underlying data directly or utilize SPICE. QuickSight comes with a 10 GB SPICE allocation per QuickSight Author license, and additional SPICE capacity can be purchased with GB/month pricing.

When using SPICE, the underlying data from the data sources, such as a data lake, is loaded into SPICE. QuickSight Analyses and Dashboards utilize only the version of data available in SPICE. SPICE can be refreshed

manually
by a preconfigured schedule
through the QuickSight API
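
The API route is handy when the refresh needs to be triggered by another process. Here’s a minimal sketch of starting a SPICE refresh (an Ingestion) for a single DataSet with boto3; the account ID and DataSet ID are placeholders.

import time
import boto3

quicksight = boto3.client("quicksight")

# Start a SPICE ingestion for one DataSet. Account ID and DataSet ID are
# placeholders; the IngestionId just needs to be unique per DataSet.
response = quicksight.create_ingestion(
    AwsAccountId="123456789012",
    DataSetId="hsl-passengers-dataset-id",
    IngestionId=f"manual-refresh-{int(time.time())}",
)
print(response["IngestionStatus"])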

The SPICE refresh timing becomes an issue when the goal is to have the most recent data possible available in QuickSight. Consider a situation where the Glue Workflow, containing multiple ETL jobs, runs once per hour and updates several datasets in the S3 data lake. In our imaginary example, the workflow run typically lasts 20 minutes. Still, depending on the amount of data changed in the data sources since the last run and the current utilization level of the AWS-managed Glue hardware, the Workflow run can take anywhere between 14 and 40 minutes.

In addition, the QuickSight SPICE refresh process runs on AWS-managed computing resources, and in our case, refreshing one QuickSight DataSet might take 2-8 minutes.

And in a typical production-scale QuickSight environment, the order of DataSet refreshes matters. There are “simple” DataSets that are not dependent on any other DataSet, and then there are combined DataSets that build on the simple ones. Before refreshing a combined DataSet in QuickSight, we need to be sure that all the underlying DataSets it depends on have already been refreshed.

Here’s an example of a combined DataSet in QuickSight. All datasets available in the data lake have first been brought into QuickSight. The passengers data is first joined with the stops data while passenger amounts are counted per stop. The stops data is then joined with the stoptypes data (whether the stop is a glass shelter, steel shelter, post, …), the network data (whether it is a bus stop, subway stop, tram stop, …), and the shifts data (the number of public transport shifts between different stops).

Optimizing the whole data pipeline from ingestion to insights with event triggering and AWS Step Functions

So how can we manage this all automatically and with optimal timing? To solve the issues, we need to

use AWS Glue Workflow to automate and order the Jobs and Crawlers within the Glue ETL process
refresh the QuickSight DataSets into SPICE immediately after the Glue Workflow has finished its execution
refresh the QuickSight DataSets in the correct order, so that the “simple” DataSets get updated first and the combined DataSets right after them

Luckily, we can achieve the latter two by utilizing AWS Step Functions and CloudWatch event triggering!

Triggering a Step Function when Glue Workflow has finished

AWS Glue Crawlers create CloudWatch Events on their lifecycle changes, and we can trigger an AWS Step Function State Machine execution when the last Crawler in our Glue Workflow sends a Succeeded event. Here’s a CDK/TypeScript snippet on creating the event rule to watch for Glue Crawler state change events and to start the Step Function execution:

// Event rule to trigger the Step Function.
// smTriggerRole and cfnStateMachine are defined elsewhere in the CDK stack.
new events.CfnRule(this, 'CrawlerSucceededRule', {
  description: 'Glue crawler succeeded',
  roleArn: smTriggerRole.roleArn,
  name: 'statemachine-trigger-rule',
  eventPattern: {
    source: ['aws.glue'],
    'detail-type': ['Glue Crawler State Change'],
    detail: {
      state: ['Succeeded'],
      crawlerName: [{ 'equals-ignore-case': 'hsl-data-glue-crawler' }],
    },
  },
  targets: [
    {
      arn: cfnStateMachine.attrArn,
      id: cfnStateMachine.attrName,
      roleArn: smTriggerRole.roleArn,
    },
  ],
});

Refreshing QuickSight DataSets with AWS Step Functions State Machine

AWS Step Functions has built-in integrations with a wide range of AWS services, including Amazon QuickSight. Therefore, it is straightforward to build a State Machine that refreshes a QuickSight DataSet's SPICE data – a process called DataSet Ingestion. The following image shows an AWS Step Functions State Machine that processes QuickSight DataSet Ingestions in two phases – in the first phase, it ingests data for five QuickSight DataSets in parallel: network, stops, passengers, shifts, and stoptypes. When all of those ingestions have finished successfully, the State Machine continues with the second set of QuickSight DataSets, which in this example contains only one DataSet: passengers-and-stops.

For each QuickSight DataSet, the State Machine

Starts the Ingestion process with a CreateIngestion call and saves the IngestionId value of the started Ingestion process
Checks the Ingestion status with a DescribeIngestion call
If the IngestionStatus is COMPLETED, CANCELLED, or FAILED, it passes the phase
Otherwise, it waits for 20 seconds and checks the Ingestion status again
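
The actual implementation consists of Step Functions states, but as a rough illustration, the per-DataSet polling logic corresponds to something like the following boto3 sketch (the account and DataSet IDs again being placeholders):

import time
import boto3

quicksight = boto3.client("quicksight")
TERMINAL_STATES = {"COMPLETED", "CANCELLED", "FAILED"}

def wait_for_ingestion(account_id: str, dataset_id: str, ingestion_id: str) -> str:
    """Polls DescribeIngestion every 20 seconds until the Ingestion reaches a terminal state."""
    while True:
        ingestion = quicksight.describe_ingestion(
            AwsAccountId=account_id,
            DataSetId=dataset_id,
            IngestionId=ingestion_id,
        )["Ingestion"]
        if ingestion["IngestionStatus"] in TERMINAL_STATES:
            return ingestion["IngestionStatus"]
        time.sleep(20)  # same wait interval as the state machine's Wait state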

Summing it up

As an end result, we now have a data pipeline that is triggered automatically by a predefined schedule or by an EventBridge event, and that ingests the QuickSight DataSets in the correct order as soon as the underlying data has been updated. And now we can enjoy the actionable, up-to-date insights:

In this article, we reviewed the components of a serverless data lake solution on AWS and explored ways to optimize its performance and user experience. Lastly, we learned how to automate the whole process from data ingestion to data insights with AWS Step Functions and AWS Glue Crawler event triggering.

We hope you enjoyed the journey. If you would like to set up a Serverless Data Pipeline and Data Lake on AWS, we are here to help. Just contact NordHero or book a meeting with me!

The examples in this article were built using data adapted from Helsinki Region Transport's (HSL) public data on transport stations, shifts, and passengers. The original data is available on the Helsinki Region Infoshare site.
