<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[The Diligent Engineer]]></title><description><![CDATA[The Diligent Engineer]]></description><link>https://thediligentengineer.com</link><image><url>https://cdn.hashnode.com/res/hashnode/image/upload/v1763569113158/20630180-29a5-48af-a729-05a14094e35c.png</url><title>The Diligent Engineer</title><link>https://thediligentengineer.com</link></image><generator>RSS for Node</generator><lastBuildDate>Sat, 14 Mar 2026 06:11:04 GMT</lastBuildDate><atom:link href="https://thediligentengineer.com/rss.xml" rel="self" type="application/rss+xml"/><language><![CDATA[en]]></language><ttl>60</ttl><atom:link rel="first" href="https://thediligentengineer.com/rss.xml"/><atom:link rel="next" href="https://thediligentengineer.com/rss.xml?after=NjU4NjgzNWE2ZGFmMGY1NDdmMTkwMmIyXzIwMjMtMTItMjNUMDY6NTE6MDYuMDYxWg=="/><item><title><![CDATA[Go: When Does Stack Turn Into Heap?]]></title><description><![CDATA[<hr />
<p>In Go, you don't choose between stack and heap.</p>
<p>The compiler does.</p>
<p>Most local variables live on the stack - fast and cheap.<br />But sometimes a value gets promoted to the heap instead.</p>
<p>Why?</p>
<p>Because of <a href="https://go.dev/src/cmd/compile/internal/escape/escape.go"><em><strong>escape analysis</strong></em></a>.</p>
<hr />
<h2>The Classic Example</h2>
<pre><code class="language-go">func foo() *int {
    x := 10
    return &amp;x
}
</code></pre>
<p>At first glance, this looks dangerous.<br />You're returning a pointer to a local variable.</p>
<p>In C, this is undefined behavior.<br />In Go?</p>
<p>Totally fine.</p>
<p>The compiler sees that <code>x</code> escapes the function scope, so it allocates <code>x</code> on the heap.</p>
<p>Conceptually, it becomes:</p>
<pre><code class="language-go">x := new(int)
*x = 10
return x
</code></pre>
<p>You didn't explicitly ask for a heap allocation.<br />But you got one.</p>
<hr />
<h2>What Does Escape Actually Mean?</h2>
<p>A variable escapes when it might outlive the function's stack frame.</p>
<p>More precisely: if the compiler cannot prove that a value stays within the current frame, it assumes the value escapes.</p>
<p>Common triggers:</p>
<h3>1. Returning a pointer</h3>
<pre><code class="language-go">return &amp;x
</code></pre>
<p>If the value may outlive the function, it escapes.</p>
<p>Note: returning a pointer does not <em>always</em> force heap allocation.<br />If the compiler can prove, often through inlining, that the value does not actually outlive the caller, it may still keep it on the stack.</p>
<p>Escape analysis is conservative, not naive.</p>
<hr />
<h3>2. Captured by a closure</h3>
<pre><code class="language-go">func foo() func() int {
    x := 10
    return func() int { return x }
}
</code></pre>
<p>The closure may live longer than <code>foo()</code>.<br />If the compiler cannot prove otherwise, <code>x</code> goes to the heap.</p>
<hr />
<h3>3. Used in a goroutine</h3>
<pre><code class="language-go">go func() {
    println(x)
}()
</code></pre>
<p>Now execution is asynchronous.<br />The compiler cannot guarantee that the goroutine finishes using <code>x</code> before the enclosing stack frame is gone.</p>
<p>Conservatively, heap allocation.</p>
<hr />
<h3>4. Stored inside something that escapes</h3>
<p>If a struct, slice, or interface value escapes, anything it references may escape too.</p>
<p>Escape is transitive.</p>
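<p>A minimal sketch of this transitivity (names like <code>box</code> and <code>makeBox</code> are illustrative; exact results depend on compiler version and inlining):</p>
<pre><code class="language-go">type box struct {
    p *int
}

// The struct is returned by value, but it carries &amp;n with it.
// Since the box leaves the frame, n escapes too:
// go build -gcflags="-m" typically reports "moved to heap: n".
func makeBox() box {
    n := 42
    return box{p: &amp;n}
}
</code></pre>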
<hr />
<h2>Important: Escape Analysis Is Conservative</h2>
<p>Go's compiler must be correct.</p>
<p>If it is unsure whether something escapes, it assumes it does.</p>
<p>It would rather allocate on the heap than risk memory unsafety.</p>
<p>That means:</p>
<ul>
<li><p>Escape analysis never makes your program incorrect</p>
</li>
<li><p>But it <em>can</em> make it slower</p>
</li>
</ul>
<hr />
<h2>Why This Is Subtle (Rust Helps Explain ;) )</h2>
<p>In Rust, the equivalent code does not compile:</p>
<pre><code class="language-rust">fn foo() -&gt; &amp;i32 {
    let x = 10;
    &amp;x
}
</code></pre>
<p>Rust forces you to model ownership explicitly.</p>
<p>Go does not.</p>
<p>Instead of rejecting your program, Go changes the allocation strategy behind the scenes.</p>
<p>In Rust, memory rules are visible in the type system.<br />In Go, they live inside the compiler.</p>
<p>You do not see the decision.</p>
<p>Unless you run:</p>
<pre><code class="language-shell">go build -gcflags="-m"
</code></pre>
<p>And then the compiler prints:</p>
<pre><code class="language-plaintext">moved to heap: x
</code></pre>
<hr />
<h2>Because Memory Is Serious Business:</h2>
<p>Go:</p>
<blockquote>
<p>I moved it to the heap for safety.</p>
</blockquote>
<p>You:</p>
<blockquote>
<p>But I didn't ask for that.</p>
</blockquote>
<p>Go:</p>
<blockquote>
<p>You're welcome.</p>
</blockquote>
<p>🙂</p>
<hr />
<h2>Why You Should Care</h2>
<p>Heap allocations mean:</p>
<ul>
<li><p>More garbage collection</p>
</li>
<li><p>More latency variance</p>
</li>
<li><p>More pressure under load</p>
</li>
</ul>
<p>For most applications, it does not matter.</p>
<p>For high-performance systems, it absolutely does.</p>
<p>Understanding when stack turns into heap helps you:</p>
<ul>
<li><p>Reduce allocations</p>
</li>
<li><p>Control GC pressure</p>
</li>
<li><p>Write more predictable code</p>
</li>
<li><p>Interpret <code>-gcflags="-m"</code> output with confidence</p>
</li>
</ul>
<hr />
<h2>So?</h2>
<p>Go gives you simplicity.</p>
<p>That simplicity is powered by a sophisticated compiler performing escape analysis behind the scenes.</p>
<p>Stack vs heap in Go is not a syntax decision.</p>
<p>It is a compiler decision.</p>
<p>And sometimes it decides differently than you expect.</p>
<p>Understanding that boundary is one of the steps from writing Go that works to writing Go that performs.</p>
<p>_</p>
<p>_</p>
<p>_</p>
<p>_</p>
<hr />
<h2>P.S. Why Is This (returning var addr) Undefined Behavior in C? 😉</h2>
<p><em>glad you scrolled all the way to this hidden one ;)</em></p>
<p>In C, this code is undefined behavior:</p>
<pre><code class="language-c">int* foo() {
    int x = 10;
    return &amp;x;  // nah, dangling pointer
}
</code></pre>
<p>Why?</p>
<p>Because <code>x</code> lives on the stack, and its lifetime ends when <code>foo()</code> returns.<br />Returning <code>&amp;x</code> creates a <strong>dangling pointer</strong>: a pointer to memory that is no longer valid.</p>
<p>After the function exits:</p>
<ul>
<li><p>The stack frame is popped</p>
</li>
<li><p>That memory may be reused</p>
</li>
<li><p>The pointer now points to garbage</p>
</li>
</ul>
<p>The C standard does not define what happens if you dereference it. It might:</p>
<ul>
<li><p>Seem to work</p>
</li>
<li><p>Return random data</p>
</li>
<li><p>Crash</p>
</li>
<li><p>Corrupt memory</p>
</li>
<li><p>Break only in release builds</p>
</li>
</ul>
<p>All of those are legal outcomes.</p>
<p>Undefined behavior means the compiler assumes you never do this.<br />So it's free to optimize aggressively under that assumption.</p>
<p>That's why:</p>
<ul>
<li><p>Go silently moves the value to the heap</p>
</li>
<li><p>Rust refuses to compile</p>
</li>
<li><p>C says: "You promised you knew what you were doing."</p>
</li>
</ul>
<p>And that promise is where things usually go wrong.</p>
]]></description><link>https://thediligentengineer.com/go-when-does-stack-turn-into-heap</link><guid isPermaLink="true">https://thediligentengineer.com/go-when-does-stack-turn-into-heap</guid><category><![CDATA[memory-management]]></category><category><![CDATA[escape analysis]]></category><category><![CDATA[performance]]></category><category><![CDATA[Rust]]></category><category><![CDATA[C++]]></category><category><![CDATA[C]]></category><category><![CDATA[Go Language]]></category><category><![CDATA[compiler]]></category><category><![CDATA[compilers]]></category><dc:creator><![CDATA[The Diligent Engineer]]></dc:creator></item><item><title><![CDATA[The Cloudf|are Outage [mDraft]]]></title><description><![CDATA[<p>If you tried to use the internet, or to access <a target="_blank" href="https://thediligentengineer.com/">thediligentengineer.com</a>, on November 18, you probably noticed it was broken. For a solid couple of hours, half the web was throwing 502 errors, loading spinners were spinning into eternity, and 𝕏 (the E's bird) was full of people asking if the apocalypse had started.</p>
<p>Cloudflare released their <a target="_blank" href="https://blog.cloudflare.com/18-november-2025-outage/">post-mortem</a> recently. I read it so you don't have to. It wasn't a super-advanced AI cyber-attack or a guy tripping over a power cord.</p>
<p><em>It was a hardcoded limit and a SQL query.</em></p>
<p>Here is the breakdown of how the internet broke, presented without judgment (mostly).</p>
<h3 id="heading-the-happy-path-is-a-whitelie">The "Happy Path" is a ('white')Lie</h3>
<p>According to the report, the issue started with a "feature file" used by their Bot Management system. This file tells the network how to spot bad bots.</p>
<p>The software reading this file had a memory limit. It was hardcoded to handle a certain amount of data. Then, due to a database issue, the file size doubled.</p>
<p>Instead of saying, "Hey, this file is too big, I'm just going to ignore it and keep the website running," the software decided to do the honorable thing and <strong>crash the entire process</strong>.</p>
<p>It's like burning down your kitchen because your toast got stuck. Sure, the toast is gone, but so is the house.</p>
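<p>The safer behavior can be sketched in a few lines of Go (hypothetical names, not Cloudflare's actual code): validate the incoming file against the limit, and on failure keep serving with the last known good config instead of crashing.</p>
<pre><code class="language-go">import "errors"

const maxFeatures = 200 // hypothetical hardcoded limit

// loadFeatures validates an incoming feature file. Instead of
// crashing on an oversized file, it returns an error and hands
// back the last known good config so traffic keeps flowing.
func loadFeatures(current, incoming []string) ([]string, error) {
    if len(incoming) &gt; maxFeatures {
        return current, errors.New("feature file too large; keeping last known good config")
    }
    return incoming, nil
}
</code></pre>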
<h3 id="heading-the-sql-query">The SQL Query</h3>
<p>The root cause was a change in database permissions. Someone decided to make permissions "better" (always a dangerous word).</p>
<p>They had a query running that looked something like this:</p>
<pre><code class="lang-sql"><span class="hljs-keyword">SELECT</span> <span class="hljs-keyword">name</span>, <span class="hljs-keyword">type</span> <span class="hljs-keyword">FROM</span> system.columns <span class="hljs-keyword">WHERE</span> <span class="hljs-keyword">table</span> = <span class="hljs-string">'http_requests_features'</span> <span class="hljs-keyword">order</span> <span class="hljs-keyword">by</span> <span class="hljs-keyword">name</span>;
</code></pre>
<p>Notice anything missing? Like, maybe a <code>WHERE database = 'default'</code> clause?</p>
<p>Because they didn't specify <em>which</em> database to look in, and the new permissions allowed the user to see <em>more</em> databases, the query returned duplicate columns. It returned the columns for the default view <em>and</em> the underlying storage.</p>
<p>SQL does exactly what you tell it to do, which is usually the problem. The system assumed, "I will only ever see what I expect to see." The database replied, "Here is everything you asked for," and the application choked on it.</p>
<h3 id="heading-the-is-it-a-ddos-phase">The "Is it a DDoS?" Phase</h3>
<p>My favorite part of the report is the "Fog of War."</p>
<p>When the system started failing, the engineers looked at the graphs. The error rates were fluctuating wildly: up, then down, then up again.</p>
<p>Why? Because the bad config file was being regenerated every 5 minutes. Depending on which database shard the query hit, it would either generate a good file (recovery!) or a bad file (crash!).</p>
<p>Because of this weird pattern, the team naturally assumed: <strong>"We are under attack."</strong></p>
<p>They thought it was a massive DDoS. At the same time, the Cloudflare Status Page went down. That turned out to be a total coincidence: the status page is hosted on completely different infrastructure. But imagine being in that war room: the network is crashing, the graphs look crazy, and even your Status Page is dead. You'd probably start looking out the window for alien spaceships.</p>
<h3 id="heading-the-fix">The Fix</h3>
<p>Once they realized it wasn't hackers, but their own code checking out early, the fix was simple:</p>
<ol>
<li><p>Stop generating the bad file.</p>
</li>
<li><p>Put the old, good file back.</p>
</li>
<li><p>Restart the thing.</p>
</li>
</ol>
<p>It's the enterprise version of "Have you tried turning it off and on again?"</p>
<h3 id="heading-the-internet-blames-rust-naturally">The Internet Blames Rust (Naturally)</h3>
<p>Cloudflare is famous for loving the Rust programming language because it is memory-safe.</p>
<p>Naturally, as soon as the outage happened, everyone on Hacker News and Reddit glanced at the headline, saw "limit exceeded," and immediately decided the entire Cloudflare codebase must look exactly like this single line of code:</p>
<pre><code class="lang-rust"><span class="hljs-function"><span class="hljs-keyword">fn</span> <span class="hljs-title">load_global_config</span></span>() {
    <span class="hljs-comment">// We expect the file to be small.</span>
    <span class="hljs-comment">// We are seemingly 100% sure it will be small.</span>
    <span class="hljs-comment">// So we use the "Trust Me Bro" function:</span>

    <span class="hljs-keyword">let</span> safe_config = download_features()
        .fit_into_memory_limit()
        .unwrap(); <span class="hljs-comment">// &lt;--- The exact moment the internet died</span>
}
</code></pre>
<p>It's cutesy. We love to blame <code>.unwrap()</code> because it's the developer saying, "I promise this value will never be an error." And on November 18, the database looked at that promise and said, "Bet."</p>
<p>Technically, the code <em>was</em> safe. It safely exited the building... it just took all our traffic with it. 😉</p>
<h3 id="heading-the-verdict">The Verdict</h3>
<p>Cloudflare is great, and their transparency is gold standard. But this is a gentle reminder to all of us who write code:</p>
<ul>
<li><p><strong>Validate your inputs</strong>, even if they come from inside the house.</p>
</li>
<li><p><strong>Be explicit in SQL</strong>, because databases love to be technically correct but practically destructive.</p>
</li>
<li><p><strong>Don't hardcode limits</strong> that crash the main thread if exceeded. Just log an error. We promise we'll read the logs eventually. Maybe.</p>
</li>
</ul>
<p>The internet is a fragile ecosystem held together by duct tape and <code>SELECT *</code> statements. Stay safe out there.</p>
]]></description><link>https://thediligentengineer.com/the-cloudfare-outage-mdraft</link><guid isPermaLink="true">https://thediligentengineer.com/the-cloudfare-outage-mdraft</guid><category><![CDATA[memes]]></category><dc:creator><![CDATA[The Diligent Engineer]]></dc:creator></item><item><title><![CDATA[The Redlock Algorithm]]></title><description><![CDATA[<p>The Redlock algorithm is a distributed locking mechanism designed for Redis to ensure mutual exclusion across clients in a distributed system. It operates by acquiring locks on a majority of Redis nodes to guard against node failures and network partitions. Despite its popularity, Redlock has sparked debate regarding its safety guarantees and implementation complexity.</p>
<h2 id="heading-distributed-locks">Distributed locks</h2>
<p>Distributed locks are crucial for coordinating access to shared resources in distributed applications. Traditional locking mechanisms often rely on a central coordinator, which can become a single point of failure. Redlock offers a decentralized approach by leveraging multiple independent Redis instances.</p>
<h2 id="heading-how-redlock-works">How Redlock Works</h2>
<p>To acquire a lock, a client generates a unique token and sends a <code>SET key value NX PX ttl</code> command to each of the N Redis instances. The client considers the lock acquired only if it successfully locks a majority (quorum) of the instances. The total time to acquire the lock must be less than the specified TTL, to prevent expired locks from being mistaken as held. When the client completes its critical section, it releases the lock by sending a <code>DEL key</code> command to all instances, but only if the stored token matches, to prevent releasing someone else's lock.</p>
<h2 id="heading-algorithm">Algorithm</h2>
<ol>
<li><strong>Acquire</strong>: The client tries to acquire the lock on each instance sequentially with a unique identifier and expiration time.  </li>
<li><strong>Validate</strong>: It computes the elapsed time and verifies if the quorum was achieved within the TTL to ensure lock validity.  </li>
<li><strong>Commit/Abort</strong>: If the quorum is met and the timing constraints hold, the client holds the lock; otherwise, it rolls back by deleting partial locks from the instances.</li>
</ol>
<pre><code class="lang-mermaid">sequenceDiagram
    participant Client
    participant Redis1
    participant Redis2
    participant Redis3
    participant Redis4
    participant Redis5

    Note over Client: Step 1: Generate unique lock ID &amp; start timer

    Client-&gt;&gt;Redis1: SET resource_name lock_id NX PX 3000
    Client-&gt;&gt;Redis2: SET resource_name lock_id NX PX 3000
    Client-&gt;&gt;Redis3: SET resource_name lock_id NX PX 3000
    Client-&gt;&gt;Redis4: SET resource_name lock_id NX PX 3000
    Client-&gt;&gt;Redis5: SET resource_name lock_id NX PX 3000

    Note over Client: Step 2: Wait for quorum (e.g., 3/5 successes)

    alt Quorum Achieved within timeout
        Note over Client: Step 3: Lock acquired
    else Quorum NOT achieved
        Note over Client: Step 3: Roll back&lt;br/&gt;delete partial locks
        Client-&gt;&gt;Redis1: DEL resource_name (if lock_id matches)
        Client-&gt;&gt;Redis2: DEL resource_name (if lock_id matches)
        Client-&gt;&gt;Redis3: DEL resource_name (if lock_id matches)
        Client-&gt;&gt;Redis4: DEL resource_name (if lock_id matches)
        Client-&gt;&gt;Redis5: DEL resource_name (if lock_id matches)
    end

    Note over Client: Step 4: Perform work under lock

    Note over Client: Step 5: Release lock
    Client-&gt;&gt;Redis1: DEL resource_name (if lock_id matches)
    Client-&gt;&gt;Redis2: DEL resource_name (if lock_id matches)
    Client-&gt;&gt;Redis3: DEL resource_name (if lock_id matches)
    Client-&gt;&gt;Redis4: DEL resource_name (if lock_id matches)
    Client-&gt;&gt;Redis5: DEL resource_name (if lock_id matches)
</code></pre>
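<p>The acquire/validate/abort loop above can be sketched in Go. This is an illustrative sketch, not a production client: the <code>locker</code> interface stands in for a real Redis connection issuing <code>SET ... NX PX</code> and token-checked <code>DEL</code>, and <code>fakeNode</code> is an in-memory stand-in for one Redis instance.</p>
<pre><code class="language-go">import "time"

// locker abstracts a single Redis instance.
type locker interface {
    TryLock(key, token string, ttl time.Duration) bool
    Unlock(key, token string)
}

// fakeNode simulates one Redis instance in memory.
type fakeNode struct {
    held map[string]string
    up   bool
}

func (f *fakeNode) TryLock(key, token string, _ time.Duration) bool {
    if !f.up {
        return false
    }
    if _, taken := f.held[key]; taken {
        return false
    }
    f.held[key] = token
    return true
}

func (f *fakeNode) Unlock(key, token string) {
    if f.held[key] == token { // only release our own lock
        delete(f.held, key)
    }
}

// acquire locks every instance, counts successes, and checks that
// the elapsed time stayed under the TTL. On failure it rolls back
// any partial locks, as step 3 of the algorithm requires.
func acquire(instances []locker, key, token string, ttl time.Duration) bool {
    start := time.Now()
    acquired := 0
    for _, inst := range instances {
        if inst.TryLock(key, token, ttl) {
            acquired++
        }
    }
    quorum := len(instances)/2 + 1
    if acquired &gt;= quorum &amp;&amp; time.Since(start) &lt; ttl {
        return true
    }
    for _, inst := range instances {
        inst.Unlock(key, token) // no-op unless our token holds the lock
    }
    return false
}
</code></pre>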
<h2 id="heading-it-seems-like-twophase-commit">It seems like TwoPhase Commit?</h2>
<p>Redlock resembles a two-phase commit in that it performs a prepare phase (acquiring locks) followed by a commit or abort (keeping or releasing locks) based on quorum success. Unlike traditional 2PC, Redlock does not rely on a central coordinator and can tolerate some instance failures without blocking the system.</p>
<div class="hn-table">
<table>
<thead>
<tr>
<td></td><td>Redlock</td><td>2PC</td></tr>
</thead>
<tbody>
<tr>
<td><strong>Quorum vs All-or-Nothing</strong></td><td>Redlock tolerates partial failure (just needs majority)</td><td>2PC requires full agreement</td></tr>
<tr>
<td><strong>Coordinator role</strong></td><td>No formal coordinator; the client drives the logic</td><td>Central coordinator (e.g., transaction manager)</td></tr>
<tr>
<td><strong>Durability guarantees</strong></td><td>Redis isn't strongly consistent</td><td>2PC assumes stronger consistency and logging</td></tr>
<tr>
<td><strong>Failure tolerance</strong></td><td>More optimistic, faster (but less safe in edge cases)</td><td>More rigid but safer in distributed systems</td></tr>
</tbody>
</table>
</div><h2 id="heading-pros-and-cons">Pros and Cons</h2>
<p><strong>Pros</strong>  </p>
<ul>
<li>Provides fault tolerance by ensuring locks remain valid even if some Redis nodes fail.  </li>
<li>Uses TTLs to prevent deadlocks by automatically expiring locks if clients crash before releasing them.  </li>
</ul>
<p><strong>Cons</strong>  </p>
<ul>
<li>Requires careful coordination of multiple Redis instances, increasing operational complexity.  </li>
<li>May not guarantee mutual exclusion under certain failure scenarios, like network partitions during replication.</li>
</ul>
<h2 id="heading-best-practices">Best Practices</h2>
<p>Operators should deploy an odd number of independent Redis masters and keep clock drift between them small, since large drift can let locks expire early on some nodes. Clients should handle lock acquisition failures gracefully by implementing retry logic with exponential backoff and random jitter to avoid thundering herd problems.</p>
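<p>The retry logic might look like this sketch in Go (<code>withRetry</code> and its constants are illustrative; tune them for your workload):</p>
<pre><code class="language-go">import (
    "math/rand"
    "time"
)

// withRetry attempts fn up to maxAttempts times, sleeping an
// exponentially growing, jittered delay between failures so that
// competing clients do not all retry at the same instant.
func withRetry(fn func() bool, maxAttempts int, base time.Duration) bool {
    for attempt := 0; attempt &lt; maxAttempts; attempt++ {
        if fn() {
            return true
        }
        backoff := base &lt;&lt; attempt // base, 2*base, 4*base, ...
        jitter := time.Duration(rand.Int63n(int64(base)))
        time.Sleep(backoff + jitter)
    }
    return false
}
</code></pre>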
<h2 id="heading-implementations">Implementations</h2>
<p>According to <a target="_blank" href="https://redis.io/docs/latest/develop/use/patterns/distributed-locks/">redis</a>:</p>
<ul>
<li><a target="_blank" href="https://github.com/antirez/redlock-rb">Redlock-rb</a> (Ruby implementation). There is also a <a target="_blank" href="https://github.com/leandromoreira/redlock-rb">fork of Redlock-rb</a> that adds a gem for easy distribution.</li>
<li><a target="_blank" href="https://github.com/0exp/redis_queued_locks">RedisQueuedLocks</a> (Ruby implementation).</li>
<li><a target="_blank" href="https://github.com/SPSCommerce/redlock-py">Redlock-py</a> (Python implementation).</li>
<li><a target="_blank" href="https://github.com/brainix/pottery#redlock">Pottery</a> (Python implementation).</li>
<li><a target="_blank" href="https://github.com/joanvila/aioredlock">Aioredlock</a> (Asyncio Python implementation).</li>
<li><a target="_blank" href="https://github.com/malkusch/lock#redismutex">RedisMutex</a> (PHP implementation with both <a target="_blank" href="https://github.com/phpredis/phpredis">Redis extension</a> and <a target="_blank" href="https://github.com/predis/predis">Predis library</a> clients support).</li>
<li><a target="_blank" href="https://github.com/ronnylt/redlock-php">Redlock-php</a> (PHP implementation).</li>
<li><a target="_blank" href="https://github.com/cheprasov/php-redis-lock">cheprasov/php-redis-lock</a> (PHP library for locks).</li>
<li><a target="_blank" href="https://github.com/rtckit/reactphp-redlock">rtckit/react-redlock</a> (Async PHP implementation).</li>
<li><a target="_blank" href="https://github.com/go-redsync/redsync">Redsync</a> (Go implementation).</li>
<li><a target="_blank" href="https://github.com/mrniko/redisson">Redisson</a> (Java implementation).</li>
<li><a target="_blank" href="https://github.com/sbertrang/redis-distlock">Redis::DistLock</a> (Perl implementation).</li>
<li><a target="_blank" href="https://github.com/jacket-code/redlock-cpp">Redlock-cpp</a> (C++ implementation).</li>
<li><a target="_blank" href="https://github.com/sewenew/redis-plus-plus/#redlock">Redis-plus-plus</a> (C++ implementation).</li>
<li><a target="_blank" href="https://github.com/kidfashion/redlock-cs">Redlock-cs</a> (C#/.NET implementation).</li>
<li><a target="_blank" href="https://github.com/samcook/RedLock.net">RedLock.net</a> (C#/.NET implementation). Includes async and lock extension support.</li>
<li><a target="_blank" href="https://github.com/psibernetic/scarletlock">ScarletLock</a> (C# .NET implementation with configurable datastore).</li>
<li><a target="_blank" href="https://github.com/LiZhenNet/Redlock4Net">Redlock4Net</a> (C# .NET implementation).</li>
<li><a target="_blank" href="https://github.com/mike-marcacci/node-redlock">node-redlock</a> (NodeJS implementation). Includes support for lock extension.</li>
<li><a target="_blank" href="https://github.com/oslabs-beta/Deno-Redlock">Deno DLM</a> (Deno implementation)</li>
<li><a target="_blank" href="https://github.com/hexcowboy/rslock">Rslock</a> (Rust implementation). Includes async and lock extension support.</li>
</ul>
<h2 id="heading-conclusion">Conclusion</h2>
<p>Redlock remains a valuable tool for distributed locking in Redis when used with care and understanding of its tradeoffs. By following best practices and acknowledging its limitations, teams can leverage Redlock to maintain data consistency in distributed systems.</p>
<h2 id="heading-references">References</h2>
<ol>
<li><a target="_blank" href="https://redis.io/docs/latest/develop/use/patterns/distributed-locks/">Distributed Locks with Redis | Redis Docs</a>  </li>
<li><a target="_blank" href="https://redis.io/glossary/redis-lock/">Redis Lock  Redlock Algorithm | Redis Glossary</a> </li>
<li><a target="_blank" href="https://en.wikipedia.org/wiki/Distributed_lock_manager">Distributed lock manager  Wikipedia</a>  </li>
</ol>
]]></description><link>https://thediligentengineer.com/the-redlock-algorithm</link><guid isPermaLink="true">https://thediligentengineer.com/the-redlock-algorithm</guid><category><![CDATA[redlock]]></category><category><![CDATA[Redis]]></category><category><![CDATA[Node.js]]></category><category><![CDATA[distributed system]]></category><category><![CDATA[distributed-transactions]]></category><category><![CDATA[algorithms]]></category><dc:creator><![CDATA[The Diligent Engineer]]></dc:creator></item><item><title><![CDATA[Writing SQL in Big Data System]]></title><description><![CDATA[<p>Working with massive datasets changes how you approach writing SQL. It's not just about writing correct queries; it's about writing <em>smart</em> ones that respect scale, cost, and the distributed nature of the underlying systems. Here are some hard-earned lessons and best practices from my journey wrangling SQL in big data environments like Hive, Presto, BigQuery, and Spark SQL.</p>
<hr />
<h2 id="heading-1-push-work-down-let-the-engine-work-for-you">1. <strong>Push Work Down  Let the Engine Work for You</strong></h2>
<p>In distributed systems, each stage of the query execution plan (especially filters and projections) matters. The golden rule:</p>
<blockquote>
<p><strong>Always filter early, project only what you need.</strong></p>
</blockquote>
<p>By reducing the amount of data read and shuffled, you cut down IO, memory usage, and execution time significantly. For example:</p>
<pre><code class="lang-sql"><span class="hljs-comment">-- Avoid</span>
<span class="hljs-keyword">SELECT</span> * <span class="hljs-keyword">FROM</span> <span class="hljs-keyword">events</span> <span class="hljs-keyword">WHERE</span> event_type = <span class="hljs-string">'purchase'</span>;

<span class="hljs-comment">-- Prefer</span>
<span class="hljs-keyword">SELECT</span> user_id, event_time <span class="hljs-keyword">FROM</span> <span class="hljs-keyword">events</span> <span class="hljs-keyword">WHERE</span> event_type = <span class="hljs-string">'purchase'</span>;
</code></pre>
<hr />
<h2 id="heading-2-understand-partitioning-and-bucketing">2. <strong>Understand Partitioning and Bucketing</strong></h2>
<p>Big data systems usually partition tables to improve query performance. If you ignore partitioning, you're likely scanning terabytes unnecessarily.</p>
<blockquote>
<p>Know your table's partition columns, and <strong>use them in WHERE clauses</strong>.</p>
</blockquote>
<pre><code class="lang-sql"><span class="hljs-comment">-- Leverage partition pruning</span>
<span class="hljs-keyword">SELECT</span> <span class="hljs-keyword">COUNT</span>(*) 
<span class="hljs-keyword">FROM</span> <span class="hljs-keyword">logs</span> 
<span class="hljs-keyword">WHERE</span> event_date = <span class="hljs-string">'2025-04-15'</span>;
</code></pre>
<p>If you're designing tables, partition on columns that are filtered frequently but have manageable cardinality, such as dates. Avoid over-partitioning (e.g., hourly), which leads to small files and metadata overhead.</p>
<hr />
<h2 id="heading-3-watch-for-data-skew">3. <strong>Watch for Data Skew</strong></h2>
<p>One sneaky performance killer is skewed data, where a few values dominate a join or group by. This causes uneven workload distribution, leaving some workers idle while others are overloaded.</p>
<p><strong>Telltale signs:</strong></p>
<ul>
<li>Stages stuck at 99% for a long time.</li>
<li>Huge temp spill files on specific workers.</li>
</ul>
<h3 id="heading-how-to-mitigate">How to mitigate:</h3>
<ul>
<li><strong>Salting keys</strong> (add a random suffix to break hotspots).</li>
<li>Use approximate aggregations where precision isn't mission-critical.</li>
<li>Rewrite joins as map-side when appropriate.</li>
</ul>
<hr />
<h2 id="heading-4-avoid-cross-joins-and-cartesian-products">4. <strong>Avoid Cross Joins and Cartesian Products</strong></h2>
<p>It seems obvious, but in massive datasets, even an accidental cross join can generate <em>petabytes</em> of intermediate data. Be explicit in join conditions and cautious with one-to-many relationships.</p>
<pre><code class="lang-sql"><span class="hljs-comment">-- Danger zone</span>
<span class="hljs-keyword">SELECT</span> a.*, b.* 
<span class="hljs-keyword">FROM</span> <span class="hljs-keyword">users</span> a, <span class="hljs-keyword">events</span> b 
<span class="hljs-keyword">WHERE</span> a.user_id = b.user_id;  <span class="hljs-comment">-- missing JOIN syntax is risky</span>

<span class="hljs-comment">-- Safer</span>
<span class="hljs-keyword">SELECT</span> a.*, b.*
<span class="hljs-keyword">FROM</span> <span class="hljs-keyword">users</span> a
<span class="hljs-keyword">JOIN</span> <span class="hljs-keyword">events</span> b <span class="hljs-keyword">ON</span> a.user_id = b.user_id;
</code></pre>
<hr />
<h2 id="heading-5-use-with-clauses-and-ctes-judiciously">5. <strong>Use WITH Clauses and CTEs Judiciously</strong></h2>
<p>CTEs (Common Table Expressions) improve readability, but <strong>some engines inline them at every reference</strong> rather than materializing them once, which can result in recomputation. For frequently reused logic or expensive computations, <strong>cache results or persist to a temp table</strong>.</p>
<pre><code class="lang-sql"><span class="hljs-comment">-- Avoid repeating subqueries in massive datasets</span>
<span class="hljs-keyword">WITH</span> filtered <span class="hljs-keyword">AS</span> (
  <span class="hljs-keyword">SELECT</span> user_id, <span class="hljs-keyword">SUM</span>(amount) <span class="hljs-keyword">AS</span> total_spent
  <span class="hljs-keyword">FROM</span> transactions
  <span class="hljs-keyword">WHERE</span> event_date &gt;= <span class="hljs-string">'2025-01-01'</span>
  <span class="hljs-keyword">GROUP</span> <span class="hljs-keyword">BY</span> user_id
)
<span class="hljs-keyword">SELECT</span> * <span class="hljs-keyword">FROM</span> filtered <span class="hljs-keyword">WHERE</span> total_spent &gt; <span class="hljs-number">1000</span>;
</code></pre>
<hr />
<h2 id="heading-6-understand-cost-models-and-query-plans">6. <strong>Understand Cost Models and Query Plans</strong></h2>
<p>A habit worth cultivating: <strong>always inspect the execution plan</strong>. Use <code>EXPLAIN</code> or visual tools to:</p>
<ul>
<li>Spot full scans</li>
<li>Understand join order</li>
<li>Check shuffle/sort stages</li>
</ul>
<p>This is especially critical in systems like Spark or Presto where optimizers can make suboptimal decisions.</p>
<blockquote>
<p>💡 Pro Tip: </p>
<p>In BigQuery, filter on partitioned and clustered columns to benefit from partition pruning and block elimination.</p>
</blockquote>
<hr />
<h2 id="heading-7-beware-of-wide-joins-and-explosions">7. <strong>Beware of Wide Joins and Explosions</strong></h2>
<p>Be mindful of fan-out joins, especially when joining arrays or struct types that can cause row multiplication.</p>
<blockquote>
<p>If you're joining nested fields, <strong>flatten responsibly</strong>.</p>
</blockquote>
<p>In systems like Spark SQL:</p>
<ul>
<li>Use <code>explode()</code> with care.</li>
<li>Always check the row count before and after to detect accidental data amplification.</li>
</ul>
<hr />
<h2 id="heading-8-monitor-query-cost-and-audit-usage">8. <strong>Monitor Query Cost and Audit Usage</strong></h2>
<p>In systems like BigQuery or Athena, you pay by data scanned. Even a simple <code>SELECT COUNT(*) FROM table</code> can cost dollars if you don't prune partitions or filter early.</p>
<p>Set up dashboards:</p>
<ul>
<li>Query cost by user or team</li>
<li>Top tables by scan volume</li>
<li>Longest running queries</li>
</ul>
<p>Cost-awareness = performance discipline.</p>
<hr />
<h2 id="heading-9-materialize-strategic-results">9. <strong>Materialize Strategic Results</strong></h2>
<p>Instead of chaining many heavy transformations, <strong>break your pipeline</strong>:</p>
<ul>
<li>Materialize intermediate results (to a temp or summary table).</li>
<li>Validate, then continue downstream.</li>
</ul>
<p>This helps with reusability, debuggability, and disaster recovery.</p>
<hr />
<h2 id="heading-10-automate-linting-and-review-of-sql">10. <strong>Automate Linting and Review of SQL</strong></h2>
<p>Treat SQL as production code. Use tools that lint SQL for anti-patterns (e.g., full scans, cross joins) or enforce style consistency in your team.</p>
<p>For example:</p>
<ul>
<li><strong>SQLFluff</strong> for syntax/style</li>
<li><strong>Datafold or dbt</strong> for CI/CD with data quality checks</li>
</ul>
<hr />
<h1 id="heading-final-thoughts">Final Thoughts</h1>
<p>Working with SQL in big data is like driving a race car: you're dealing with speed, power, and risk. The key is to <strong>design queries that respect scale</strong>, understand the engine underneath, and develop a culture of experimentation and observability.</p>
<p>The faster you get at reading query plans, the faster you become at writing efficient SQL.</p>
<p>If you're just starting out, pick one of these lessons and apply it to a real query; you'll be surprised how far a small change can go at scale.</p>
]]></description><link>https://thediligentengineer.com/writing-sql-in-big-data-system</link><guid isPermaLink="true">https://thediligentengineer.com/writing-sql-in-big-data-system</guid><category><![CDATA[SQL]]></category><category><![CDATA[big data]]></category><dc:creator><![CDATA[The Diligent Engineer]]></dc:creator></item><item><title><![CDATA[Effect?]]></title><description><![CDATA[<p>I've been working with TypeScript for years, and I've always been seeking better ways to handle side effects, errors, and dependencies. When I discovered Effect, it changed my approach to functional programming completely.</p>
<h2 id="heading-what-effect-really-is">What Effect Really Is</h2>
<p><a target="_blank" href="https://effect.website/">Effect</a> isn't just another TypeScript library. As Ethan Niser explains in his <a target="_blank" href="https://ethanniser.dev/blog/the-truth-about-effect/">insightful article</a>:</p>
<blockquote>
<p><strong>Effect is a language.</strong></p>
<p>Specifically, Effect is an attempt to answer a question that many people have asked, and a few have answered: what would it look like if we had a language for describing effectful computations?</p>
</blockquote>
<p>This realization hit me when I was reimagining a GitHub followers tracking tool. Effect isn't merely a collection of utility functions; it's a comprehensive approach to handling effectful computations within TypeScript.</p>
<p>What makes Effect special is how it extends existing tools:</p>
<ul>
<li>It extends <code>Promise</code> by making laziness, error handling, retries, interruption, and observability first-class citizens</li>
<li>It extends TypeScript by adding typed errors, typed dependency injection, and structured concurrency</li>
<li>It provides a rich standard library while making effects a language primitive</li>
</ul>
<p>Let me show you the contrast between traditional Promise approaches and Effect:</p>
<pre><code class="lang-mermaid">flowchart LR
    subgraph "Traditional Promise Approach"
    A[Call API] --&gt; B{Success?}
    B --&gt;|Yes| C[Process Data]
    B --&gt;|No| D[Catch Error]
    D --&gt; E[Log or Rethrow]
    end

    subgraph "Effect Approach"
    F[Define Effect] --&gt; G[Compose with other Effects]
    G --&gt; H[Handle all errors]
    H --&gt; I[Inject dependencies]
    I --&gt; J[Run the Effect]
    end

    style F fill:#d4f1f9
    style G fill:#d4f1f9
    style H fill:#d4f1f9
    style I fill:#d4f1f9
    style J fill:#d4f1f9
</code></pre>
<p>With traditional Promises, error handling is bolted on with <code>.catch()</code>. With Effect, errors are part of the type signature itself - you simply can't forget to handle them.</p>
<h2 id="heading-the-tradeoffs-of-effect">The Tradeoffs of Effect</h2>
<p>Before diving into my implementation, I should note that Effect isn't without its tradeoffs:</p>
<ol>
<li><strong>Learning curve</strong> - Despite being more approachable than pure FP libraries, there's still a learning curve, especially for teams unfamiliar with functional concepts</li>
<li><strong>Bundle size</strong> - At 14KB minified+gzipped (core), it's lightweight but not free</li>
<li><strong>Ecosystem maturity</strong> - While growing rapidly, the ecosystem isn't as extensive as more established libraries</li>
<li><strong>Debug experience</strong> - Stack traces can sometimes be harder to interpret due to the generator-based implementation</li>
</ol>
<p>Despite these considerations, for many applications, the benefits significantly outweigh these costs.</p>
<h2 id="heading-play-with-effect">Play with Effect</h2>
<p>To test the ideas behind this new "language", I took an existing tool, <a target="_blank" href="https://github.com/tuananh/github-followers-watch">GitHub Followers Watch</a>, originally written in Go, and reimplemented it as <a target="_blank" href="https://github.com/lyluongthien/github-watcher">GitHub Watcher</a> using TypeScript and Effect.</p>
<p>The original tool's purpose was simple: track GitHub followers and following lists to monitor changes over time. But rebuilding it with Effect demonstrated some powerful concepts:</p>
<pre><code class="lang-mermaid">flowchart TD
    A[GitHub API] --&gt; B[GitHub API Layer]
    B --&gt; C[Effect Program]
    C --&gt; D{Command Line Interface}
    D --&gt; E[List Followers Command]
    D --&gt; F[List Following Command]
    E --&gt; G[Output to Console/File]
    F --&gt; G
    H[Environment Variables] --&gt; B
</code></pre>
<h3 id="heading-typed-errors-no-more-trycatch-chaos">Typed Errors: No More Try/Catch Chaos</h3>
<pre><code class="lang-typescript"><span class="hljs-keyword">export</span> <span class="hljs-keyword">class</span> GitHubApiError <span class="hljs-keyword">extends</span> <span class="hljs-built_in">Error</span> {
  <span class="hljs-keyword">readonly</span> _tag = <span class="hljs-string">"GitHubApiError"</span>;
  <span class="hljs-keyword">constructor</span>(<span class="hljs-params">message: <span class="hljs-built_in">string</span></span>) {
    <span class="hljs-built_in">super</span>(message);
    <span class="hljs-built_in">this</span>.name = <span class="hljs-string">"GitHubApiError"</span>;
  }
}
</code></pre>
<p>This isn't just error handling; it's precise, typed error handling that lets you know exactly what can fail at compile time.</p>
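<p>Because the error carries a literal <code>_tag</code>, callers can narrow on it exhaustively with plain TypeScript, even outside Effect's combinators. A minimal sketch (the <code>RateLimitError</code> here is a hypothetical second error type added only for illustration, not part of the original tool):</p>

```typescript
class GitHubApiError extends Error {
  readonly _tag = "GitHubApiError" as const;
  constructor(message: string) {
    super(message);
    this.name = "GitHubApiError";
  }
}

// Hypothetical second error type, added only to show exhaustive narrowing.
class RateLimitError extends Error {
  readonly _tag = "RateLimitError" as const;
  constructor(readonly retryAfterMs: number) {
    super(`rate limited, retry after ${retryAfterMs}ms`);
  }
}

type ApiError = GitHubApiError | RateLimitError;

// Switching on _tag narrows each branch; if a new member is added to
// ApiError, the compiler flags any switch that forgets to handle it.
function describe(err: ApiError): string {
  switch (err._tag) {
    case "GitHubApiError":
      return `API failure: ${err.message}`;
    case "RateLimitError":
      return `retry in ${err.retryAfterMs}ms`;
  }
}
```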
<h3 id="heading-describing-effects-with-types">Describing Effects With Types</h3>
<pre><code class="lang-typescript"><span class="hljs-keyword">interface</span> GitHubApi {
  <span class="hljs-keyword">readonly</span> getSelfID: Effect.Effect&lt;<span class="hljs-built_in">string</span>, GitHubApiError&gt;;
  <span class="hljs-keyword">readonly</span> listAllFollowing: <span class="hljs-function">(<span class="hljs-params">username: <span class="hljs-built_in">string</span></span>) =&gt;</span> Effect.Effect&lt;<span class="hljs-built_in">string</span>[], GitHubApiError&gt;;
  <span class="hljs-keyword">readonly</span> listAllFollowers: <span class="hljs-function">(<span class="hljs-params">username: <span class="hljs-built_in">string</span></span>) =&gt;</span> Effect.Effect&lt;<span class="hljs-built_in">string</span>[], GitHubApiError&gt;;
}
</code></pre>
<p>Look at that interface. The return type tells you everything: what the success value is, what can go wrong, and what dependencies it has, all in one unified type.</p>
<p>Here's how Effect's type signature compares to other approaches:</p>
<pre><code class="lang-mermaid">classDiagram
    class Promise {
        +then(success)
        +catch(error)
        -No typed errors
        -No dependency tracking
    }

    class Either {
        +map(f)
        +flatMap(f)
        +Typed errors
        -Eager evaluation
        -No dependency tracking
    }

    class Effect {
        +map(f)
        +flatMap(f)
        +provide(dependencies)
        +Typed errors
        +Dependency tracking
        +Resource management
        +Lazy evaluation
        +Structured concurrency
    }
</code></pre>
<h3 id="heading-generator-based-composition-readable-async-code">Generator-Based Composition: Readable Async Code</h3>
<pre><code class="lang-typescript"><span class="hljs-keyword">export</span> <span class="hljs-keyword">const</span> printFollowers = Effect.gen(<span class="hljs-function"><span class="hljs-keyword">function</span>* (<span class="hljs-params">_</span>) </span>{
  <span class="hljs-keyword">const</span> api = <span class="hljs-keyword">yield</span>* _(Effect.map(GitHubApiLive, <span class="hljs-function"><span class="hljs-params">layer</span> =&gt;</span> layer.GitHubApi));
  <span class="hljs-keyword">const</span> login = <span class="hljs-keyword">yield</span>* _(api.getSelfID);
  <span class="hljs-keyword">const</span> followers = <span class="hljs-keyword">yield</span>* _(api.listAllFollowers(login));
  <span class="hljs-keyword">return</span> <span class="hljs-keyword">yield</span>* _(Console.log(<span class="hljs-string">`<span class="hljs-subst">${followers.join(<span class="hljs-string">"\n"</span>)}</span>`</span>));
});
</code></pre>
<p>This is where Effect shines. The generator syntax makes asynchronous code read like synchronous code, but without losing precise error handling or types.</p>
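<p>To see why this reads so naturally, here is a stripped-down sketch of the underlying idea: a driver loop that awaits each yielded value and resumes the generator with the result. This is a conceptual toy, not Effect's actual runtime, which also threads typed errors, context, and interruption through the loop; the login and follower values are stand-ins for real API calls.</p>

```typescript
// Minimal driver for a generator that yields Promises: each `yield p`
// suspends the function until p settles, then resumes with the value.
async function run<T>(
  gen: () => Generator<Promise<unknown>, T, unknown>
): Promise<T> {
  const it = gen();
  let input: unknown = undefined;
  while (true) {
    const { value, done } = it.next(input);
    if (done) return value as T;
    input = await value; // resume the generator with the awaited result
  }
}

// Async steps read top to bottom, like the Effect.gen example above.
const program = function* () {
  const login = (yield Promise.resolve("octocat")) as string;
  const followers = (yield Promise.resolve(["alice", "bob"])) as string[];
  return `${login}: ${followers.join(", ")}`;
};
```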
<h3 id="heading-dependency-injection-without-the-framework">Dependency Injection Without the Framework</h3>
<pre><code class="lang-typescript"><span class="hljs-keyword">export</span> <span class="hljs-keyword">const</span> GitHubApiLive = Effect.gen(<span class="hljs-function"><span class="hljs-keyword">function</span>* (<span class="hljs-params">_</span>) </span>{
  <span class="hljs-keyword">const</span> token = <span class="hljs-keyword">yield</span>* _(Effect.try({
    <span class="hljs-keyword">try</span>: <span class="hljs-function">() =&gt;</span> {
      <span class="hljs-keyword">const</span> token = process.env.PERSONAL_ACCESS_TOKEN;
      <span class="hljs-keyword">if</span> (!token) <span class="hljs-keyword">throw</span> <span class="hljs-keyword">new</span> <span class="hljs-built_in">Error</span>(<span class="hljs-string">"PERSONAL_ACCESS_TOKEN env var not set"</span>);
      <span class="hljs-keyword">return</span> token;
    },
    <span class="hljs-keyword">catch</span>: <span class="hljs-function">() =&gt;</span> <span class="hljs-keyword">new</span> GitHubApiError(<span class="hljs-string">"PERSONAL_ACCESS_TOKEN env var not set"</span>)
  }));

  <span class="hljs-keyword">const</span> octokit = <span class="hljs-keyword">new</span> Octokit({ auth: token });
  <span class="hljs-keyword">return</span> makeGitHubApi(octokit);
}).pipe(Effect.map(<span class="hljs-function"><span class="hljs-params">api</span> =&gt;</span> ({ GitHubApi: api })));
</code></pre>
<p>Dependencies are handled through layers, not some complicated DI container. It's just values all the way down.</p>
<p>Here's a visualization of Effect's dependency injection approach:</p>
<pre><code class="lang-mermaid">flowchart TD
    subgraph "Application"
    A[Main Effect] --&gt; B[Service 1]
    A --&gt; C[Service 2]
    B --&gt; D[Service 3]
    end

    subgraph "Configuration"
    E[Live Implementation] --&gt; F[Config 1]
    E --&gt; G[Config 2]
    end

    subgraph "Runtime"
    H[Provide Layer] --&gt; A
    E --&gt; H
    end

    style A fill:#f9d6d2
    style B fill:#f9d6d2
    style C fill:#f9d6d2
    style D fill:#f9d6d2
    style E fill:#d2f9d6
    style F fill:#d2f9d6
    style G fill:#d2f9d6
    style H fill:#d6d2f9
</code></pre>
<h2 id="heading-testing-the-killer-feature">Testing: The Killer Feature</h2>
<p>The biggest "aha" moment came when testing. With <code>@effect/vitest</code>, testing effectful code becomes almost trivial:</p>
<pre><code class="lang-typescript">it.effect(<span class="hljs-string">"fetches and prints followers"</span>, <span class="hljs-function">() =&gt;</span>
  Effect.gen(<span class="hljs-function"><span class="hljs-keyword">function</span>* (<span class="hljs-params">_</span>) </span>{
    <span class="hljs-keyword">const</span> consoleLogSpy = vi.spyOn(<span class="hljs-built_in">console</span>, <span class="hljs-string">"log"</span>).mockImplementation(<span class="hljs-function">() =&gt;</span> {})

    <span class="hljs-built_in">console</span>.log(mockFollowers.join(<span class="hljs-string">"\n"</span>))

    <span class="hljs-keyword">try</span> {
      assert.strictEqual(consoleLogSpy.mock.calls.length, <span class="hljs-number">1</span>)
      assert.strictEqual(consoleLogSpy.mock.calls[<span class="hljs-number">0</span>][<span class="hljs-number">0</span>], mockFollowers.join(<span class="hljs-string">"\n"</span>))
    } <span class="hljs-keyword">finally</span> {
      consoleLogSpy.mockRestore()
    }
  })
)
</code></pre>
<h2 id="heading-when-not-to-use-effect">When Not to Use Effect</h2>
<p>While I'm enthusiastic about Effect, it's not the right solution for every problem:</p>
<ol>
<li><strong>Small, simple utilities</strong> - For tiny scripts or utilities, Effect might be overkill</li>
<li><strong>Performance-critical hot paths</strong> - For code that needs absolute maximum performance, the abstractions might introduce overhead</li>
<li><strong>Teams unfamiliar with FP</strong> - If your team has no experience with functional concepts, the learning curve might outweigh the benefits initially</li>
<li><strong>Very small microservices</strong> - For extremely focused microservices, the complexity might not be justified</li>
</ol>
<h2 id="heading-the-revelation">The Revelation</h2>
<p>Here's what I've realized: Effect isn't trying to replace TypeScript or make you learn a completely new language. It's extending what you already know by making effectful computations a first-class concept.</p>
<p>You don't need to discard your years of experience with JavaScript, TypeScript, and Node. Effect builds on them by using standard language features:</p>
<ul>
<li>Generators for composition</li>
<li>TypeScript's type system for tracking success, errors, and dependencies</li>
<li>An ecosystem of modules that work together seamlessly</li>
</ul>
<p>This is what Ethan meant when he called Effect a language: it's a way of expressing computations within TypeScript that gives you superpowers without forcing you to abandon your existing knowledge.</p>
<pre><code class="lang-mermaid">graph TD
    A[TypeScript] --&gt; B[Effect]
    C[JavaScript] --&gt; A
    D[Node.js] --&gt; E[Effect Platforms]
    B --&gt; F[Your Application]
    E --&gt; F

    style A fill:#f9d6d2
    style B fill:#d2f9d6
    style C fill:#f9d6d2
    style D fill:#f9d6d2
    style E fill:#d2f9d6
    style F fill:#d6d2f9
</code></pre>
<h2 id="heading-try-it-yourself">Try It Yourself</h2>
<p>Whether you're building a tiny utility or a complex application, Effect provides a functional approach without the steep learning curve normally associated with purely functional languages.</p>
<p>Give it a try in your next project. You might find yourself thinking of TypeScript in a completely different way. I know I did.</p>
<h2 id="heading-resources">Resources</h2>
<ul>
<li><a target="_blank" href="https://effect.website/">Effect Official Website</a></li>
<li><a target="_blank" href="https://github.com/lyluongthien/github-watcher">GitHub Watcher</a></li>
<li><a target="_blank" href="https://ethanniser.dev/blog/the-truth-about-effect/">The Truth About Effect by Ethan Niser</a></li>
</ul>
]]></description><link>https://thediligentengineer.com/effect</link><guid isPermaLink="true">https://thediligentengineer.com/effect</guid><category><![CDATA[TypeScript]]></category><category><![CDATA[JavaScript]]></category><category><![CDATA[fault tolerance]]></category><category><![CDATA[error handling]]></category><category><![CDATA[effect]]></category><dc:creator><![CDATA[The Diligent Engineer]]></dc:creator></item><item><title><![CDATA[Leopard - the c⁽¹⁾ of O(1) Indexing System]]></title><description><![CDATA[<h3 id="heading-what-is-leopard"><strong>What is</strong> <strong>Leopard?</strong></h3>
<blockquote>
<p>The leopard is one of the five extant cat species in the genus Panthera. It has a pale yellowish to dark golden fur with dark spots grouped in rosettes. Its body is slender and muscular, reaching a length of 92–183 cm with a 66–102 cm long tail and a shoulder height of 60–70 cm. <a target="_blank" href="https://en.wikipedia.org/wiki/Leopard">Wikipedia</a> #jf4 #bigcatvibes 🐾 =))))<br />Reference: <a target="_blank" href="https://instagram.com/gauxinhtuoicuaba">https://instagram.com/gauxinhtuoicuaba</a> 🐆</p>
</blockquote>
<h3 id="heading-what-is-leopard-indexing"><strong>What is</strong> <strong>Leopard Indexing?</strong></h3>
<p>TL;DR:</p>
<blockquote>
<p>Leopard Indexing is the Zanzibar world's version of a supercharged Bear 🐻 🏎💨, except instead of running fast, it makes access control checks ridiculously quick.</p>
</blockquote>
<p><strong>Leopard is basically a precomputed indexing system</strong>, used in Zanzibar for authorization checks. The Leopard index efficiently handles deeply nested and wide <strong>Access-Control-List</strong> (<strong>ACL</strong>) relationships. It is primarily used for group-based permissions where users belong to hierarchical structures such as teams, roles, and shared access groups.</p>
<p><strong>Why is it needed?</strong></p>
<ul>
<li><p><strong>Standard recursion-based ACL checks are slow</strong> when a group has <strong>deep nesting</strong> (e.g., <code>Org → Department → Team → User</code>).</p>
</li>
<li><p><strong>Querying the database for every ACL lookup</strong> leads to <strong>high latency and database overload</strong>.</p>
</li>
<li><p>Leopard <strong>precomputes</strong> relationships, <strong>flattening</strong> nested permissions into <strong>efficient set lookups</strong>, reducing <strong>authorization time to O(1) instead of O(n)</strong>.</p>
</li>
</ul>
<hr />
<h2 id="heading-how-it-works"><strong>How it Works</strong></h2>
<h3 id="heading-key-concepts-ingredients-to-cook-this-speed-boosting-system"><strong>Key <s>Concepts</s> ingredients to cook this speed-boosting-system</strong></h3>
<ol>
<li><p><strong>Precomputed ACL Graphs</strong>: Converts <strong>nested ACLs</strong> into <strong>direct mappings</strong>.</p>
</li>
<li><p><strong>Set-Based Lookups</strong>: Uses <strong>skip lists</strong> and <strong>optimized data structures</strong> for <strong>fast union &amp; intersection operations</strong>.</p>
</li>
<li><p><strong>Incremental Updates</strong>: Updates index <strong>in real-time</strong> as ACLs change.</p>
</li>
</ol>
<h3 id="heading-example-organization-hierarchy"><strong>Example: Organization Hierarchy</strong></h3>
<ul>
<li>Suppose we have <strong>this ACL structure</strong>:</li>
</ul>
<pre><code class="lang-plaintext">Company → Department → Team → User
</code></pre>
<ul>
<li>Instead of <strong>traversing relationships at runtime</strong>, Leopard <strong>precomputes flattened sets</strong>:</li>
</ul>
<pre><code class="lang-plaintext">Leopard Index:
    company:acme → { user:123, user:456, user:789 }
    department:engineering → { user:123, user:456 }
    team:frontend → { user:123 }
</code></pre>
<p> <strong>Now, checking if</strong> <code>user:123</code> belongs to <code>company:acme</code> is an O(1) lookup.</p>
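<p>In code, such a flattened index is just a hash map from an object to its full set of transitive members. A toy in-memory sketch (shapes and names are illustrative, not Zanzibar's actual API):</p>

```typescript
// Precomputed, flattened index: object -> full set of transitive members.
const leopardIndex = new Map<string, Set<string>>([
  ["company:acme", new Set(["user:123", "user:456", "user:789"])],
  ["department:engineering", new Set(["user:123", "user:456"])],
  ["team:frontend", new Set(["user:123"])],
]);

// One hash lookup plus one set-membership test: O(1) on average,
// with no recursive walk of the group hierarchy at query time.
function check(object: string, user: string): boolean {
  return leopardIndex.get(object)?.has(user) ?? false;
}
```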
<hr />
<h2 id="heading-how-it-optimizes-checks"><strong>How it Optimizes Checks</strong></h2>
<h3 id="heading-step-1-flatten-acl-relationships"><strong>Step 1: Flatten ACL Relationships</strong></h3>
<p>Instead of evaluating <strong>nested groups recursively</strong>, Leopard <strong>precomputes all relationships</strong>.</p>
<h4 id="heading-before-leopard-recursive-lookups"><strong>Before Leopard (Recursive Lookups)</strong></h4>
<pre><code class="lang-plaintext">1. Check user:123 in team:frontend
2. Check team:frontend in department:engineering
3. Check department:engineering in company:acme (User is found)
</code></pre>
<ul>
<li><p><strong>Slow O(n) performance</strong></p>
</li>
<li><p><strong>Each check requires a DB lookup</strong></p>
</li>
</ul>
<h4 id="heading-with-leopard-o1-lookup"><strong>With Leopard (O(1) Lookup)</strong></h4>
<pre><code class="lang-plaintext">company:acme → { user:123, user:456, user:789 }
</code></pre>
<ul>
<li><p><strong>Single lookup in precomputed index</strong></p>
</li>
<li><p><strong>Blazing fast O(1) check</strong> </p>
</li>
</ul>
<hr />
<h2 id="heading-architecture"><strong>Architecture</strong></h2>
<h3 id="heading-diagram-of-gaus-architecture">Diagram of Gấu's <strong>Architecture</strong></h3>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1742226140840/13e97460-45d2-4332-aa03-ffd83dbd397f.jpeg" alt class="image--center mx-auto" /></p>
<p>(Oops, wrong diagram, nvm =)))</p>
<h3 id="heading-leopard-index">Leopard Index:</h3>
<pre><code class="lang-mermaid">graph TD;
    A[User Requests Access] --&gt;|Check| B[Zanzibar Server];
    B --&gt;|Query| C[Leopard Index];
    C --&gt;|Lookup user in precomputed sets| D[Return ALLOW/DENY];
    C --&gt;|If Not Found| E[Fallback to Spanner DB];
    E --&gt;|Resolve ACLs| F[Update Leopard Index];
    F --&gt;|Update Precomputed Sets| C;
</code></pre>
<p><strong>Fastest path:</strong> <strong>Leopard returns the decision instantly</strong>.<br /><strong>Fallback path:</strong> If data is missing, it queries Spanner <strong>once</strong>, then updates Leopard.</p>
<hr />
<h2 id="heading-storing-data"><strong>Storing Data</strong></h2>
<h3 id="heading-leopard-uses-an-efficient-set-representation"><strong>Leopard uses an efficient set representation:</strong></h3>
<pre><code class="lang-plaintext">(T, S, E)
</code></pre>
<div class="hn-table">
<table>
<thead>
<tr>
<td><strong>Field</strong></td><td><strong>Description</strong></td></tr>
</thead>
<tbody>
<tr>
<td><code>T</code></td><td><strong>Set Type</strong> (GROUP2GROUP, MEMBER2GROUP)</td></tr>
<tr>
<td><code>S</code></td><td><strong>Set ID</strong> (e.g., <code>team:frontend</code>)</td></tr>
<tr>
<td><code>E</code></td><td><strong>Element ID</strong> (e.g., <code>user:123</code>)</td></tr>
</tbody>
</table>
</div><h3 id="heading-example-flattened-acl-data"><strong>Example: Flattened ACL Data</strong></h3>
<pre><code class="lang-plaintext">GROUP2GROUP(team:frontend) → department:engineering
GROUP2GROUP(department:engineering) → company:acme
MEMBER2GROUP(user:123) → team:frontend
</code></pre>
<p>Now, <strong>checking</strong> <code>user:123</code> access to <code>company:acme</code> is just:</p>
<pre><code class="lang-plaintext">MEMBER2GROUP(user:123) ∩ GROUP2GROUP(company:acme) ≠ ∅
</code></pre>
<p> <strong>Set intersection is extremely fast</strong> compared to recursive lookups.</p>
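<p>Sketched in TypeScript (a toy model of the two set types; the table shapes and names are illustrative), the check reduces to intersecting the user's groups with the groups that roll up into the target object:</p>

```typescript
// MEMBER2GROUP: user -> groups the user belongs to directly.
const member2group = new Map<string, Set<string>>([
  ["user:123", new Set(["team:frontend"])],
]);

// GROUP2GROUP, flattened transitively: object -> all groups under it.
const group2group = new Map<string, Set<string>>([
  ["company:acme", new Set(["department:engineering", "team:frontend"])],
]);

// A non-empty intersection means the user reaches the object via some group.
function intersects(a: Set<string>, b: Set<string>): boolean {
  for (const x of a) if (b.has(x)) return true;
  return false;
}

function checkAccess(user: string, object: string): boolean {
  return intersects(
    member2group.get(user) ?? new Set<string>(),
    group2group.get(object) ?? new Set<string>()
  );
}
```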
<hr />
<h2 id="heading-how-it-is-updated-in-nearreal-timehttpsenwikipediaorgwikireal-timecomputingnearreal-time"><strong>How it is Updated in</strong> <a target="_blank" href="https://en.wikipedia.org/wiki/Real-time_computing#Near_real-time"><strong><em>near[Real-Time]</em></strong></a></h2>
<h3 id="heading-incremental-indexing-no-painful-rebuilds"><strong>Incremental Indexing (No Painful Rebuilds!)</strong></h3>
<ul>
<li><p>🟢 Zanzibar uses a <strong>real-time event stream</strong> from its <strong>Watch API</strong>.</p>
</li>
<li><p>🟢 When ACLs <strong>change</strong>, Leopard <strong>updates only affected sets</strong> instead of <strong>rebuilding everything</strong> (thankfully!).</p>
</li>
</ul>
<pre><code class="lang-mermaid">sequenceDiagram
    participant Zanzibar as Zanzibar Server
    participant Leopard as Leopard Index
    participant DB as Spanner Database

    Zanzibar-&gt;&gt;DB: Write ACL Update (User added to group)
    DB-&gt;&gt;Zanzibar: Confirm ACL Write
    Zanzibar-&gt;&gt;Leopard: Send Incremental ACL Update
    Leopard-&gt;&gt;Leopard: Update Set Index
    Leopard-&gt;&gt;Zanzibar: Confirm Index Update
</code></pre>
<p> <strong>Only modified sets are updated, keeping operations fast.</strong></p>
<hr />
<h2 id="heading-leopard-vs-traditional-lookups"><strong>Leopard vs. Traditional Lookups</strong></h2>
<div class="hn-table">
<table>
<thead>
<tr>
<td><strong>Feature</strong></td><td><strong>Traditional lookups</strong>(🥲)</td><td><strong>Leopard Index</strong>(😎)</td></tr>
</thead>
<tbody>
<tr>
<td><strong>Check Complexity</strong></td><td><strong>O(n) recursive queries</strong></td><td><strong>O(1) direct lookup</strong></td></tr>
<tr>
<td><strong>Performance</strong></td><td>Slow for large groups</td><td>Scales to millions of ACLs</td></tr>
<tr>
<td><strong>Data Updates</strong></td><td>Requires full recomputation</td><td><strong>Incremental updates</strong></td></tr>
<tr>
<td><strong>Storage</strong></td><td>Normalized ACLs</td><td><strong>Precomputed flattened sets</strong></td></tr>
</tbody>
</table>
</div><p>📌 <strong>Leopard makes group-based access checks 1000x faster</strong> 🚀.</p>
<hr />
<h2 id="heading-why-you-should-care-about-gauhttpsinstagramcomgauxinhtuoicuaba"><strong>Why You Should Care About</strong> <a target="_blank" href="https://instagram.com/gauxinhtuoicuaba"><strong>Gấu</strong></a> <strong>🧐</strong></h2>
<p><strong>O(1) Lookup Performance</strong>: No more waiting for slow queries.<br /><strong>Precomputed Set Relationships</strong>: ACL checks in milliseconds.<br /><strong>Incremental Updates</strong>: Never rebuilds everything from scratch.<br /><strong>Cache-Friendly</strong>: Avoids repeated DB hits, making things <strong>blazing fast</strong>.</p>
<hr />
<h3 id="heading-key-takeaway"><strong>📌 Key Takeaway</strong></h3>
<p>Leopard is a <strong></strong> <em>nah, too long</em>.</p>
<blockquote>
<p>Wait, O(1)? That's not just fast, that's teleportatio<em>n 😉!</em></p>
</blockquote>
<p>c": cutesy: <em>adjective -</em> cute to a <a target="_blank" href="https://www.google.com/search?sca_esv=a84006a3a0467662&amp;sxsrf=AHTn8zpSvaqJ1Y-5OonTbhv1pYrFz4J2Uw:1742226643083&amp;q=sentimental&amp;si=APYL9btEN2SiQ9h4o5Ckf6vYFXRYHo0QI9tiJJeEv0_15K7bldjbzhZc5LvvH7gx2k-PNKSrorZ5SR406mHgtumLxrtksOPW3DweuqDhN1nc-V_yNQcPCWs%3D&amp;expnd=1&amp;sa=X&amp;ved=2ahUKEwicjreBvJGMAxU3XGwGHfL-EYkQyecJegQIRxAQ">sentimental</a> or <a target="_blank" href="https://www.google.com/search?sca_esv=a84006a3a0467662&amp;sxsrf=AHTn8zpSvaqJ1Y-5OonTbhv1pYrFz4J2Uw:1742226643083&amp;q=mawkish&amp;si=APYL9btezPaTUY7KecSEHRUsL7ycFrZX9Mncqu816dmJQGFWN7SFe_-M8KNKW-LXWS_95DJPLHVb3BuSTcbA4-riTgowUT6A6KKymYUU9DDuP02nmnJfQDQ%3D&amp;expnd=1&amp;sa=X&amp;ved=2ahUKEwicjreBvJGMAxU3XGwGHfL-EYkQyecJegQIRxAR">mawkish</a> extent. e.g. "hair pulled back in cutesy little bows"</p>
]]></description><link>https://thediligentengineer.com/leopard-the-c-of-o1-indexing-system</link><guid isPermaLink="true">https://thediligentengineer.com/leopard-the-c-of-o1-indexing-system</guid><category><![CDATA[Gau]]></category><category><![CDATA[architecture]]></category><category><![CDATA[leopard ]]></category><category><![CDATA[optimization]]></category><category><![CDATA[indexing]]></category><category><![CDATA[Databases]]></category><category><![CDATA[zanzibar ]]></category><dc:creator><![CDATA[The Diligent Engineer]]></dc:creator></item><item><title><![CDATA[Do You Use Mutex in Node.js Servers?]]></title><description><![CDATA[<p><strong>TL;DR:</strong> While mutexes are commonly associated with multi-threaded programming, they are also highly useful in Node.js, a single-threaded environment. They help coordinate access to shared resources and prevent redundant or conflicting operations, especially in scenarios like database queries or cache management.</p>
<hr />
<h2 id="heading-what-is-a-mutex">What Is a Mutex?</h2>
<p>A <strong>mutex</strong> (short for <strong>mutual exclusion</strong>) is a mechanism that allows only one thread or process to access a critical section of code at a time. This ensures that shared resources are not simultaneously accessed or modified in ways that lead to conflicts or inconsistent states.</p>
<p>Under the hood, a mutex uses the following core concepts:</p>
<ol>
<li><p><strong>State Management</strong>:<br /> The mutex maintains a state, typically a boolean flag (<code>locked</code>), indicating whether it is currently held by a process.</p>
</li>
<li><p><strong>Lock Acquisition</strong>:<br /> When a task attempts to acquire the lock, it checks the <code>locked</code> state:</p>
<ul>
<li><p>If the lock is <strong>unlocked</strong>, the task acquires it and sets the state to <code>locked</code>.</p>
</li>
<li><p>If the lock is <strong>already locked</strong>, the task is queued for execution once the lock becomes available.</p>
</li>
</ul>
</li>
<li><p><strong>Queue Management</strong>:<br /> Tasks that cannot acquire the lock immediately are added to a queue. The mutex processes this queue sequentially as the lock becomes available.</p>
</li>
<li><p><strong>Lock Release</strong>:<br /> Once the task holding the lock is completed, the lock is released by:</p>
<ul>
<li><p>Setting <code>locked</code> to <code>false</code>.</p>
</li>
<li><p>Notifying the next task in the queue (if any) to acquire the lock.</p>
</li>
</ul>
</li>
</ol>
<p>This approach ensures that no two tasks can access the critical section simultaneously.</p>
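<p>Concretely, the state-plus-queue mechanism described above can be sketched in a few lines of TypeScript. Libraries such as <code>async-mutex</code> provide a production-ready version; this toy just shows the moving parts:</p>

```typescript
// Minimal promise-based mutex: a locked flag plus a FIFO queue of waiters.
class SimpleMutex {
  private locked = false;
  private waiters: Array<() => void> = [];

  // Resolves once the caller holds the lock; yields a release function.
  acquire(): Promise<() => void> {
    return new Promise(resolve => {
      const grant = () => {
        this.locked = true;
        resolve(() => this.release());
      };
      if (this.locked) this.waiters.push(grant); // queue until available
      else grant();                              // free: take it now
    });
  }

  private release(): void {
    const next = this.waiters.shift();
    if (next) next();          // hand the lock to the next queued task
    else this.locked = false;  // no one waiting: unlock
  }

  // Convenience: run fn while holding the lock, always releasing after.
  async runExclusive<T>(fn: () => Promise<T>): Promise<T> {
    const release = await this.acquire();
    try {
      return await fn();
    } finally {
      release();
    }
  }
}
```

Two overlapping `runExclusive` calls will run strictly one after the other, which is exactly the property the cache-population example below relies on.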
<p>In Node.js, despite being single-threaded, mutexes can still play a crucial role. Why? Because Node.js applications can have asynchronous operations executing concurrently. These asynchronous tasks can lead to race conditions or redundant operations without proper coordination.</p>
<pre><code class="lang-mermaid">sequenceDiagram
    participant Task1
    participant Task2
    participant Mutex
    participant State

    Note over State: Initial State: Free
    Task1-&gt;&gt;Mutex: Request lock
    Mutex-&gt;&gt;State: Transition to Acquired
    State--&gt;&gt;Task1: Lock acquired (State: Locked)
    Task1-&gt;&gt;Mutex: Access resource
    Task2-&gt;&gt;Mutex: Request lock
    Mutex--&gt;&gt;Task2: Wait (State: Waiting)
    Task1-&gt;&gt;Mutex: Release lock
    Mutex-&gt;&gt;State: Transition to Free
    State--&gt;&gt;Task2: Next task acquires lock
    Task2-&gt;&gt;Mutex: Access resource
    Task2-&gt;&gt;Mutex: Release lock
    Mutex-&gt;&gt;State: Transition to Free
    State--&gt;&gt;[All tasks]: Lock available
</code></pre>
<h3 id="heading-common-misconception">Common Misconception</h3>
<p>Many developers often associate mutexes with multi-threaded programming, but their utility extends far beyond that. In asynchronous environments like Node.js, where multiple operations may run concurrently, mutexes play a crucial role in managing access to shared resources. This helps prevent race conditions and redundant operations, ensuring that resources are accessed in a controlled and predictable manner.</p>
<p>As the saying goes,</p>
<blockquote>
<p><strong>Concurrency is not parallelism</strong></p>
</blockquote>
<p>While Node.js operates on a single thread, it handles many asynchronous tasks concurrently. Mutexes help manage this concurrency by allowing only one task to access a critical section at a time, making it possible to safely coordinate access to shared resources, even in an environment where tasks run concurrently but not in parallel.</p>
<hr />
<h2 id="heading-real-world-scenarios">Real-World Scenarios</h2>
<p>Let's explore two examples where mutexes prove invaluable in Node.js:</p>
<h3 id="heading-1-preventing-redundant-database-queries">1. Preventing Redundant Database Queries</h3>
<p>Imagine a shared API endpoint that fetches a list of contacts. The database query takes 10 seconds to complete, and the result is cached for faster subsequent access. If 1,000 requests hit the endpoint during the query's execution, they could trigger 1,000 redundant database queries instead of waiting for the first query to complete.</p>
<h4 id="heading-solution-mutex-based-cache-population">Solution: Mutex-Based Cache Population</h4>
<p>Here's how you can use a mutex to ensure only one query runs:</p>
<pre><code class="lang-typescript"><span class="hljs-keyword">import</span> { Inject, Injectable } <span class="hljs-keyword">from</span> <span class="hljs-string">'@nestjs/common'</span>;
<span class="hljs-keyword">import</span> { Cache } <span class="hljs-keyword">from</span> <span class="hljs-string">'cache-manager'</span>;
<span class="hljs-keyword">import</span> { CACHE_MANAGER } <span class="hljs-keyword">from</span> <span class="hljs-string">'@nestjs/cache-manager'</span>;
<span class="hljs-keyword">import</span> { Mutex } <span class="hljs-keyword">from</span> <span class="hljs-string">'async-mutex'</span>;

<span class="hljs-meta">@Injectable</span>()
<span class="hljs-keyword">export</span> <span class="hljs-keyword">class</span> ContactsService {
  <span class="hljs-keyword">private</span> <span class="hljs-keyword">readonly</span> mutex = <span class="hljs-keyword">new</span> Mutex();

  <span class="hljs-keyword">constructor</span>(<span class="hljs-params">
    <span class="hljs-meta">@Inject</span>(CACHE_MANAGER) <span class="hljs-keyword">private</span> cacheManager: Cache,
  </span>) {}

  <span class="hljs-keyword">async</span> getContacts(): <span class="hljs-built_in">Promise</span>&lt;<span class="hljs-built_in">any</span>[]&gt; {
    <span class="hljs-keyword">const</span> CACHE_KEY = <span class="hljs-string">'contacts'</span>;

    <span class="hljs-comment">// Check cache</span>
    <span class="hljs-keyword">const</span> cachedContacts = <span class="hljs-keyword">await</span> <span class="hljs-built_in">this</span>.cacheManager.get(CACHE_KEY);
    <span class="hljs-keyword">if</span> (cachedContacts) {
      <span class="hljs-keyword">return</span> cachedContacts <span class="hljs-keyword">as</span> <span class="hljs-built_in">any</span>[];
    }

    <span class="hljs-comment">// Acquire mutex lock</span>
    <span class="hljs-keyword">return</span> <span class="hljs-built_in">this</span>.mutex.runExclusive(<span class="hljs-keyword">async</span> () =&gt; {
      <span class="hljs-comment">// Double-check cache in case it was populated while waiting</span>
      <span class="hljs-keyword">const</span> doubleCheckCache = <span class="hljs-keyword">await</span> <span class="hljs-built_in">this</span>.cacheManager.get(CACHE_KEY);
      <span class="hljs-keyword">if</span> (doubleCheckCache) {
        <span class="hljs-keyword">return</span> doubleCheckCache <span class="hljs-keyword">as</span> <span class="hljs-built_in">any</span>[];
      }

      <span class="hljs-comment">// Fetch data from database</span>
      <span class="hljs-keyword">const</span> contacts = <span class="hljs-keyword">await</span> <span class="hljs-built_in">this</span>.fetchContactsFromDb();

      <span class="hljs-comment">// Store in cache</span>
      <span class="hljs-keyword">await</span> <span class="hljs-built_in">this</span>.cacheManager.set(CACHE_KEY, contacts, { ttl: <span class="hljs-number">60</span> });

      <span class="hljs-keyword">return</span> contacts;
    });
  }

  <span class="hljs-keyword">private</span> <span class="hljs-keyword">async</span> fetchContactsFromDb(): <span class="hljs-built_in">Promise</span>&lt;<span class="hljs-built_in">any</span>[]&gt; {
    <span class="hljs-built_in">console</span>.log(<span class="hljs-string">'Querying database...'</span>);
    <span class="hljs-keyword">await</span> <span class="hljs-keyword">new</span> <span class="hljs-built_in">Promise</span>(<span class="hljs-function">(<span class="hljs-params">resolve</span>) =&gt;</span> <span class="hljs-built_in">setTimeout</span>(resolve, <span class="hljs-number">10000</span>)); <span class="hljs-comment">// Simulate 10s delay</span>
    <span class="hljs-keyword">return</span> [{ id: <span class="hljs-number">1</span>, name: <span class="hljs-string">'John Doe'</span> }, { id: <span class="hljs-number">2</span>, name: <span class="hljs-string">'Jane Smith'</span> }];
  }
}
</code></pre>
<h4 id="heading-diagram-mermaid">Diagram (Mermaid):</h4>
<pre><code class="lang-mermaid">sequenceDiagram
    participant Client1
    participant Client2
    participant Server
    participant Cache
    participant DB

    Client1-&gt;&gt;Server: Request contacts
    Server-&gt;&gt;Cache: Check cache
    Cache--&gt;&gt;Server: Cache miss
    Server-&gt;&gt;DB: Query contacts
    Client2-&gt;&gt;Server: Request contacts
    Server--&gt;&gt;Client2: Wait (locked)
    DB--&gt;&gt;Server: Return contacts
    Server-&gt;&gt;Cache: Update cache
    Cache--&gt;&gt;Server: Success
    Server--&gt;&gt;Client1: Return contacts
    Server--&gt;&gt;Client2: Return cached contacts
</code></pre>
<hr />
<h3 id="heading-2-coordinating-file-uploads">2. Coordinating File Uploads</h3>
<p>Imagine a file upload service where users can upload files that are processed and stored in a cloud bucket. If multiple requests try to upload files with the same name simultaneously, they might overwrite each other's data or create duplicates.</p>
<h4 id="heading-solution-mutex-based-file-upload-coordination">Solution: Mutex-Based File Upload Coordination</h4>
<pre><code class="lang-typescript"><span class="hljs-keyword">import</span> { Mutex } <span class="hljs-keyword">from</span> <span class="hljs-string">'async-mutex'</span>;

<span class="hljs-keyword">class</span> FileUploadService {
  <span class="hljs-keyword">private</span> <span class="hljs-keyword">readonly</span> mutex = <span class="hljs-keyword">new</span> Mutex();

  <span class="hljs-keyword">async</span> uploadFile(fileName: <span class="hljs-built_in">string</span>, fileContent: Buffer): <span class="hljs-built_in">Promise</span>&lt;<span class="hljs-built_in">void</span>&gt; {
    <span class="hljs-keyword">await</span> <span class="hljs-built_in">this</span>.mutex.runExclusive(<span class="hljs-keyword">async</span> () =&gt; {
      <span class="hljs-keyword">if</span> (<span class="hljs-keyword">await</span> <span class="hljs-built_in">this</span>.fileExists(fileName)) {
        <span class="hljs-keyword">throw</span> <span class="hljs-keyword">new</span> <span class="hljs-built_in">Error</span>(<span class="hljs-string">'File already exists'</span>);
      }

      <span class="hljs-keyword">await</span> <span class="hljs-built_in">this</span>.saveFile(fileName, fileContent);
    });
  }

  <span class="hljs-keyword">private</span> <span class="hljs-keyword">async</span> fileExists(fileName: <span class="hljs-built_in">string</span>): <span class="hljs-built_in">Promise</span>&lt;<span class="hljs-built_in">boolean</span>&gt; {
    <span class="hljs-comment">// Check if file exists in storage</span>
    <span class="hljs-built_in">console</span>.log(<span class="hljs-string">`Checking existence for <span class="hljs-subst">${fileName}</span>`</span>);
    <span class="hljs-keyword">await</span> <span class="hljs-keyword">new</span> <span class="hljs-built_in">Promise</span>(<span class="hljs-function">(<span class="hljs-params">resolve</span>) =&gt;</span> <span class="hljs-built_in">setTimeout</span>(resolve, <span class="hljs-number">100</span>));
    <span class="hljs-keyword">return</span> <span class="hljs-literal">false</span>; <span class="hljs-comment">// Simulate no file exists</span>
  }

  <span class="hljs-keyword">private</span> <span class="hljs-keyword">async</span> saveFile(fileName: <span class="hljs-built_in">string</span>, fileContent: Buffer): <span class="hljs-built_in">Promise</span>&lt;<span class="hljs-built_in">void</span>&gt; {
    <span class="hljs-built_in">console</span>.log(<span class="hljs-string">`Saving file: <span class="hljs-subst">${fileName}</span>`</span>);
    <span class="hljs-keyword">await</span> <span class="hljs-keyword">new</span> <span class="hljs-built_in">Promise</span>(<span class="hljs-function">(<span class="hljs-params">resolve</span>) =&gt;</span> <span class="hljs-built_in">setTimeout</span>(resolve, <span class="hljs-number">500</span>));
  }
}
</code></pre>
<h4 id="heading-diagram-mermaid-1">Diagram (Mermaid):</h4>
<pre><code class="lang-mermaid">sequenceDiagram
    participant Request1
    participant Request2
    participant Service
    participant Storage

    Request1-&gt;&gt;Service: Upload file1
    Service-&gt;&gt;Storage: Check existence
    Storage--&gt;&gt;Service: File not exists
    Service-&gt;&gt;Storage: Save file1
    Request2-&gt;&gt;Service: Upload file1
    Service--&gt;&gt;Request2: Wait (locked)
    Storage--&gt;&gt;Service: File saved
    Service--&gt;&gt;Request1: Success
    Service-&gt;&gt;Storage: Check existence
    Storage--&gt;&gt;Service: File exists
    Service--&gt;&gt;Request2: Error: File already exists
</code></pre>
<hr />
<h2 id="heading-why-use-mutex-in-nodejs">Why Use Mutex in Node.js?</h2>
<ol>
<li><p><strong>Prevent Redundancy:</strong> Avoid executing the same expensive operation multiple times.</p>
</li>
<li><p><strong>Ensure Consistency:</strong> Protect shared resources from conflicting updates.</p>
</li>
<li><p><strong>Simplify Asynchronous Coordination:</strong> Manage race conditions cleanly and predictably.</p>
</li>
</ol>
<hr />
<h2 id="heading-alternative-solutions"><strong>Alternative solutions</strong></h2>
<p>While mutexes are effective for managing concurrency in Node.js, they are not the only solution. Depending on your use case, there are several <strong>alternative solutions</strong> for handling concurrency and shared resource management in Node.js. Here are some alternatives:</p>
<h3 id="heading-1-locks-with-redis-distributed-locking">1. <strong>Locks with Redis (Distributed Locking)</strong></h3>
<p>If you're working in a distributed environment or need to scale across multiple processes or machines, Redis can be used to create distributed locks. Redis provides a lightweight and highly performant solution to implement locks via the <strong>SETNX</strong> command (set if not exists) or libraries like <strong>Redlock</strong>.</p>
<h4 id="heading-use-case">Use Case:</h4>
<ul>
<li>Distributed systems with multiple instances of Node.js servers needing synchronized access to resources.</li>
</ul>
<h4 id="heading-code-example">Code Example:</h4>
<pre><code class="lang-typescript"><span class="hljs-keyword">import</span> { createClient } <span class="hljs-keyword">from</span> <span class="hljs-string">'redis'</span>;
<span class="hljs-keyword">import</span> Redlock <span class="hljs-keyword">from</span> <span class="hljs-string">'redlock'</span>;

<span class="hljs-keyword">const</span> client = createClient();
<span class="hljs-keyword">const</span> redlock = <span class="hljs-keyword">new</span> Redlock([client]);

<span class="hljs-keyword">async</span> <span class="hljs-function"><span class="hljs-keyword">function</span> <span class="hljs-title">fetchContactsWithRedisLock</span>(<span class="hljs-params"></span>) </span>{
  <span class="hljs-keyword">const</span> resource = <span class="hljs-string">'contacts-lock'</span>;
  <span class="hljs-keyword">try</span> {
    <span class="hljs-keyword">const</span> lock = <span class="hljs-keyword">await</span> redlock.lock(resource, <span class="hljs-number">10000</span>); <span class="hljs-comment">// TTL: 10 seconds</span>
    <span class="hljs-keyword">try</span> {
      <span class="hljs-keyword">const</span> contacts = <span class="hljs-keyword">await</span> fetchContactsFromDb();
      <span class="hljs-comment">// Do something with the contacts...</span>
    } <span class="hljs-keyword">finally</span> {
      <span class="hljs-keyword">await</span> lock.unlock(); <span class="hljs-comment">// release even if the work above throws</span>
    }
  } <span class="hljs-keyword">catch</span> (error) {
    <span class="hljs-built_in">console</span>.error(<span class="hljs-string">'Could not acquire lock'</span>, error);
  }
}
</code></pre>
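<p>The Redlock snippet above builds on Redis's set-if-not-exists semantics. As a rough, self-contained sketch of that idea (an in-memory object stands in for Redis so no server is needed; with a real client this corresponds to <code>SET key token NX PX ttl</code>):</p>

```typescript
// Sketch of the SETNX (set-if-not-exists) locking pattern that Redis-based
// locks build on. A plain object stands in for Redis so this runs locally.
const store: { [key: string]: string } = {};

function setNX(key: string, value: string) {
  if (key in store) return false; // lock already held
  store[key] = value;
  return true;
}

function releaseLock(key: string, token: string) {
  if (store[key] !== token) return false; // only the holder may release
  delete store[key];
  return true;
}

const acquired = setNX('lock:contacts', 'owner-1');   // lock taken
const blocked = setNX('lock:contacts', 'owner-2');    // rejected while held
releaseLock('lock:contacts', 'owner-1');
const reacquired = setNX('lock:contacts', 'owner-3'); // free again
console.log(acquired, blocked, reacquired); // true false true
```

<p>The per-holder token matters: checking it before deleting prevents a client whose lock has expired from releasing a lock now held by someone else (real implementations perform this check-and-delete atomically, typically in a Lua script).</p>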
<h3 id="heading-2-semaphore-rate-limiting">2. <strong>Semaphore (Rate Limiting)</strong></h3>
<p>A <strong>semaphore</strong> is another synchronization primitive that controls access to a particular resource by multiple processes in a concurrent system. A semaphore keeps track of how many "tokens" are available for tasks to acquire. Semaphore-based solutions are especially useful for rate-limiting access to a resource.</p>
<h4 id="heading-use-case-1">Use Case:</h4>
<ul>
<li>Rate-limiting for APIs or services to ensure only a certain number of concurrent users can access a resource.</li>
</ul>
<h4 id="heading-code-example-1">Code Example:</h4>
<pre><code class="lang-typescript"><span class="hljs-keyword">import</span> { Semaphore } <span class="hljs-keyword">from</span> <span class="hljs-string">'semaphore-async-await'</span>;

<span class="hljs-keyword">const</span> semaphore = <span class="hljs-keyword">new</span> Semaphore(<span class="hljs-number">1</span>); <span class="hljs-comment">// Allow only one concurrent access</span>

<span class="hljs-keyword">async</span> <span class="hljs-function"><span class="hljs-keyword">function</span> <span class="hljs-title">fetchContactsWithSemaphore</span>(<span class="hljs-params"></span>) </span>{
  <span class="hljs-keyword">await</span> semaphore.acquire();
  <span class="hljs-keyword">try</span> {
    <span class="hljs-keyword">const</span> contacts = <span class="hljs-keyword">await</span> fetchContactsFromDb();
    <span class="hljs-keyword">return</span> contacts;
  } <span class="hljs-keyword">finally</span> {
    semaphore.release(); <span class="hljs-comment">// release() takes no arguments in semaphore-async-await</span>
  }
}
</code></pre>
<h3 id="heading-3-event-emitters-for-task-coordination">3. <strong>Event Emitters for Task Coordination</strong></h3>
<p>For simpler concurrency management within a single process, you can use <strong>Node.js EventEmitters</strong> to notify other tasks when an operation is complete. While not a strict locking mechanism, EventEmitters allow you to manage the flow of asynchronous tasks and coordinate operations.</p>
<h4 id="heading-use-case-2">Use Case:</h4>
<ul>
<li>When multiple tasks depend on the completion of a single task and need to avoid redundant execution.</li>
</ul>
<h4 id="heading-code-example-2">Code Example:</h4>
<pre><code class="lang-typescript"><span class="hljs-keyword">import</span> { EventEmitter } <span class="hljs-keyword">from</span> <span class="hljs-string">'events'</span>;

<span class="hljs-keyword">const</span> eventEmitter = <span class="hljs-keyword">new</span> EventEmitter();

<span class="hljs-function"><span class="hljs-keyword">function</span> <span class="hljs-title">fetchContactsAndNotify</span>(<span class="hljs-params"></span>) </span>{
  fetchContactsFromDb()
    .then(<span class="hljs-function"><span class="hljs-params">contacts</span> =&gt;</span> {
      eventEmitter.emit(<span class="hljs-string">'contacts-fetched'</span>, contacts);
    });
}

<span class="hljs-function"><span class="hljs-keyword">function</span> <span class="hljs-title">handleFetchedContacts</span>(<span class="hljs-params"></span>) </span>{
  eventEmitter.once(<span class="hljs-string">'contacts-fetched'</span>, <span class="hljs-function">(<span class="hljs-params">contacts</span>) =&gt;</span> {
    <span class="hljs-built_in">console</span>.log(<span class="hljs-string">'Received contacts:'</span>, contacts);
  });
}

handleFetchedContacts();
fetchContactsAndNotify();
</code></pre>
<h3 id="heading-4-worker-queues-eg-bull-or-kue">4. <strong>Worker Queues (e.g., Bull or Kue)</strong></h3>
<p>Using <strong>worker queues</strong> like <strong>Bull</strong> (or its successor <strong>BullMQ</strong>, used below) or <strong>Kue</strong> can help with managing task concurrency in Node.js. These libraries allow you to create jobs that can be processed by worker processes, controlling concurrency and ensuring that resources are accessed in an orderly manner.</p>
<h4 id="heading-use-case-3">Use Case:</h4>
<ul>
<li>Managing task queues where only a certain number of workers can access a resource at any given time.</li>
</ul>
<h4 id="heading-code-example-with-bull">Code Example with Bull:</h4>
<pre><code class="lang-typescript"><span class="hljs-keyword">import</span> { Queue, Worker } <span class="hljs-keyword">from</span> <span class="hljs-string">'bullmq'</span>;

<span class="hljs-keyword">const</span> queue = <span class="hljs-keyword">new</span> Queue(<span class="hljs-string">'fetch-contacts'</span>);
<span class="hljs-keyword">const</span> worker = <span class="hljs-keyword">new</span> Worker(<span class="hljs-string">'fetch-contacts'</span>, <span class="hljs-keyword">async</span> job =&gt; {
  <span class="hljs-keyword">const</span> contacts = <span class="hljs-keyword">await</span> fetchContactsFromDb();
  <span class="hljs-comment">// Process contacts...</span>
}, {
  limiter: {
    groupKey: <span class="hljs-string">'fetch-contacts'</span>,
    max: <span class="hljs-number">1</span>, <span class="hljs-comment">// Only allow 1 worker at a time</span>
    duration: <span class="hljs-number">1000</span> <span class="hljs-comment">// Limit requests per second</span>
  }
});
</code></pre>
<h3 id="heading-5-promise-based-locks">5. <strong>Promise-based Locks</strong></h3>
<p>In a simpler case, you can create a <strong>promise-based lock</strong> where the <code>lock</code> is a promise that other tasks can wait on before proceeding. This approach is helpful for managing a shared resource in single-threaded environments like Node.js.</p>
<h4 id="heading-use-case-4">Use Case:</h4>
<ul>
<li>Managing access to a critical section of code with minimal complexity.</li>
</ul>
<h4 id="heading-code-example-3">Code Example:</h4>
<pre><code class="lang-typescript"><span class="hljs-keyword">class</span> PromiseLock {
  <span class="hljs-keyword">private</span> lock: <span class="hljs-built_in">Promise</span>&lt;<span class="hljs-built_in">void</span>&gt; = <span class="hljs-built_in">Promise</span>.resolve();

  acquire() {
    <span class="hljs-keyword">let</span> release!: <span class="hljs-function">() =&gt;</span> <span class="hljs-built_in">void</span>; <span class="hljs-comment">// definitely assigned inside the executor below</span>
    <span class="hljs-keyword">const</span> newLock = <span class="hljs-keyword">new</span> <span class="hljs-built_in">Promise</span>&lt;<span class="hljs-built_in">void</span>&gt;(<span class="hljs-function"><span class="hljs-params">resolve</span> =&gt;</span> release = resolve);
    <span class="hljs-keyword">const</span> currentLock = <span class="hljs-built_in">this</span>.lock;
    <span class="hljs-built_in">this</span>.lock = <span class="hljs-built_in">this</span>.lock.then(<span class="hljs-function">() =&gt;</span> newLock);
    <span class="hljs-keyword">return</span> currentLock.then(<span class="hljs-function">() =&gt;</span> release);
  }
}

<span class="hljs-keyword">const</span> lock = <span class="hljs-keyword">new</span> PromiseLock();

<span class="hljs-keyword">async</span> <span class="hljs-function"><span class="hljs-keyword">function</span> <span class="hljs-title">fetchContacts</span>(<span class="hljs-params"></span>) </span>{
  <span class="hljs-keyword">const</span> release = <span class="hljs-keyword">await</span> lock.acquire();
  <span class="hljs-keyword">try</span> {
    <span class="hljs-keyword">const</span> contacts = <span class="hljs-keyword">await</span> fetchContactsFromDb();
    <span class="hljs-keyword">return</span> contacts;
  } <span class="hljs-keyword">finally</span> {
    release();
  }
}
</code></pre>
<h3 id="heading-6-single-responsibility-pattern-with-a-task-queue">6. <strong>Single Responsibility Pattern with a Task Queue</strong></h3>
<p>Sometimes, a simpler design pattern can solve concurrency challenges. You could use a <strong>task queue</strong> that ensures only one task accesses a shared resource at any given time by using a <strong>single-responsibility</strong> approach for managing a shared state.</p>
<h4 id="heading-use-case-5">Use Case:</h4>
<ul>
<li>Preventing race conditions by ensuring one task processes data at a time.</li>
</ul>
<h4 id="heading-code-example-4">Code Example:</h4>
<pre><code class="lang-typescript"><span class="hljs-keyword">class</span> TaskQueue {
  <span class="hljs-keyword">private</span> queue: (<span class="hljs-function">() =&gt;</span> <span class="hljs-built_in">Promise</span>&lt;<span class="hljs-built_in">void</span>&gt; | <span class="hljs-built_in">void</span>)[] = [];
  <span class="hljs-keyword">private</span> isProcessing = <span class="hljs-literal">false</span>;

  enqueue(task: <span class="hljs-function">() =&gt;</span> <span class="hljs-built_in">Promise</span>&lt;<span class="hljs-built_in">void</span>&gt; | <span class="hljs-built_in">void</span>) {
    <span class="hljs-built_in">this</span>.queue.push(task);
    <span class="hljs-built_in">this</span>.processQueue();
  }

  <span class="hljs-keyword">private</span> <span class="hljs-keyword">async</span> processQueue() {
    <span class="hljs-keyword">if</span> (<span class="hljs-built_in">this</span>.isProcessing || <span class="hljs-built_in">this</span>.queue.length === <span class="hljs-number">0</span>) <span class="hljs-keyword">return</span>;
    <span class="hljs-built_in">this</span>.isProcessing = <span class="hljs-literal">true</span>;
    <span class="hljs-keyword">const</span> task = <span class="hljs-built_in">this</span>.queue.shift()!;
    <span class="hljs-keyword">try</span> {
      <span class="hljs-keyword">await</span> task(); <span class="hljs-comment">// tasks may be async; awaiting runs them one at a time</span>
    } <span class="hljs-keyword">finally</span> {
      <span class="hljs-built_in">this</span>.isProcessing = <span class="hljs-literal">false</span>; <span class="hljs-comment">// reset even if the task throws</span>
      <span class="hljs-built_in">this</span>.processQueue();
    }
  }
}

<span class="hljs-keyword">const</span> taskQueue = <span class="hljs-keyword">new</span> TaskQueue();

taskQueue.enqueue(<span class="hljs-keyword">async</span> () =&gt; {
  <span class="hljs-keyword">const</span> contacts = <span class="hljs-keyword">await</span> fetchContactsFromDb();
  <span class="hljs-built_in">console</span>.log(contacts);
});
</code></pre>
<hr />
<h3 id="heading-comparison-of-alternatives">Comparison of Alternatives</h3>
<div class="hn-table">
<table>
<thead>
<tr>
<td><strong>Approach</strong></td><td><strong>Best For</strong></td><td><strong>Concurrency Level</strong></td><td><strong>Complexity</strong></td></tr>
</thead>
<tbody>
<tr>
<td><strong>Mutex (async-mutex)</strong></td><td>Single-node concurrency control</td><td>High (single thread)</td><td>Moderate</td></tr>
<tr>
<td><strong>Redis Lock</strong></td><td>Distributed systems or microservices</td><td>High (across machines)</td><td>High</td></tr>
<tr>
<td><strong>Semaphore</strong></td><td>Rate limiting, controlling the number of concurrent tasks</td><td>Medium</td><td>Low</td></tr>
<tr>
<td><strong>Event Emitters</strong></td><td>Task coordination in single-process systems</td><td>Medium</td><td>Low</td></tr>
<tr>
<td><strong>Worker Queues (Bull)</strong></td><td>Task queues with concurrency control</td><td>High</td><td>High</td></tr>
<tr>
<td><strong>Promise-based Locks</strong></td><td>Simple concurrency management in single-threaded systems</td><td>Low (single thread)</td><td>Low</td></tr>
<tr>
<td><strong>Task Queue (SRP)</strong></td><td>Sequential task processing with minimal concurrency</td><td>Low</td><td>Low</td></tr>
</tbody>
</table>
</div><hr />
<h2 id="heading-conclusion">Conclusion</h2>
<p>While <strong>mutexes</strong> are an effective solution for managing shared resources in Node.js, you should consider these alternatives based on your specific needs:</p>
<ul>
<li><p><strong>Redis locks</strong> or <strong>distributed locks</strong> are great for large-scale, distributed systems.</p>
</li>
<li><p><strong>Semaphores</strong> are useful for rate-limiting and managing controlled concurrency.</p>
</li>
<li><p><strong>Promise-based locks</strong> or <strong>task queues</strong> offer lightweight solutions for simpler scenarios.</p>
</li>
</ul>
<p>Mutexes aren't just for multi-threaded programming. In a single-threaded, asynchronous environment like Node.js, they provide a powerful tool for managing shared resources, ensuring consistent behavior, and optimizing performance. By adopting mutexes in scenarios like cache management and file uploads, you can make your applications more robust and efficient.</p>
<p>Give mutexes a try in your Node.js projects and see how they simplify concurrency challenges.</p>
]]></description><link>https://thediligentengineer.com/do-you-use-mutex-in-nodejs-servers</link><guid isPermaLink="true">https://thediligentengineer.com/do-you-use-mutex-in-nodejs-servers</guid><category><![CDATA[JavaScript]]></category><category><![CDATA[Node.js]]></category><category><![CDATA[concurrency]]></category><category><![CDATA[mutex]]></category><category><![CDATA[semaphore]]></category><category><![CDATA[lock]]></category><category><![CDATA[distributed system]]></category><dc:creator><![CDATA[The Diligent Engineer]]></dc:creator></item><item><title><![CDATA[Frequent Feedback vs. Corrective Feedback in Engineering]]></title><description><![CDATA[<p>In the fast-paced world of IT, feedback is not just a tool; it's the cornerstone of growth, innovation, and collaboration. Whether you're mentoring a new hire or guiding a seasoned developer, the way you deliver feedback can shape careers, improve team dynamics, and elevate project outcomes. Among the various feedback strategies, <strong>frequent feedback</strong> and <strong>corrective feedback</strong> play pivotal roles. Let's explore these types, how to implement them effectively, and their profound impact on team development.</p>
<hr />
<h2 id="heading-what-is-frequent-feedback"><strong>What Is Frequent Feedback?</strong></h2>
<p>Frequent feedback is a continuous process of providing constructive input and encouragement to team members. Unlike annual performance reviews or milestone-based evaluations, this type of feedback happens regularly: daily, weekly, or at key moments of collaboration.</p>
<h3 id="heading-key-actions-for-frequent-feedback"><strong>Key Actions for Frequent Feedback</strong></h3>
<ol>
<li><p><strong>Be Immediate and Specific</strong><br /> Recognize achievements or address issues as they arise. For example, after a sprint demo, say: <em>"Great job on implementing that caching strategy; it improved response time by 40%!"</em></p>
</li>
<li><p><strong>Focus on Positives as Well as Improvements</strong><br /> Celebrate small wins. It reinforces good behaviors and motivates individuals to aim higher.</p>
</li>
<li><p><strong>Tie Feedback to Team or Individual Goals</strong><br /> Frame your input around the person's career objectives or the project's success.</p>
</li>
</ol>
<hr />
<h2 id="heading-corrective-feedback-addressing-areas-of-improvement"><strong>Corrective Feedback: Addressing Areas of Improvement</strong></h2>
<p>Corrective feedback is the process of addressing a behavior, decision, or skill gap that requires improvement. It's often more delicate because it focuses on what went wrong or what could be better.</p>
<h3 id="heading-key-actions-for-corrective-feedback"><strong>Key Actions for Corrective Feedback</strong></h3>
<ol>
<li><p><strong>Be Empathetic and Solution-Oriented</strong><br /> Example: Instead of saying, <em>"Your code is too messy,"</em> say, <em>"Let's work on improving the readability of this section so the team can maintain it easily."</em></p>
</li>
<li><p><strong>Frame It Around Growth</strong><br /> Emphasize the potential for improvement rather than the problem itself.</p>
</li>
<li><p><strong>Provide Actionable Next Steps</strong><br /> Suggest specific ways to address the issue, such as code reviews, pairing sessions, or training.</p>
</li>
<li><p><strong>Balance with Encouragement</strong><br /> Acknowledge what the person does well while addressing the challenge. This builds trust and receptiveness.</p>
</li>
</ol>
<hr />
<h2 id="heading-frequent-vs-corrective-feedback-when-and-how-to-use-them"><strong>Frequent vs. Corrective Feedback: When and How to Use Them</strong></h2>
<div class="hn-table">
<table>
<thead>
<tr>
<td><strong>Aspect</strong></td><td><strong>Frequent Feedback</strong></td><td><strong>Corrective Feedback</strong></td></tr>
</thead>
<tbody>
<tr>
<td><strong>Purpose</strong></td><td>Reinforce positive behaviors and maintain momentum.</td><td>Address specific issues for growth and improvement.</td></tr>
<tr>
<td><strong>Frequency</strong></td><td>Ongoing, integrated into daily/weekly routines.</td><td>Occasional, as issues arise or during formal reviews.</td></tr>
<tr>
<td><strong>Tone</strong></td><td>Encouraging, positive, collaborative.</td><td>Constructive, empathetic, solution-driven.</td></tr>
<tr>
<td><strong>Effect on Morale</strong></td><td>Boosts confidence and engagement.</td><td>May challenge but fosters accountability.</td></tr>
<tr>
<td><strong>Example</strong></td><td>"Your documentation on this feature is clear and helpful!"</td><td>"Your solution works, but let's consider how to reduce its complexity."</td></tr>
</tbody>
</table>
</div><hr />
<h2 id="heading-why-balance-both-feedback-types-in-engineering-teams"><strong>Why Balance Both Feedback Types in Engineering Teams?</strong></h2>
<ol>
<li><p><strong>Fostering a Growth Culture</strong><br /> Combining frequent praise with constructive guidance creates an environment where team members feel valued and challenged.</p>
</li>
<li><p><strong>Accelerating Skill Development</strong><br /> While frequent feedback keeps confidence high, corrective feedback ensures skills and habits evolve.</p>
</li>
<li><p><strong>Improved Team Cohesion</strong><br /> Regular, transparent communication helps build trust and mitigates misunderstandings.</p>
</li>
<li><p><strong>Resilience and Adaptability</strong><br /> Corrective feedback, delivered effectively, makes team members more receptive to change and criticism, key traits in IT.</p>
</li>
</ol>
<hr />
<h2 id="heading-tips-for-delivering-feedback-effectively"><strong>Tips for Delivering Feedback Effectively</strong></h2>
<ol>
<li><p><strong>Be Consistent</strong><br /> Feedback should be a habit, not an event. Schedule regular check-ins to discuss progress.</p>
</li>
<li><p><strong>Leverage Data and Examples</strong><br /> Concrete examples make feedback actionable and objective. For instance, refer to specific code commits, incidents, or metrics.</p>
</li>
<li><p><strong>Listen Actively</strong><br /> Feedback is a two-way street. Allow team members to share their perspective.</p>
</li>
<li><p><strong>Document and Follow Up</strong><br /> Track feedback in one-on-ones or performance reviews to measure growth and reinforce accountability.</p>
</li>
<li><p><strong>Tailor Your Approach</strong><br /> Each individual is different. Some may prefer direct feedback, while others need a softer touch.</p>
</li>
</ol>
<hr />
<h2 id="heading-the-ripple-effect-impact-of-effective-feedback"><strong>The Ripple Effect: Impact of Effective Feedback</strong></h2>
<p>When used effectively, frequent and corrective feedback doesn't just enhance individual performance; it transforms teams:</p>
<ul>
<li><p><strong>Higher Productivity</strong>: Clearer expectations and faster course corrections minimize wasted effort.</p>
</li>
<li><p><strong>Stronger Relationships</strong>: A culture of transparency fosters trust and collaboration.</p>
</li>
<li><p><strong>Greater Innovation</strong>: Constructive feedback encourages calculated risks and creative problem-solving.</p>
</li>
</ul>
<hr />
<h2 id="heading-feedback-is-leadership-in-action"><strong>Feedback Is Leadership in Action</strong></h2>
<p>As mentors and leaders, our ability to deliver feedback defines our effectiveness. By mastering frequent and corrective feedback, we empower our engineering teams to grow stronger, innovate faster, and tackle challenges more confidently.</p>
<p>The next time you engage with your team, ask yourself: <em>Am I balancing encouragement with challenge? Am I building trust and accountability in equal measure?</em> The answers to these questions will pave the way for long-term success.</p>
<h2 id="heading-references"><strong>References</strong></h2>
<p>Valuable references on corrective and frequent feedback in the workplace:</p>
<ol>
<li><p><strong>Business Management Daily</strong> - This article discusses how to build a feedback culture, emphasizing that effective feedback should be continuous and part of everyday work life. It highlights the importance of constructive feedback and its role in fostering a culture of trust and improvement. <a target="_blank" href="http://www.businessmanagementdaily.com/74763/workplace-feedback-the-backbone-of-effective-communication/">Read more.</a></p>
</li>
<li><p><strong>BetterUp</strong> - Provides an overview of different types of feedback, including constructive, upward, and real-time feedback. It explains how constructive feedback can improve employee performance and stresses the importance of creating a psychologically safe environment for open communication. <a target="_blank" href="https://www.betterup.com/blog/types-of-feedback">Read more here</a>.</p>
</li>
<li><p><strong>Greater Good Science Center (UC Berkeley)</strong> - Explores how feedback loops strengthen team relationships and offers actionable tips for feedback delivery, such as avoiding the "feedback sandwich" and using radical candor to create meaningful and honest communication. <a target="_blank" href="https://greatergood.berkeley.edu/article/item/nine_tips_for_giving_better_feedback_at_work">Read more here</a>.</p>
</li>
<li><p><a target="_blank" href="https://intime.uni.edu/frequent-feedback">Frequent feedback</a>: University of Northern Iowa</p>
</li>
<li><p><a target="_blank" href="https://ctb.ku.edu/en/table-of-contents/advocacy/encouragement-education/corrective-feedback/main#:~:text=Corrective%20feedback%20is%20information%20provided,effective%20advocacy%20or%20public%20policy.">Corrective feedback</a>: The University of Kansas</p>
</li>
</ol>
]]></description><link>https://thediligentengineer.com/frequent-feedback-vs-corrective-feedback-in-engineering</link><guid isPermaLink="true">https://thediligentengineer.com/frequent-feedback-vs-corrective-feedback-in-engineering</guid><category><![CDATA[mentorship]]></category><category><![CDATA[leadership]]></category><category><![CDATA[management]]></category><dc:creator><![CDATA[The Diligent Engineer]]></dc:creator></item><item><title><![CDATA[Most Asked Questions During My Job Searching Journey]]></title><description><![CDATA[<p>As I progressed through my job search, I encountered some insightful and frequently asked questions that challenged me to reflect on my professional journey. I'm sharing my answers to these questions, which might provide value to others preparing for similar situations.</p>
<hr />
<h3 id="heading-what-was-the-proudest-feature-you-have-developed"><strong>What was the proudest feature you have developed?</strong></h3>
<p>One of my proudest achievements was deploying my very first product to production: <strong>the Singapore Loyalty Program application, <em>sG</em></strong>. This project was particularly meaningful as it marked a true <em>0-to-1</em> journey under challenging circumstances, including being part of a small, under-resourced team.</p>
<p>I took full ownership of the product lifecycle, including:</p>
<ul>
<li><p><strong>Infrastructure setup</strong>: Configured the backend server and admin dashboard using Django.</p>
</li>
<li><p><strong>UI implementation</strong>: Built the app's interface with React Native, delivering a seamless user experience.</p>
</li>
<li><p><strong>Continuous deployment</strong>: Managed CI/CD pipelines using Jenkins and GitLab CI to ensure regular, stable updates.</p>
</li>
<li><p><strong>Client collaboration</strong>: Worked closely with stakeholders to gather feedback and improve the application iteratively.</p>
</li>
</ul>
<p>The experience was intense, pushing me to the brink of burnout. However, seeing the platform go live and witnessing its impact on businesses and customers was immensely rewarding. This project strengthened my confidence in taking ownership of complex challenges, from architecture to deployment, and delivering meaningful outcomes.</p>
<hr />
<h3 id="heading-what-was-the-most-challenging-technical-difficulty-you-have-solved"><strong>What was the most challenging technical difficulty you have solved?</strong></h3>
<p>One of the most technically challenging problems I faced was building an <strong>ETL pipeline for a data service</strong> while working at GT. The project constraints were formidable:</p>
<ul>
<li><p>Limited resources, including a single PostgreSQL database, one ELK node, and a single EC2 instance.</p>
</li>
<li><p>Over 20 million records for 1,000 users, requiring real-time updates.</p>
</li>
</ul>
<p>To overcome these challenges, I:</p>
<ol>
<li><p><strong>Designed an optimized database schema</strong> and implemented efficient GraphQL APIs to support fast querying.</p>
</li>
<li><p>Migrated heavy queries from SQL to <strong>Elasticsearch</strong>, enabling full-text search and large-scale data handling.</p>
</li>
<li><p>Built a robust synchronization pipeline:</p>
<ul>
<li>PostgreSQL → our PostgreSQL → Elasticsearch, ensuring real-time data updates.</li>
</ul>
</li>
<li><p>Set up comprehensive <strong>observability</strong> with Prometheus, Grafana, and ELK to monitor and maintain system reliability.</p>
</li>
</ol>
<p>Beyond the ETL pipeline, I also managed:</p>
<ul>
<li><p>A recommender system.</p>
</li>
<li><p>A Salesforce Data Cloud ingestion service.</p>
</li>
<li><p>A payment service integrated with Stripe.</p>
</li>
</ul>
<p>This project taught me the art of designing efficient architectures and building scalable, reliable systems, even with constrained resources.</p>
<hr />
<h3 id="heading-what-was-the-biggest-mistake-you-made-as-a-developer"><strong>What was the biggest mistake you made as a developer?</strong></h3>
<p>Reflecting on my journey, one of my biggest mistakes occurred during the early stages of the sG project. As the sole developer initially, I focused on building a simple, easy-to-understand codebase to enable quick onboarding for new team members. However, I overlooked the importance of establishing <strong>linting rules and coding conventions</strong> from the outset.</p>
<p>When the team expanded, this oversight caused significant friction:</p>
<ul>
<li><p>Code conflicts became frequent due to differing IDE settings and formatting styles.</p>
</li>
<li><p>This slowed down development and impacted team efficiency.</p>
</li>
</ul>
<p>To address this, I implemented:</p>
<ul>
<li><p>Standardized <strong>linting rules</strong>.</p>
</li>
<li><p>A comprehensive <strong>coding style guide</strong>.</p>
</li>
<li><p><strong>Pre-commit hooks</strong> to enforce consistency automatically.</p>
</li>
</ul>
<p>Since then, I've prioritized setting up foundational tools and processes early in every project to ensure smoother collaboration and maintainability as teams scale.</p>
<h3 id="heading-general-tips-for-answering-these-questions"><strong>General Tips for Answering These Questions</strong></h3>
<ol>
<li><p><strong>Use the STAR method</strong>: This structured approach helps you organize your thoughts and deliver concise, impactful answers.</p>
</li>
<li><p><strong>Be specific</strong>: Avoid vague statements; provide details about tools, technologies, and strategies you used.</p>
</li>
<li><p><strong>Quantify outcomes</strong>: Back up your answers with measurable results (e.g., "improved system performance by 30%").</p>
</li>
<li><p><strong>Highlight transferable skills</strong>: Even if the experience is from a different domain, emphasize skills like problem-solving, teamwork, or ownership.</p>
</li>
</ol>
<hr />
<p>Reflecting on these experiences has been a valuable exercise for me during this job search. Each of these challenges helped me grow as an engineer, and I hope sharing them here provides insights or inspiration to others navigating similar paths.</p>
<p>If you have thoughts or similar experiences, I'd love to hear them!</p>
<p><strong>Cheers,</strong><br /><strong>Kai</strong></p>
]]></description><link>https://thediligentengineer.com/most-asked-questions-during-my-job-searching-journey</link><guid isPermaLink="true">https://thediligentengineer.com/most-asked-questions-during-my-job-searching-journey</guid><category><![CDATA[interview]]></category><category><![CDATA[interview questions]]></category><dc:creator><![CDATA[The Diligent Engineer]]></dc:creator></item><item><title><![CDATA[Building a Lightweight HTTP Client in Bash: `exec 3<>/dev/tcp/"$host"/80`]]></title><description><![CDATA[<p>In the world of web development and system administration, HTTP clients are essential tools for interacting with web services. While many developers rely on popular utilities like <code>curl</code> or <code>wget</code>, it's both educational and sometimes necessary to create a basic HTTP client using nothing but Bash. This approach not only deepens our understanding of the HTTP protocol but also proves useful in environments where standard tools may not be available.
</p>
<p>Our Bash-based HTTP client supports the fundamental HTTP methods: <code>GET</code>, <code>POST</code>, <code>PUT</code>, <code>DELETE</code>, and <code>OPTIONS</code>. Here's a breakdown of its key components:</p>
<ol>
<li>TCP Connection Handling
The script uses Bash's built-in <code>/dev/tcp/</code> pseudo-device to establish TCP connections. This often-overlooked feature (handled by Bash itself, not an actual device file) enables TCP socket programming directly in Bash.<pre><code class="lang-bash"><span class="hljs-built_in">exec</span> 3&lt;&gt;/dev/tcp/<span class="hljs-string">"<span class="hljs-variable">$host</span>"</span>/80
</code></pre>
This line opens a bidirectional connection to the specified host on port <code>80</code>, using file descriptor 3.</li>
<li>Request Formatting
The script constructs HTTP requests using here-documents, ensuring proper formatting of headers and body content.<pre><code class="lang-bash"><span class="hljs-built_in">echo</span> -e <span class="hljs-string">"<span class="hljs-variable">$method</span> <span class="hljs-variable">$path</span> HTTP/1.1\r
Host: <span class="hljs-variable">$host</span>\r
Connection: close\r
Content-Length: <span class="hljs-variable">${#data}</span>\r
\r
<span class="hljs-variable">$data</span>"</span> &gt;&amp;3
</code></pre>
</li>
<li>Response Handling
After sending the request, the script reads the response using the <code>cat</code> command:<pre><code class="lang-bash">cat &lt;&amp;3
</code></pre>
This simple approach captures both headers and body content.</li>
<li>Method-Specific Functions
The script provides separate functions for each HTTP method, simplifying usage and improving readability.
The complete Script is on <a target="_blank" href="https://github.com/lyluongthien/rb-http-request/blob/main/rb-req-client.sh">my GitHub</a></li>
</ol>
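<p>To make the request bytes easy to inspect, the request writer can be factored into a standalone function. This is a sketch, not part of the original script; <code>build_request</code> is an illustrative name, and <code>printf</code> is used instead of <code>echo -e</code> for portability:</p>
<pre><code class="lang-bash">#!/bin/bash
# Sketch: emit the exact request bytes the client writes to fd 3.
# build_request is a hypothetical helper, not from the original script.
build_request() {
    local method="$1" path="$2" host="$3" data="$4"
    printf '%s %s HTTP/1.1\r\nHost: %s\r\nConnection: close\r\nContent-Length: %s\r\n\r\n%s' \
        "$method" "$path" "$host" "${#data}" "$data"
}

build_request POST /api example.com 'key1=value1'
</code></pre>
<p>Redirecting its output with <code>build_request ... &gt;&amp;3</code> behaves like the <code>echo -e</code> version, while keeping the byte layout visible for debugging.</p>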
<pre><code class="lang-bash"><span class="hljs-meta">#!/bin/bash</span>

<span class="hljs-function"><span class="hljs-title">perform_request</span></span>() {
    <span class="hljs-built_in">local</span> method=<span class="hljs-string">"<span class="hljs-variable">$1</span>"</span>
    <span class="hljs-built_in">local</span> url=<span class="hljs-string">"<span class="hljs-variable">$2</span>"</span>
    <span class="hljs-built_in">local</span> data=<span class="hljs-string">"<span class="hljs-variable">$3</span>"</span>
    <span class="hljs-built_in">local</span> host=$(<span class="hljs-built_in">echo</span> <span class="hljs-string">"<span class="hljs-variable">$url</span>"</span> | awk -F/ <span class="hljs-string">'{print $3}'</span>)
    <span class="hljs-comment"># cut -f4- keeps multi-segment paths like /api/resource intact</span>
    <span class="hljs-built_in">local</span> path=<span class="hljs-string">"/"</span>$(<span class="hljs-built_in">echo</span> <span class="hljs-string">"<span class="hljs-variable">$url</span>"</span> | cut -d/ -f4-)

    <span class="hljs-built_in">exec</span> 3&lt;&gt;/dev/tcp/<span class="hljs-string">"<span class="hljs-variable">$host</span>"</span>/80

    <span class="hljs-built_in">echo</span> -e <span class="hljs-string">"<span class="hljs-variable">$method</span> <span class="hljs-variable">$path</span> HTTP/1.1\r
Host: <span class="hljs-variable">$host</span>\r
Connection: close\r
Content-Length: <span class="hljs-variable">${#data}</span>\r
\r
<span class="hljs-variable">$data</span>"</span> &gt;&amp;3

    cat &lt;&amp;3
    <span class="hljs-built_in">exec</span> 3&gt;&amp;-
}

<span class="hljs-function"><span class="hljs-title">get_request</span></span>() {
    perform_request <span class="hljs-string">"GET"</span> <span class="hljs-string">"<span class="hljs-variable">$1</span>"</span> <span class="hljs-string">""</span>
}

<span class="hljs-function"><span class="hljs-title">post_request</span></span>() {
    perform_request <span class="hljs-string">"POST"</span> <span class="hljs-string">"<span class="hljs-variable">$1</span>"</span> <span class="hljs-string">"<span class="hljs-variable">$2</span>"</span>
}

<span class="hljs-function"><span class="hljs-title">put_request</span></span>() {
    perform_request <span class="hljs-string">"PUT"</span> <span class="hljs-string">"<span class="hljs-variable">$1</span>"</span> <span class="hljs-string">"<span class="hljs-variable">$2</span>"</span>
}

<span class="hljs-function"><span class="hljs-title">delete_request</span></span>() {
    perform_request <span class="hljs-string">"DELETE"</span> <span class="hljs-string">"<span class="hljs-variable">$1</span>"</span> <span class="hljs-string">""</span>
}

<span class="hljs-function"><span class="hljs-title">options_request</span></span>() {
    perform_request <span class="hljs-string">"OPTIONS"</span> <span class="hljs-string">"<span class="hljs-variable">$1</span>"</span> <span class="hljs-string">""</span>
}

<span class="hljs-comment"># Usage examples</span>
<span class="hljs-comment"># get_request "http://example.com"</span>
<span class="hljs-comment"># post_request "http://example.com/api" "key1=value1&amp;key2=value2"</span>
<span class="hljs-comment"># put_request "http://example.com/api/resource" "updated_data"</span>
<span class="hljs-comment"># delete_request "http://example.com/api/resource"</span>
<span class="hljs-comment"># options_request "http://example.com"</span>
</code></pre>
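<p>Once the raw response is captured, pulling the status code out of it is a one-liner. The following <code>parse_status</code> helper is a hypothetical addition, not part of the script above; it assumes a status line of the form <code>HTTP/1.1 200 OK</code>:</p>
<pre><code class="lang-bash">#!/bin/bash
# Hypothetical helper: extract the status code from a raw HTTP response.
# head takes the status line, tr strips the trailing CR, awk picks field 2.
parse_status() {
    head -n 1 | tr -d '\r' | awk '{print $2}'
}

printf 'HTTP/1.1 200 OK\r\nContent-Length: 0\r\n\r\n' | parse_status   # prints 200
</code></pre>
<p>For example, <code>get_request "http://example.com" | parse_status</code> would print just the numeric status.</p>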
<p>While this implementation is basic, it demonstrates the power and flexibility of Bash for networking tasks. It's particularly useful for quick tests, debugging, or in environments where installing additional tools is not feasible.
</p>
<p>In our next article, we'll explore the server side of HTTP by building a simple CRUD server in Bash, rounding out our exploration of raw HTTP handling.</p>
]]></description><link>https://thediligentengineer.com/building-a-lightweight-http-client-in-bash-exec-3devtcphost80</link><guid isPermaLink="true">https://thediligentengineer.com/building-a-lightweight-http-client-in-bash-exec-3devtcphost80</guid><category><![CDATA[Bash]]></category><category><![CDATA[backend]]></category><category><![CDATA[http]]></category><category><![CDATA[TCP]]></category><dc:creator><![CDATA[The Diligent Engineer]]></dc:creator></item><item><title><![CDATA[The React Hell]]></title><description><![CDATA[<p>After switching to the backend and system architecture, I've always appreciated the elegance and efficiency of clean code. However, upon revisiting the front-end side of my recent projects, I was confronted with a common issue known as "React hell." This scenario arises when multiple React components or context providers nest within each other, leading to deeply nested structures that are hard to read and maintain. This post details my journey of tackling this problem with a custom solution I named <code>FlatedReact</code>.</p>
<h3 id="heading-the-problem-react-hell">The Problem: React Hell</h3>
<p>React's context API is incredibly powerful for managing global state and passing props through the component tree without prop drilling. However, as applications grow, it's easy to end up with multiple providers nested inside each other, creating a callback hell of sorts in React. Here's a typical example:</p>
<pre><code class="lang-jsx">&lt;AuthProvider&gt;
  <span class="xml"><span class="hljs-tag">&lt;<span class="hljs-name">ThemeProvider</span>&gt;</span>
    <span class="hljs-tag">&lt;<span class="hljs-name">IntercomProvider</span>&gt;</span>
      <span class="hljs-tag">&lt;<span class="hljs-name">EmailVerificationProvider</span>&gt;</span>
        <span class="hljs-tag">&lt;<span class="hljs-name">TooltipProvider</span>&gt;</span>
          {children}
        <span class="hljs-tag">&lt;/<span class="hljs-name">TooltipProvider</span>&gt;</span>
      <span class="hljs-tag">&lt;/<span class="hljs-name">EmailVerificationProvider</span>&gt;</span>
    <span class="hljs-tag">&lt;/<span class="hljs-name">IntercomProvider</span>&gt;</span>
  <span class="hljs-tag">&lt;/<span class="hljs-name">ThemeProvider</span>&gt;</span></span>
&lt;/AuthProvider&gt;
</code></pre>
<p>While this setup works, it quickly becomes cumbersome and difficult to manage, especially as more providers are added.</p>
<h3 id="heading-the-solution-flatedreact">The Solution: FlatedReact</h3>
<p>To address this issue, I created a utility named FlatedReact. This utility flattens the provider structure, making the code cleaner and easier to read. The core idea is to use a tuple where the first element is the React function component and the second is its props. Here's the TypeScript type for it:</p>
<pre><code class="lang-jsx">type FlatedItem&lt;T <span class="hljs-keyword">extends</span> FC&lt;any&gt; = FC&lt;any&gt;&gt; = [T] | [T, (Parameters&lt;T&gt;[<span class="hljs-number">0</span>] &amp; { <span class="hljs-attr">children</span>: <span class="hljs-literal">undefined</span> }) | <span class="hljs-literal">undefined</span>];
</code></pre>
<p>I then created a helper function to facilitate creating these tuples:</p>
<pre><code class="lang-jsx"><span class="hljs-function"><span class="hljs-keyword">function</span> <span class="hljs-title">makeFlatedItem</span>&lt;<span class="hljs-title">T</span> <span class="hljs-title">extends</span> (<span class="hljs-params">...args: any[]</span>) =&gt; <span class="hljs-title">any</span>&gt;(<span class="hljs-params">
  component: T, 
  props: Omit&lt;Parameters&lt;T&gt;[<span class="hljs-number">0</span>], <span class="hljs-string">'children'</span>&gt; | undefined = undefined
</span>) </span>{
  <span class="hljs-keyword">return</span> [component, props] <span class="hljs-keyword">as</span> FlatedItem&lt;T&gt;;
}
</code></pre>
<h3 id="heading-the-renderer-component">The Renderer Component</h3>
<p>The <code>Renderer</code> component is designed to take an array of these tuples and recursively render them. This effectively flattens the nested providers into a more readable structure:</p>
<pre><code class="lang-jsx"><span class="hljs-keyword">const</span> Renderer = <span class="hljs-function">(<span class="hljs-params">{ components, children }: { components: FlatedItem[]; children?: ReactNode }</span>) =&gt;</span> {
  <span class="hljs-keyword">const</span> renderProvider = (components: FlatedItem[], <span class="hljs-attr">children</span>: ReactNode): <span class="hljs-function"><span class="hljs-params">ReactElement</span> =&gt;</span> {
    <span class="hljs-keyword">const</span> [tuple, ...restComponent] = components;
    <span class="hljs-keyword">const</span> [Component, componentProps = {}] = tuple <span class="hljs-keyword">as</span> FlatedItem;

    <span class="hljs-keyword">if</span> (Component) {
      <span class="hljs-keyword">return</span> <span class="xml"><span class="hljs-tag">&lt;<span class="hljs-name">Component</span> {<span class="hljs-attr">...componentProps</span>}&gt;</span>{renderProvider(restComponent, children)}<span class="hljs-tag">&lt;/<span class="hljs-name">Component</span>&gt;</span></span>;
    }

    <span class="hljs-keyword">return</span> <span class="xml"><span class="hljs-tag">&lt;&gt;</span>{children}<span class="hljs-tag">&lt;/&gt;</span></span>;
  };

  <span class="hljs-keyword">return</span> renderProvider(components, children);
};

<span class="hljs-keyword">const</span> FlatedReact = {
  <span class="hljs-attr">Wrap</span>: Renderer,
  <span class="hljs-attr">Load</span>: makeFlatedItem,
};
</code></pre>
<h3 id="heading-simplified-usage">Simplified Usage</h3>
<p>The <code>FlatedReact.Load</code> function is not strictly necessary for the functionality but serves an important role for TypeScript type-checking of component props. Here's how you can use FlatedReact to simplify your component structure:</p>
<pre><code class="lang-jsx"><span class="hljs-keyword">export</span> <span class="hljs-keyword">default</span> <span class="hljs-keyword">async</span> <span class="hljs-function"><span class="hljs-keyword">function</span> <span class="hljs-title">SomeLayoutComponent</span>(<span class="hljs-params"></span>) </span>{
  <span class="hljs-keyword">const</span> session = <span class="hljs-keyword">await</span> getServerSession(authOptions);
  <span class="hljs-keyword">const</span> accessToken = <span class="hljs-keyword">await</span> getServerAccessToken();
  <span class="hljs-keyword">const</span> AuthSession = session ? { ...session, accessToken } : session;

  <span class="hljs-keyword">return</span> (
    <span class="xml"><span class="hljs-tag">&lt;<span class="hljs-name">FlatedReact.Wrap</span>
      <span class="hljs-attr">components</span>=<span class="hljs-string">{[</span>
        <span class="hljs-attr">FlatedReact.Load</span>(<span class="hljs-attr">TooltipProvider</span>),
        <span class="hljs-attr">FlatedReact.Load</span>(<span class="hljs-attr">AuthProvider</span>, { <span class="hljs-attr">session:</span> <span class="hljs-attr">AuthSession</span> }),
        <span class="hljs-attr">FlatedReact.Load</span>(<span class="hljs-attr">IntercomProvider</span>),
        <span class="hljs-attr">FlatedReact.Load</span>(<span class="hljs-attr">EmailVerificationProvider</span>),
        <span class="hljs-attr">FlatedReact.Load</span>(<span class="hljs-attr">ThemeProvider</span>, {
          <span class="hljs-attr">attribute:</span> '<span class="hljs-attr">class</span>',
          <span class="hljs-attr">defaultTheme:</span> '<span class="hljs-attr">dark</span>',
          <span class="hljs-attr">enableSystem:</span> <span class="hljs-attr">true</span>,
        }),
      ]}
    &gt;</span>
      {/* ...other children */}
    <span class="hljs-tag">&lt;/<span class="hljs-name">FlatedReact.Wrap</span>&gt;</span></span>
  );
}
</code></pre>
<h3 id="heading-typescript-support">Typescript support</h3>
<p>By using the <code>FlatedReact.Load</code> function, your IDE and the <code>tsc</code> compiler can enforce the right props for each React component. The following VS Code screenshots show type checking in action:</p>
<p>Typescript checking:
<img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1722170938317/47e26509-beac-491d-b926-6cff1ecc9a22.png" alt="Typescript checking" class="image--center mx-auto" />
<img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1722170927608/677e63f8-5064-4a82-b70b-8fe6cc10765f.png" alt="Typescript checking 2" class="image--center mx-auto" /></p>
<p>Code completion:
<img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1722170945791/1a0e5664-fd48-4ae6-8739-aa7b5ed72d2c.png" alt="Code completion" class="image--center mx-auto" /></p>
<h3 id="heading-comparison-with-other-solutions">Comparison with Other Solutions</h3>
<p>I had read some articles that offered different approaches to this problem. One such article was by Alfredo Salzillo, titled <a target="_blank" href="https://dev.to/alfredosalzillo/the-react-context-hell-7p4">The React Context Hell</a>. That approach clones elements, but it isn't well supported in TypeScript, and calling <code>cloneElement</code> directly leads to double renders.</p>
<pre><code class="lang-jsx"><span class="hljs-comment">// code from: https://dev.to/alfredosalzillo/the-react-context-hell-7p4</span>
<span class="hljs-comment">// calling `cloneElement` inside MultiProvider will render another `ReduxProvider`, i.e. call ReduxProvider twice</span>
&lt;MultiProvider
      providers={[
        <span class="xml"><span class="hljs-tag">&lt;<span class="hljs-name">ReduxProvider</span> <span class="hljs-attr">value</span>=<span class="hljs-string">{store}</span> /&gt;</span></span>, 
        <span class="hljs-comment">// ...others, </span>
      ]}
    &gt;
      <span class="xml"><span class="hljs-tag">&lt;<span class="hljs-name">HelloWorld</span> /&gt;</span></span>
    &lt;/MultiProvider&gt;
</code></pre>
<p>Another insightful read was <a target="_blank" href="https://medium.com/@ambrosekibet576/navigating-reacts-context-hell-f6f4fe5dc23a">Navigating React's Context Hell</a> by Ambrose Kibet. This article suggested moving to a global state, which, while effective in some scenarios, doesn't always work well with large applications or special providers like session providers in Next.js.</p>
<h3 id="heading-versatility-beyond-providers">Versatility Beyond Providers</h3>
<p>An important note is that FlatedReact is not limited to simplifying context providers. It supports any React components nested within each other. This flexibility means that you can use FlatedReact to flatten and manage various component hierarchies, ensuring that your component structure remains clean and maintainable, regardless of the specific components involved.</p>
<h3 id="heading-testing-flatedreact">Testing FlatedReact</h3>
<p>To ensure FlatedReact works as expected, I've written tests using Jest and React Testing Library. These tests verify that FlatedReact correctly renders nested components and handles different scenarios, such as components with and without props.</p>
<pre><code class="lang-jsx"><span class="hljs-keyword">import</span> { render } <span class="hljs-keyword">from</span> <span class="hljs-string">"@testing-library/react"</span>;
<span class="hljs-keyword">import</span> FlatedReact <span class="hljs-keyword">from</span> <span class="hljs-string">"../dist/cjs"</span>; <span class="hljs-comment">// Adjust the import based on your project structure</span>

<span class="hljs-comment">// Mock components for testing</span>
<span class="hljs-keyword">const</span> MockComponentA: React.FC&lt;
  React.PropsWithChildren&lt;{ <span class="hljs-attr">message</span>: string }&gt;
&gt; = <span class="hljs-function">(<span class="hljs-params">{ message, children }</span>) =&gt;</span> (
  <span class="xml"><span class="hljs-tag">&lt;<span class="hljs-name">div</span>&gt;</span>
    A: {message}
    {children}
  <span class="hljs-tag">&lt;/<span class="hljs-name">div</span>&gt;</span></span>
);

<span class="hljs-keyword">const</span> MockComponentB: React.FC&lt;React.PropsWithChildren&lt;{ <span class="hljs-attr">count</span>: number }&gt;&gt; = <span class="hljs-function">(<span class="hljs-params">{
  count,
  children,
}</span>) =&gt;</span> (
  <span class="xml"><span class="hljs-tag">&lt;<span class="hljs-name">div</span>&gt;</span>
    B: {count}
    {children}
  <span class="hljs-tag">&lt;/<span class="hljs-name">div</span>&gt;</span></span>
);

<span class="hljs-keyword">let</span> componentCIndex = <span class="hljs-number">0</span>;
<span class="hljs-keyword">const</span> MockComponentC: React.FC&lt;React.PropsWithChildren&gt; = <span class="hljs-function">(<span class="hljs-params">{ children }</span>) =&gt;</span> (
  <span class="xml"><span class="hljs-tag">&lt;<span class="hljs-name">div</span>&gt;</span>
    C: {++componentCIndex}
    {children}
  <span class="hljs-tag">&lt;/<span class="hljs-name">div</span>&gt;</span></span>
);

describe(<span class="hljs-string">"FlatedReact"</span>, <span class="hljs-function">() =&gt;</span> {
  it(<span class="hljs-string">"renders nested components correctly"</span>, <span class="hljs-function">() =&gt;</span> {
    <span class="hljs-keyword">const</span> { queryAllByLabelText, container } = render(
      <span class="xml"><span class="hljs-tag">&lt;<span class="hljs-name">FlatedReact.Wrap</span>
        <span class="hljs-attr">components</span>=<span class="hljs-string">{[</span>
          <span class="hljs-attr">FlatedReact.Load</span>(<span class="hljs-attr">MockComponentA</span>, { <span class="hljs-attr">message:</span> "<span class="hljs-attr">Hello</span>" }),
          <span class="hljs-attr">FlatedReact.Load</span>(<span class="hljs-attr">MockComponentB</span>, { <span class="hljs-attr">count:</span> <span class="hljs-attr">42</span> }),
          <span class="hljs-attr">MockComponentC</span>,
        ]}
      &gt;</span>
        <span class="hljs-tag">&lt;<span class="hljs-name">span</span>&gt;</span>Children Content<span class="hljs-tag">&lt;/<span class="hljs-name">span</span>&gt;</span>
      <span class="hljs-tag">&lt;/<span class="hljs-name">FlatedReact.Wrap</span>&gt;</span></span>
    );

    expect(queryAllByLabelText(<span class="hljs-string">"A: Hello"</span>)).toBeTruthy();
    expect(queryAllByLabelText(<span class="hljs-string">"B: 42"</span>)).toBeTruthy();
    expect(queryAllByLabelText(<span class="hljs-string">"C"</span>)).toBeTruthy();
    expect(container).toMatchSnapshot();
  });

  it(<span class="hljs-string">"renders components with default props"</span>, <span class="hljs-function">() =&gt;</span> {
    <span class="hljs-keyword">const</span> { queryAllByLabelText, container } = render(
      <span class="xml"><span class="hljs-tag">&lt;<span class="hljs-name">FlatedReact.Wrap</span>
        <span class="hljs-attr">components</span>=<span class="hljs-string">{[</span>
          <span class="hljs-attr">FlatedReact.Load</span>(<span class="hljs-attr">MockComponentA</span>, { <span class="hljs-attr">message:</span> "<span class="hljs-attr">Default</span> <span class="hljs-attr">Message</span>" }),
          <span class="hljs-attr">FlatedReact.Load</span>(<span class="hljs-attr">MockComponentC</span>),
        ]}
      &gt;</span>
        <span class="hljs-tag">&lt;<span class="hljs-name">span</span>&gt;</span>Default Children<span class="hljs-tag">&lt;/<span class="hljs-name">span</span>&gt;</span>
      <span class="hljs-tag">&lt;/<span class="hljs-name">FlatedReact.Wrap</span>&gt;</span></span>
    );

    expect(queryAllByLabelText(<span class="hljs-string">"A: Default Message"</span>)).toBeTruthy();
    expect(queryAllByLabelText(<span class="hljs-string">"C"</span>)).toBeTruthy();
    expect(queryAllByLabelText(<span class="hljs-string">"Default Children"</span>)).toBeTruthy();
    expect(container).toMatchSnapshot();
  });

  it(<span class="hljs-string">"renders without props and `FlatedReact.Load` correctly"</span>, <span class="hljs-function">() =&gt;</span> {
    <span class="hljs-keyword">const</span> { queryAllByLabelText, container } = render(
      <span class="xml"><span class="hljs-tag">&lt;<span class="hljs-name">FlatedReact.Wrap</span>
        <span class="hljs-attr">components</span>=<span class="hljs-string">{[</span>
          <span class="hljs-attr">FlatedReact.Load</span>(<span class="hljs-attr">MockComponentC</span>),
          [<span class="hljs-attr">MockComponentC</span>],
          [<span class="hljs-attr">MockComponentC</span>, <span class="hljs-attr">undefined</span>],
          [<span class="hljs-attr">MockComponentC</span>, <span class="hljs-attr">null</span>],
          [<span class="hljs-attr">MockComponentC</span>, {}],
          <span class="hljs-attr">MockComponentC</span>,
        ]}
      &gt;</span>
        <span class="hljs-tag">&lt;<span class="hljs-name">span</span>&gt;</span>No Props Children<span class="hljs-tag">&lt;/<span class="hljs-name">span</span>&gt;</span>
      <span class="hljs-tag">&lt;/<span class="hljs-name">FlatedReact.Wrap</span>&gt;</span></span>
    );

    expect(queryAllByLabelText(<span class="hljs-string">"C"</span>)).toBeTruthy();
    expect(queryAllByLabelText(<span class="hljs-string">"No Props Children"</span>)).toBeTruthy();
    expect(container).toMatchSnapshot();
  });
});
</code></pre>
<h3 id="heading-testing-flatedreact-1">Resources</h3>
<ul>
<li><p>GitHub Repository:</p>
<blockquote>
<p>Explore the code and contribute to the FlatedReact project on GitHub: <a target="_blank" href="https://github.com/cute-me-on-repos/flated-react">FlatedReact on GitHub</a>.</p>
</blockquote>
</li>
<li><p>NPM Package:</p>
<blockquote>
<p>Install FlatedReact and view detailed documentation: <a target="_blank" href="https://www.npmjs.com/package/@cute-me-on-repos/flated-react">FlatedReact on NPM</a>.</p>
</blockquote>
</li>
<li><p>Articles that Inspired the Solution:</p>
<ul>
<li><p><a target="_blank" href="https://dev.to/alfredosalzillo/the-react-context-hell-7p4">The React Context Hell</a> by Alfredo Salzillo.</p>
</li>
<li><p><a target="_blank" href="https://medium.com/@ambrosekibet576/navigating-reacts-context-hell-f6f4fe5dc23a">Navigating React's Context Hell</a> by Ambrose Kibet.</p>
</li>
</ul>
</li>
</ul>
<p>These resources provide further insight into the challenges of nested React contexts and the different approaches to addressing them, highlighting why FlatedReact offers a more streamlined and TypeScript-friendly solution.</p>
<hr />
<p>I created FlatedReact to share a utility that flattens the provider structure, making it more readable and maintainable. By leveraging TypeScript and React's powerful composition capabilities, FlatedReact helps streamline multiple context providers' setup, transforming how we manage global states in React applications.</p>
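The flattening idea can be sketched independently of React. The snippet below is a hedged, simplified illustration of the composition technique (plain string wrappers stand in for providers; it is not FlatedReact's actual source): a list of wrappers is folded with <code>reduceRight</code> so that a deeply nested tree becomes a flat array.

```typescript
// A wrapper takes children and returns the wrapped result, the way a
// provider wraps its subtree. Strings stand in for React elements here.
type Wrapper = (children: string) => string;

// Compose a flat list of wrappers into one wrapper, outermost first —
// the core trick behind flattening nested providers.
function flatten(wrappers: Wrapper[]): Wrapper {
  return (children) => wrappers.reduceRight((acc, wrap) => wrap(acc), children);
}

const withA: Wrapper = (c) => `<A>${c}</A>`;
const withB: Wrapper = (c) => `<B>${c}</B>`;

// The flat list [withA, withB] produces the same nesting as withA(withB(...)).
console.log(flatten([withA, withB])("app")); // "<A><B>app</B></A>"
```

In FlatedReact the same fold is applied to provider components, which is why the `components` array in `FlatedReact.Wrap` reads top-to-bottom like the former nesting order.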
]]></description><link>https://thediligentengineer.com/the-flated-reaction</link><guid isPermaLink="true">https://thediligentengineer.com/the-flated-reaction</guid><category><![CDATA[react-context-hell]]></category><category><![CDATA[React]]></category><category><![CDATA[TypeScript]]></category><category><![CDATA[callback hell ]]></category><dc:creator><![CDATA[The Diligent Engineer]]></dc:creator></item><item><title><![CDATA[The Art of Asking Smarter Questions]]></title><description><![CDATA[<blockquote>
<p>After watching <a target="_blank" href="https://www.youtube.com/watch?v=_ILCCMbhCW4">Chloe's YouTube video</a> on this topic, I decided to write this post as a write-up of my notes on what she shared and what I found interesting.</p>
</blockquote>
<p>In today's fast-paced world, the ability to ask the right questions is becoming increasingly important, especially in leadership roles. As an engineer, understanding and applying the art of asking smarter questions can significantly impact our problem-solving abilities and strategic decision-making processes.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1720955946416/884684f0-c703-4281-bf1c-977272277cbb.jpeg" alt class="image--center mx-auto" /></p>
<h2 id="heading-the-shift-from-answers-to-questions">The Shift from Answers to Questions</h2>
<p>Traditionally, the focus has been on having the right answers. However, advancements in technology and the complexity of modern problems necessitate a shift towards prioritizing the right questions. This transition is crucial for developing innovative solutions and ensuring robust strategies in the face of evolving challenges.</p>
<h2 id="heading-categories-of-strategic-questions">Categories of Strategic Questions</h2>
<p>According to a study by IMD Business School, as highlighted in the Harvard Business Review article "The Art of Asking Smarter Questions," strategic questions can be categorized into five main types: investigative, speculative, productive, interpretive, and subjective. Each category serves a unique purpose in the decision-making process.</p>
<h3 id="heading-investigative-questions">Investigative Questions</h3>
<p>These questions, such as "What do we know?", help in identifying and clarifying existing information. For instance, the failure of the French railway company SNCF to account for the dimensions of regional train stations while purchasing new trains led to significant financial and operational setbacks. Investigative questions would have highlighted these critical details early on, preventing costly mistakes (<a target="_blank" href="https://www.railway-technology.com/news/newsfrances-sncf-orders-2000-new-trains-that-are-too-wide-4274382/">Railway Technology</a>) (<a target="_blank" href="https://www.huffingtonpost.co.uk/2014/05/21/trains-too-wide-for-platforms_n_5363781.html">HuffPost UK</a>).</p>
<h3 id="heading-speculative-questions">Speculative Questions</h3>
<p>Speculative questions, like "What if?", allow leaders to explore different scenarios and potential outcomes. When a team lacks motivation, asking "What if we organize a retreat?" versus "What if we replace the team members?" can lead to vastly different strategies with distinct implications for team morale and project success.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1720955977054/9f1e0593-f1b5-4b7e-bb4e-4a1c454d65aa.png" alt class="image--center mx-auto" /></p>
<h3 id="heading-productive-questions">Productive Questions</h3>
<p>Productive questions focus on implementation and efficiency. Examples include "How can we get it done?" and "How will we measure progress?". The LEGO Group's experience in the early 2000s serves as a cautionary tale. Rapid diversification without considering resource alignment and performance metrics led to substantial losses. Productive questions could have ensured better synchronization and tracking of initiatives.</p>
<h3 id="heading-interpretive-questions">Interpretive Questions</h3>
<p>Interpretive questions, such as "So what?", help in understanding the deeper implications of findings and ideas. For example, during the development of a new electric vehicle, Tesla's engineers could have benefited from asking interpretive questions to foresee the long-term impact and market potential, rather than focusing solely on technical flaws.</p>
<h3 id="heading-subjective-questions">Subjective Questions</h3>
<p>Subjective questions, like "What's unsaid?", address personal and emotional factors that might affect decision-making. The British Airways rebranding effort in 1997 ignored employee and customer sentiments, leading to dissatisfaction and brand disloyalty. Acknowledging these subjective elements could have guided more inclusive and effective strategies.</p>
<h2 id="heading-balancing-question-types-for-effective-leadership">Balancing Question Types for Effective Leadership</h2>
<p>To become more effective decision-makers, it is crucial to balance these types of questions. Assessing one's questioning style, incorporating diverse questions in discussions, and leveraging team strengths to cover blind spots are essential steps.</p>
<p>In conclusion, mastering the art of asking smarter questions can transform how engineers lead and innovate. By focusing on the right questions, we can uncover deeper insights, develop more strategic solutions, and drive meaningful progress in our projects and organizations.</p>
<p>For more insights, see the following references on this topic.</p>
<h2 id="heading-references">References</h2>
<ul>
<li><p><a target="_blank" href="https://www.youtube.com/watch?v=_ILCCMbhCW4">Chloe's YouTube video</a> on this topic (in Vietnamese).</p>
</li>
<li><p><a target="_blank" href="https://hbr.org/2024/05/the-art-of-asking-smarter-questions">The Art of Asking Smarter Questions, Harvard Business Review</a></p>
</li>
<li><p><strong>SNCF Train Dimensions Mistake</strong>:</p>
<ul>
<li>French railway operator SNCF ordered new trains that were too wide for many regional platforms due to incorrect dimensions provided by RFF. The error, discovered after the order, resulted in a costly modification of platforms to accommodate the trains (<a target="_blank" href="https://www.railway-technology.com/news/newsfrances-sncf-orders-2000-new-trains-that-are-too-wide-4274382/">Railway Technology</a>) (<a target="_blank" href="https://www.huffingtonpost.co.uk/2014/05/21/trains-too-wide-for-platforms_n_5363781.html">HuffPost UK</a>).</li>
</ul>
</li>
<li><p><strong>NASA Metric Conversion Error</strong>:</p>
<ul>
<li>NASA's $125 million Mars Climate Orbiter was lost in 1999 due to a metric conversion error. Engineers at Lockheed Martin used imperial units instead of metric units, leading to the spacecraft's incorrect trajectory and subsequent destruction in the Martian atmosphere (<a target="_blank" href="https://www.washingtonpost.com/wp-srv/national/longterm/space/stories/orbiter100199.htm#:~:text=NASA's%20Mars%20Climate%20Orbiter%20was,Martian%20surface%2C%20investigators%20said%20yesterday.">Washington Post</a>).</li>
</ul>
</li>
</ul>
]]></description><link>https://thediligentengineer.com/the-art-of-asking-smarter-questions</link><guid isPermaLink="true">https://thediligentengineer.com/the-art-of-asking-smarter-questions</guid><category><![CDATA[#PromptEngineering]]></category><category><![CDATA[prompting]]></category><dc:creator><![CDATA[The Diligent Engineer]]></dc:creator></item><item><title><![CDATA[Recap: Optimization, Fault Tolerance and Distributed Transactions with Node.js GraphQL Servers]]></title><description><![CDATA[<p>As I wrap up my journey at Groove Technology, I had the opportunity to share insights and advancements in optimizing and ensuring fault tolerance in Node.js GraphQL servers. My presentation, held on June 27, 2024, encapsulated the hard work and dedication of the team, highlighting key strategies and implementations that have significantly enhanced server performance and reliability.</p>
<h4 id="heading-introduction"><strong>Introduction</strong></h4>
<p>Pushing the boundaries of what's possible with Node.js and GraphQL has always been a priority. As many know, Node.js is a powerful, open-source, cross-platform JavaScript runtime environment, while GraphQL is a versatile data query and manipulation language for APIs. These tools have been central to the tech stack, enabling the building of efficient and scalable applications.</p>
<p><strong>Optimization</strong> and <strong>fault tolerance</strong> have been guiding principles. Optimization focuses on refining software to achieve maximum efficiency and performance, while fault tolerance ensures systems remain robust and reliable even in the face of failures.</p>
<h4 id="heading-optimization-techniques"><strong>Optimization Techniques</strong></h4>
<p>During the presentation, I delved into several key optimization techniques implemented in projects:</p>
<ol>
<li><p><strong>GraphQL Field Loader</strong>:</p>
<ul>
<li>This tool has been a game-changer, optimizing data fetching by batching queries and minimizing database round-trips. By conditionally executing the field loader based on client queries, significant reductions in latency and overall improvements in server performance have been achieved.</li>
</ul>
</li>
<li><p><strong>Cancelable GraphQL Resolver/Circuit Breaker</strong>:</p>
<ul>
<li>To enhance the reliability of GraphQL resolvers, the CancelableCircuitBreaker class was introduced. This allows for the management of timeouts, handling client disconnections, and incorporating external cancellation signals. These features prevent long-running operations from hanging and efficiently manage resources during request cancellations.</li>
</ul>
</li>
<li><p><strong>Observer Pattern</strong>:</p>
<ul>
<li>Re-implemented using vanilla TypeScript, the Observer pattern has helped manage multiple value emitters more effectively, showcasing the power and flexibility of TypeScript in the development process.</li>
</ul>
</li>
<li><p><strong>Converting CPU Intensive Tasks to Node.js C++ Addons</strong>:</p>
<ul>
<li>By offloading CPU-intensive tasks to more efficient C++ code, remarkable improvements in performance have been achieved. The step-by-step process of creating and compiling Node.js C++ addons was a highlight, demonstrating the tangible benefits of this approach.</li>
</ul>
</li>
</ol>
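The batching idea behind the field loader (item 1 above) can be sketched in a few lines. This is a hedged, DataLoader-style illustration, not the team's actual implementation — the class name and batch function are hypothetical: keys requested during the same tick are collected and resolved with a single batched fetch, cutting N round-trips down to one.

```typescript
// A batch function fetches many values for many keys in one round-trip.
type BatchFn<K, V> = (keys: K[]) => Promise<V[]>;

class FieldLoader<K, V> {
  private queue: { key: K; resolve: (v: V) => void }[] = [];
  private scheduled = false;

  constructor(private batchFn: BatchFn<K, V>) {}

  // Each caller gets its own promise; keys accumulate until the
  // current tick's resolvers have all enqueued, then one batch runs.
  load(key: K): Promise<V> {
    return new Promise((resolve) => {
      this.queue.push({ key, resolve });
      if (!this.scheduled) {
        this.scheduled = true;
        queueMicrotask(() => this.flush());
      }
    });
  }

  private async flush(): Promise<void> {
    const batch = this.queue;
    this.queue = [];
    this.scheduled = false;
    // One fetch for every key collected this tick.
    const values = await this.batchFn(batch.map((item) => item.key));
    batch.forEach((item, i) => item.resolve(values[i]));
  }
}

// Usage: three resolver calls in the same tick become one round-trip.
async function demo(): Promise<void> {
  let roundTrips = 0;
  const loader = new FieldLoader<number, string>(async (ids) => {
    roundTrips++; // counts simulated database round-trips
    return ids.map((id) => `user:${id}`);
  });
  const users = await Promise.all([loader.load(1), loader.load(2), loader.load(3)]);
  console.log(users, "round-trips:", roundTrips);
}
demo();
```

In a GraphQL server the batch function would typically issue one `WHERE id IN (...)` query for all keys requested by sibling resolvers.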
<h4 id="heading-fault-tolerance-mechanisms"><strong>Fault Tolerance Mechanisms</strong></h4>
<p>Ensuring that systems remain operational and reliable in the face of unexpected issues is critical. Here are the fault tolerance mechanisms discussed:</p>
<ol>
<li><p><strong>Graceful Shutdown</strong>:</p>
<ul>
<li>This technique ensures ongoing operations are completed, connections are closed gracefully, and data integrity is maintained. It's a crucial practice for minimizing downtime and avoiding data loss during server shutdowns.</li>
</ul>
</li>
<li><p><strong>Distributed Transactions</strong>:</p>
<ul>
<li><p>Managing operations across multiple data repositories can be complex. Two main strategies were explored:</p>
<ul>
<li><p><strong>Two-Phase Commit (2PC)</strong>: Ensures atomicity across multiple databases or services. While it guarantees consistency, it can lead to blocking issues and scalability concerns.</p>
</li>
<li><p><strong>Saga Pattern</strong>: An alternative approach that manages distributed transactions asynchronously. It breaks down transactions into smaller, independently manageable parts, providing better scalability and gracefully handling failures.</p>
</li>
</ul>
</li>
</ul>
</li>
<li><p><strong>Distributed Locks with Redis</strong>:</p>
<ul>
<li>To synchronize access to shared resources in a distributed system, Redis-based locking mechanisms were implemented, ensuring smooth and coordinated resource management.</li>
</ul>
</li>
</ol>
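The Saga pattern from item 2 above can be sketched as a list of steps, each pairing an action with a compensation. This is a hedged, minimal orchestration sketch (the step names and shapes are hypothetical, not the production code): when a step fails, the steps that already succeeded are compensated in reverse order, restoring consistency without a blocking two-phase commit.

```typescript
// One saga step: a forward action plus the compensation that undoes it.
interface SagaStep {
  name: string;
  action: () => Promise<void>;
  compensate: () => Promise<void>;
}

// Run steps in order; on the first failure, undo completed steps in
// reverse order. Returns an event log describing what happened.
async function runSaga(steps: SagaStep[]): Promise<string[]> {
  const log: string[] = [];
  const done: SagaStep[] = [];
  for (const step of steps) {
    try {
      await step.action();
      log.push(`ok:${step.name}`);
      done.push(step);
    } catch {
      log.push(`fail:${step.name}`);
      for (const prev of done.reverse()) {
        await prev.compensate();
        log.push(`undo:${prev.name}`);
      }
      break;
    }
  }
  return log;
}

// Example: payment succeeds, shipping fails, so payment is compensated.
runSaga([
  { name: "payment", action: async () => {}, compensate: async () => {} },
  { name: "shipping", action: async () => { throw new Error("no courier"); }, compensate: async () => {} },
]).then((log) => console.log(log)); // ["ok:payment", "fail:shipping", "undo:payment"]
```

Real sagas usually persist this log and drive steps through a message broker so the orchestration survives process crashes.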
<p><strong>Links and References</strong>:</p>
<ul>
<li><p><a target="_blank" href="https://dzone.com/articles/distributed-lock-implementation-with-redis">Distributed Lock Implementation with Redis</a></p>
</li>
<li><p><a target="_blank" href="https://www.geeksforgeeks.org/two-phase-commit-protocol-distributed-transaction-management/">Two-Phase Commit Protocol</a></p>
</li>
<li><p><a target="_blank" href="https://learn.microsoft.com/en-us/azure/architecture/reference-architectures/saga/saga">Saga Pattern Reference Architecture</a></p>
</li>
<li><p><a target="_blank" href="https://gist.github.com/lyluongthien/4dceaa3e589939da554b70633331428c">Codes on Gist</a></p>
</li>
<li><p><a target="_blank" href="https://www.slideshare.net/slideshow/optimization-and-fault-tolerance-in-distributed-transaction-with-node-js-graphql-servers/269936567">Slideshare</a></p>
</li>
</ul>
<h4 id="heading-disclaimer"><strong>Disclaimer</strong></h4>
<p>This article is partially generated by GPT-4o.</p>
<h4 id="heading-conclusion-and-reflection"><strong>Conclusion and Reflection</strong></h4>
<p>As I move on from Groove Technology, I'm filled with a sense of pride and accomplishment. The advancements in optimizing and ensuring the fault tolerance of Node.js GraphQL servers are a testament to the hard work and ingenuity of the team. This presentation was not just a summary of technical achievements but a celebration of the collaborative spirit and innovation that define Groove Technology.</p>
<p>I'm grateful for the experiences and opportunities I've had here and excited for the future. I look forward to seeing how these implementations continue to evolve and contribute to the success of Groove Technology.</p>
<p>Thank you to everyone who attended the presentation and for your continued support and enthusiasm. Let's keep pushing the boundaries and making great things happen!</p>
<p>Warm regards,</p>
<p><strong>Kai</strong></p>
]]></description><link>https://thediligentengineer.com/recap-optimization-fault-tolerance-and-distributed-transactions-with-nodejs-graphql-servers</link><guid isPermaLink="true">https://thediligentengineer.com/recap-optimization-fault-tolerance-and-distributed-transactions-with-nodejs-graphql-servers</guid><category><![CDATA[fault tolerance]]></category><category><![CDATA[distributed system]]></category><category><![CDATA[transactions]]></category><category><![CDATA[Node.js]]></category><category><![CDATA[GraphQL]]></category><category><![CDATA[presentations]]></category><dc:creator><![CDATA[The Diligent Engineer]]></dc:creator></item><item><title><![CDATA[Recap: Integrate Email Testing Service: Inbucket from Development to Integration Testing]]></title><description><![CDATA[<p>On June 26, 2023, during my final working week at Groove Technology, I had the pleasure of delivering a presentation on integrating Inbucket into development workflows for comprehensive email testing. For those who couldn't attend, here's a detailed recap of the key points I covered in the presentation.</p>
<p><strong>What is Inbucket?</strong></p>
<p>Inbucket is a powerful, open-source email testing tool that allows developers to view the emailed output of their applications quickly and efficiently. Unlike hosted services such as Mailinator, Inbucket can run locally on your own private network or desktop, providing a secure and customizable solution for email testing needs.</p>
<p><strong>Why Inbucket?</strong></p>
<p>Inbucket offers numerous benefits that make it an essential tool for modern development workflows:</p>
<ul>
<li><p><strong>No Per-Account Setup:</strong> Inbucket creates mailboxes on the fly as mail is received, eliminating the need for manual setup.</p>
</li>
<li><p><strong>Built-in Servers:</strong> With built-in SMTP and POP3 servers, Inbucket stores incoming mail as flat files on disk, requiring no external SMTP or database services.</p>
</li>
<li><p><strong>User-Friendly Web Interface:</strong> Easily view and manage emails through a comprehensive web interface.</p>
</li>
<li><p><strong>Open Source:</strong> Written in Go and Elm, Inbucket is open-source software released under the MIT License, allowing for extensive customization and community support.</p>
</li>
</ul>
<p><strong>Fun fact:</strong> <a target="_blank" href="https://supabase.com/docs/guides/cli/testing-and-linting"><strong><em>Supabase</em></strong></a> <strong><em>is using Inbucket for their local development CLI</em></strong></p>
<p><strong>Presentation Highlights</strong></p>
<p>During the presentation, I covered a range of topics to help the audience understand and implement Inbucket in their development and testing environments:</p>
<ol>
<li><p><strong>Introduction to Inbucket</strong></p>
<ul>
<li><p>Overview of its features and benefits</p>
</li>
<li><p>Comparison with other email testing services</p>
</li>
</ul>
</li>
<li><p><strong>Technical Overview</strong></p>
<ul>
<li><p>Architecture and APIv1</p>
</li>
<li><p>How Inbucket stores emails</p>
</li>
</ul>
</li>
<li><p><strong>Implementation in Software Development</strong></p>
<ul>
<li><p>Step-by-step setup and installation</p>
</li>
<li><p>Integration with development servers and an internal project</p>
</li>
</ul>
</li>
<li><p><strong>Demo: Integrating Inbucket with a Development Server (to an internal project)</strong></p>
<ul>
<li><p>Real-time demonstration of setup and integration</p>
</li>
<li><p>Sending test emails and viewing them in the Inbucket web interface</p>
</li>
</ul>
</li>
<li><p><strong>Demo: Utilizing Inbucket in Integration Testing</strong></p>
<ul>
<li><p>Integrating Inbucket into testing workflow</p>
</li>
<li><p>Automating tests and improving email verification processes</p>
</li>
<li><p>Additional e2e setup</p>
</li>
</ul>
</li>
<li><p><strong>Demo: Writing an Inbucket client that supports real-time email observer</strong></p>
<ul>
<li><p>Opensource at: <a target="_blank" href="http://github.com/groovetch/inbucket-gt-client"><strong>github.com/groovetch/inbucket-gt-client</strong></a></p>
</li>
<li><p>NPM package: <a target="_blank" href="https://www.npmjs.com/package/@cute-me-on-repos/inbucket-gt-client"><strong>npmjs.com/package/@cute-me-on-repos/inbucket-gt-client</strong></a></p>
</li>
</ul>
</li>
</ol>
<p><strong>Why Inbucket is Essential</strong></p>
<p>I emphasized the importance of efficient email testing in modern development workflows. With Inbucket, developers can streamline their email testing processes, ensuring email functionalities are thoroughly tested and verified before deployment. This leads to improved quality control and a more reliable end product.</p>
<p><strong>Conclusion</strong></p>
<p>Delivering this presentation was a fitting end to my tenure at Groove Technology, leaving my colleagues with practical knowledge and tools to enhance their email testing processes. I am excited to see how Inbucket will be integrated into our workflows and look forward to the improvements it will bring.</p>
<p>Stay tuned for more updates and resources on Inbucket and other innovative tools to optimize your development and testing workflows!</p>
<p>Links:</p>
<ul>
<li><p>Inbucket: <a target="_blank" href="https://inbucket.org/">https://inbucket.org</a></p>
</li>
<li><p>Inbucket Github: <a target="_blank" href="https://github.com/inbucket">https://github.com/inbucket</a></p>
</li>
<li><p>My Opensource Inbucket Client: <a target="_blank" href="http://github.com/cute-me-on-repos/inbucket-gt-client"><strong>github.com/cute-me-on-repos/inbucket-gt-client</strong></a></p>
</li>
<li><p>My NPM package: <a target="_blank" href="https://www.npmjs.com/package/@cute-me-on-repos/inbucket-gt-client"><strong>npmjs.com/package/@cute-me-on-repos/inbucket-gt-client</strong></a></p>
</li>
<li><p>Slideshare: <a target="_blank" href="https://www.slideshare.net/slideshow/integrate-email-testing-service-inbucket-from-development-to-integration-testing/269897336">https://www.slideshare.net/slideshow/integrate-email-testing-service-inbucket-from-development-to-integration-testing/269897336</a></p>
</li>
</ul>
]]></description><link>https://thediligentengineer.com/recap-inbucket-presentation</link><guid isPermaLink="true">https://thediligentengineer.com/recap-inbucket-presentation</guid><category><![CDATA[inbucket]]></category><category><![CDATA[mailserver]]></category><category><![CDATA[integration]]></category><dc:creator><![CDATA[The Diligent Engineer]]></dc:creator></item><item><title><![CDATA[Introduction to the new thread-safe event package for Golang]]></title><description><![CDATA[<p>Events are a common pattern in many applications, especially those that involve user interactions, asynchronous operations, or communication between different components. Events allow you to decouple the logic of the event source from the logic of the event listeners, making your code more modular, reusable, and testable.</p>
<p>However, managing events and listeners in Go can be tricky, especially when concurrency is involved. You need to ensure that your event channels are properly created, closed, and buffered, and that your listeners are registered and unregistered correctly. You also need to handle any errors or panics that may occur during the event processing.</p>
<p>That's why we created the new <a target="_blank" href="https://pkg.go.dev/github.com/lltpkg/event">event</a> package, a simple and thread-safe mechanism for managing events and listeners in Go applications. The event package provides a high-level API that abstracts away the low-level details of creating and managing event channels and listeners. It also handles any errors or panics gracefully, ensuring that your application does not crash or leak resources.</p>
<h2 id="heading-installation">Installation</h2>
<p>To use the event package in your Go project, you can use the following go get command:</p>
<pre><code class="lang-bash">go get -u github.com/lltpkg/event
</code></pre>
<h2 id="heading-quick-start">Quick start</h2>
<p>To get started with the event package, you need to import it in your Go file:</p>
<pre><code class="lang-go"><span class="hljs-keyword">import</span> <span class="hljs-string">"github.com/lltpkg/event"</span>
</code></pre>
<p>Then, you can create an event channel and a listener for any event name you want. For example, let's create an event channel and a listener for the "Greeting" event:</p>
<pre><code class="lang-go"><span class="hljs-keyword">package</span> main

<span class="hljs-keyword">import</span> (
<span class="hljs-string">"fmt"</span>

<span class="hljs-string">"github.com/lltpkg/event"</span>
)

<span class="hljs-function"><span class="hljs-keyword">func</span> <span class="hljs-title">main</span><span class="hljs-params">()</span></span> {
<span class="hljs-comment">// Create an event channel and a cleanup function for the "Greeting" event</span>
evChan, cleanup := event.EventChannel(<span class="hljs-string">"Greeting"</span>)
<span class="hljs-comment">// Make sure to call the cleanup function when done</span>
<span class="hljs-keyword">defer</span> cleanup()

<span class="hljs-comment">// Listen for the "Greeting" event in a goroutine</span>
<span class="hljs-keyword">go</span> <span class="hljs-function"><span class="hljs-keyword">func</span><span class="hljs-params">()</span></span> {
<span class="hljs-comment">// Receive the event data from the channel</span>
receivedData := &lt;-evChan
<span class="hljs-comment">// Print the greeting message</span>
fmt.Println(<span class="hljs-string">"Hello,"</span>, receivedData)
}()

<span class="hljs-comment">// Trigger the "Greeting" event with some data</span>
event.FireEvent(<span class="hljs-string">"Greeting"</span>, <span class="hljs-string">"World"</span>)
}
</code></pre>
<p>If you run this code, you should see the following output:</p>
<pre><code class="lang-txt">Hello, World
</code></pre>
<p>As you can see, the event package makes it easy to create and use event channels and listeners in Go. You don't need to worry about creating, closing, or buffering the channels, or registering or unregistering the listeners. The event package takes care of all that for you.</p>
<h2 id="heading-usage">Usage</h2>
<h2 id="heading-creating-events-and-listeners">Creating Events and Listeners</h2>
<p>The event package allows you to create named events and associate listeners with them. You can use the <code>EventChannel</code> function to create an event channel and a cleanup function for any event name:</p>
<pre><code class="lang-go"><span class="hljs-comment">// Create an event channel and a cleanup function for "exampleEvent"</span>
eventChan, cleanup := event.EventChannel(<span class="hljs-string">"exampleEvent"</span>)
<span class="hljs-comment">// Make sure to call the cleanup function when done</span>
<span class="hljs-keyword">defer</span> cleanup()
</code></pre>
<p>The event channel is a <code>chan interface{}</code> that receives the event data whenever the event is triggered. The cleanup function is a <code>func()</code> that closes the event channel and unregisters the listener from the event. You should always call the cleanup function when you are done with the event channel; otherwise you may leak resources or cause deadlocks.</p>
<p>You can create as many event channels and listeners as you want for the same or different event names. The event package will ensure that each listener receives the event data in a thread-safe manner.</p>
<h2 id="heading-triggering-events">Triggering Events</h2>
<p>You can trigger events using the FireEvent function. This function allows you to send data to all registered listeners for a specific event name:</p>
<pre><code class="lang-go"><span class="hljs-comment">// Trigger the "exampleEvent" with some data</span>
event.FireEvent(<span class="hljs-string">"exampleEvent"</span>, <span class="hljs-string">"event data"</span>)
</code></pre>
<p>The data can be any value that implements the <code>interface{}</code> type. The <code>FireEvent</code> function will send the data to all the event channels that are listening for the event name. The function will also handle any errors or panics that may occur during the event processing, and log them using the standard log package.</p>
<p>You can trigger events from anywhere in your code, as long as you import the event package. The <code>FireEvent</code> function is thread-safe and non-blocking, so you can use it in concurrent or asynchronous contexts.</p>
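To make the fan-out and thread-safety guarantees concrete, here is a hedged, stdlib-only sketch of the mechanism the package describes (the real package's internals may well differ): a mutex-guarded registry of named channels, where firing an event delivers the data to every subscriber.

```go
package main

import (
	"fmt"
	"sync"
)

// bus is a simplified event registry: event name -> subscriber channels.
// The mutex makes Subscribe and Fire safe to call from many goroutines.
type bus struct {
	mu        sync.Mutex
	listeners map[string][]chan interface{}
}

// Subscribe returns a channel for the event and a cleanup function that
// unregisters and closes it — mirroring EventChannel's contract.
func (b *bus) Subscribe(name string) (<-chan interface{}, func()) {
	b.mu.Lock()
	defer b.mu.Unlock()
	ch := make(chan interface{}, 1) // buffered so Fire does not block
	b.listeners[name] = append(b.listeners[name], ch)
	cleanup := func() {
		b.mu.Lock()
		defer b.mu.Unlock()
		for i, c := range b.listeners[name] {
			if c == ch {
				b.listeners[name] = append(b.listeners[name][:i], b.listeners[name][i+1:]...)
				close(ch)
				return
			}
		}
	}
	return ch, cleanup
}

// Fire sends data to every channel subscribed to the event name.
func (b *bus) Fire(name string, data interface{}) {
	b.mu.Lock()
	defer b.mu.Unlock()
	for _, ch := range b.listeners[name] {
		ch <- data
	}
}

func main() {
	b := &bus{listeners: map[string][]chan interface{}{}}
	ch1, done1 := b.Subscribe("Greeting")
	ch2, done2 := b.Subscribe("Greeting")
	defer done1()
	defer done2()

	// One Fire reaches both subscribers.
	b.Fire("Greeting", "World")
	fmt.Println("Hello,", <-ch1)
	fmt.Println("Hello,", <-ch2)
}
```

The real package adds error and panic handling on top of this core shape; the sketch only shows why a cleanup function per subscription is needed to avoid leaking channels.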
<h2 id="heading-conclusion">Conclusion</h2>
<p>We hope that you find the event package useful and easy to use. If you have any feedback, suggestions, or issues, please feel free to open an issue or a pull request on the <a target="_blank" href="https://github.com/lltpkg/event">GitHub repository</a>. We appreciate any contributions that make the event package more robust and versatile.</p>
]]></description><link>https://thediligentengineer.com/introduction-to-the-new-thread-safe-event-package-for-golang</link><guid isPermaLink="true">https://thediligentengineer.com/introduction-to-the-new-thread-safe-event-package-for-golang</guid><category><![CDATA[Go Language]]></category><category><![CDATA[package]]></category><dc:creator><![CDATA[The Diligent Engineer]]></dc:creator></item><item><title><![CDATA[A note on TypeScript discriminated unions]]></title><description><![CDATA[<p>Recently I started using discriminated unions a lot in TypeScript.
This technique allows us to write more type-safe code, for instance:</p>
<pre><code class="lang-typescript"><span class="hljs-keyword">interface</span> Bird {
  name: <span class="hljs-string">"bird"</span>,
  fly(): <span class="hljs-built_in">void</span>;
  layEggs(): <span class="hljs-built_in">void</span>;
}

<span class="hljs-keyword">interface</span> Fish {
  name: <span class="hljs-string">"fish"</span>,
  swim(): <span class="hljs-built_in">void</span>;
  layEggs(): <span class="hljs-built_in">void</span>;
}

<span class="hljs-keyword">declare</span> <span class="hljs-function"><span class="hljs-keyword">function</span> <span class="hljs-title">getSmallPet</span>(<span class="hljs-params"></span>): <span class="hljs-title">Fish</span> | <span class="hljs-title">Bird</span></span>;

<span class="hljs-keyword">let</span> pet = getSmallPet();
</code></pre>
<p>This works:</p>
<pre><code class="lang-js">pet.layEggs();
</code></pre>
<p>But not this:</p>
<pre><code class="lang-js">pet.swim(); 
<span class="hljs-comment">// Errors in code:</span>
<span class="hljs-comment">// Property 'swim' does not exist on type 'Bird | Fish'.</span>
<span class="hljs-comment">//   Property 'swim' does not exist on type 'Bird'.</span>
</code></pre>
<p>This forces us to add additional checks to pass the compiler error:</p>
<pre><code class="lang-typescript"><span class="hljs-keyword">if</span>(pet.name === <span class="hljs-string">'fish'</span>){
  pet.swim(); <span class="hljs-comment">// this will work</span>
}
</code></pre>
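The same narrowing works in a <code>switch</code>, and a <code>never</code>-typed default adds a compile-time exhaustiveness check. The sketch below is a hedged extension of the example above (the <code>move</code> function and object literals are mine, not from the handbook):

```typescript
interface Bird {
  name: "bird";
  fly(): void;
  layEggs(): void;
}

interface Fish {
  name: "fish";
  swim(): void;
  layEggs(): void;
}

type Pet = Bird | Fish;

function move(pet: Pet): string {
  switch (pet.name) {
    case "bird":
      pet.fly(); // narrowed to Bird here
      return "flew";
    case "fish":
      pet.swim(); // narrowed to Fish here
      return "swam";
    default: {
      // If a new member is added to Pet and not handled above,
      // this assignment stops compiling — a free exhaustiveness check.
      const exhaustive: never = pet;
      return exhaustive;
    }
  }
}

const fish: Fish = { name: "fish", swim() {}, layEggs() {} };
console.log(move(fish)); // "swam"
```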
<p> See more on <a target="_blank" href="https://www.typescriptlang.org/docs/handbook/2/narrowing.html#discriminated-unions">typescriptlang.org/docs</a> </p>
]]></description><link>https://thediligentengineer.com/a-note-on-typescript-discrimination-unions</link><guid isPermaLink="true">https://thediligentengineer.com/a-note-on-typescript-discrimination-unions</guid><category><![CDATA[TypeScript]]></category><dc:creator><![CDATA[The Diligent Engineer]]></dc:creator></item><item><title><![CDATA[Write an O(m*n) LeetCode solution of find-and-replace-pattern that beats 100.00% of users with TypeScript [54ms]]]></title><description><![CDATA[<blockquote>
<p>Try to solve the problem yourself before reading this article: <a target="_blank" href="https://leetcode.com/problems/find-and-replace-pattern">leetcode.com/problems/find-and-replace-pattern</a></p>
</blockquote>
<h1 id="heading-intuition">Intuition</h1>

<p><strong>Thoughts</strong>:</p>
<ul>
<li>Focus on matching patterns, not exact characters. The problem doesn't require matching exact characters, but rather identifying words that follow the same pattern as the given pattern string.</li>
<li>Character consistency is crucial. A word matches the pattern if we can consistently replace each character in the pattern with a unique character in the word, and vice versa.</li>
</ul>
<h1 id="heading-approach">Approach</h1>

<ol>
<li><p>Transform Words and Pattern:</p>
<ul>
<li>Create a function <code>wordToId</code> that transforms a word into a pattern-like string:<ul>
<li>It assigns a unique integer ID to each distinct character in the word.</li>
<li>It joins these IDs using hyphens to create a pattern-like representation.</li>
</ul>
</li>
<li>Apply <code>wordToId</code> to both the <code>pattern</code> and each word in the <code>words</code> array.</li>
</ul>
</li>
<li><p>Identify Matching Words:</p>
<ul>
<li>Iterate through each word in the <code>words</code> array.</li>
<li>If the transformed pattern-like string of the word matches the transformed <code>pattern</code>, add the original word to the <code>result</code> array.</li>
</ul>
</li>
<li><p>Return Matching Words:</p>
<ul>
<li>Return the <code>result</code> array containing the words that match the pattern.</li>
</ul>
</li>
</ol>
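To make step 1 concrete, here is a small, hedged illustration of the transform (a simplified stand-in, not the final solution code below): each distinct character is numbered by order of first appearance and the numbers are joined with hyphens, so words with the same "shape" get the same id.

```typescript
// Map each distinct character to the order it first appears,
// then join the ids with hyphens: "abb" and "mee" share a shape.
function wordToId(w: string): string {
  const seen: Record<string, number> = {};
  let next = 0;
  return w
    .split("")
    .map((ch) => {
      if (!(ch in seen)) seen[ch] = next++;
      return seen[ch];
    })
    .join("-");
}

console.log(wordToId("abb")); // "0-1-1"
console.log(wordToId("mee")); // "0-1-1"  (same shape as "abb")
console.log(wordToId("abc")); // "0-1-2"
```

A word matches the pattern exactly when the two pattern-like strings are equal, which is the comparison the full solution performs.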
<h1 id="heading-complexity">Complexity</h1>
<ul>
<li>Time complexity: O(n * m), where n is the number of words in the <code>words</code> array and m is the length of each word (and the <code>pattern</code>). This is due to iterating through each word and transforming it character by character.</li>
<li>Space complexity: O(m), where m is the length of the words and pattern, excluding the output array (which can hold up to O(n * m) characters). The space used is primarily for the hash map in <code>wordToId</code> and the transformed strings.</li>
</ul>
<h1 id="heading-flow">Flow</h1>
<pre><code class="lang-mermaid">graph TD
subgraph "findAndReplacePattern function"
    Start("Start") --&gt; Transform["Transform pattern using wordToId"]
    Transform --&gt; Iterate["Iterate through words array"]
    Iterate --"Next word"--&gt; CheckLength{"Word length matches pattern?"}
    Iterate --"All words processed"--&gt; End("End")
    CheckLength -- No --&gt; Iterate
    CheckLength -- Yes --&gt; Transform2["Transform word using wordToId"]
    Transform2 --"Compare transformed word with pattern"--&gt; Compare{"Matches pattern?"}
    Compare -- No --&gt; Iterate
    Compare -- Yes --&gt; AddToResult["Add word to result array"]
    AddToResult --&gt; Iterate
end
</code></pre>
<h1 id="heading-code">Code</h1>
<h2 id="heading-typescript">TypeScript</h2>
<pre><code class="lang-typescript"><span class="hljs-function"><span class="hljs-keyword">function</span> <span class="hljs-title">findAndReplacePattern</span>(<span class="hljs-params">words: <span class="hljs-built_in">string</span>[], pattern: <span class="hljs-built_in">string</span></span>): <span class="hljs-title">string</span>[] </span>{
  <span class="hljs-keyword">const</span> result: <span class="hljs-built_in">string</span>[] = [];
  <span class="hljs-keyword">const</span> wordToId = (<span class="hljs-function">(<span class="hljs-params">w: <span class="hljs-built_in">string</span></span>) =&gt;</span> {
    <span class="hljs-keyword">const</span> idGetter = {
      hMap: {} <span class="hljs-keyword">as</span> Record&lt;<span class="hljs-built_in">string</span>, <span class="hljs-built_in">number</span>&gt;,
      id: <span class="hljs-number">0</span>,
      getId() {
        <span class="hljs-keyword">return</span> ++<span class="hljs-built_in">this</span>.id
      },
    }
    <span class="hljs-keyword">return</span> [...w].map(<span class="hljs-function"><span class="hljs-params">c</span> =&gt;</span> {
      <span class="hljs-keyword">if</span> (idGetter.hMap[c]) {
        <span class="hljs-keyword">return</span> idGetter.hMap[c]
      } <span class="hljs-keyword">else</span> {
        idGetter.hMap[c] = idGetter.getId()
        <span class="hljs-keyword">return</span> idGetter.hMap[c]
      }
    }).join(<span class="hljs-string">'-'</span>)
  })
  <span class="hljs-keyword">const</span> p = wordToId(pattern)
  words.forEach(<span class="hljs-function"><span class="hljs-params">w</span> =&gt;</span> {
    <span class="hljs-keyword">if</span> (w.length !== pattern.length) <span class="hljs-keyword">return</span>
    <span class="hljs-keyword">const</span> id = wordToId(w) 
    <span class="hljs-keyword">if</span> (p === id) {
      result.push(w)
    }
  })

  <span class="hljs-keyword">return</span> result
};
</code></pre>
<h2 id="heading-rust">Rust</h2>
<pre><code class="lang-rust"><span class="hljs-keyword">use</span> std::collections::HashMap;

<span class="hljs-keyword">impl</span> Solution {
    <span class="hljs-keyword">pub</span> <span class="hljs-function"><span class="hljs-keyword">fn</span> <span class="hljs-title">find_and_replace_pattern</span></span>(words: <span class="hljs-built_in">Vec</span>&lt;<span class="hljs-built_in">String</span>&gt;, pattern: <span class="hljs-built_in">String</span>) -&gt; <span class="hljs-built_in">Vec</span>&lt;<span class="hljs-built_in">String</span>&gt; {
        <span class="hljs-keyword">let</span> <span class="hljs-keyword">mut</span> result = <span class="hljs-built_in">Vec</span>::new();

        <span class="hljs-function"><span class="hljs-keyword">fn</span> <span class="hljs-title">hash_word</span></span>(w: &amp;<span class="hljs-built_in">str</span>) -&gt; <span class="hljs-built_in">String</span> {
            <span class="hljs-keyword">let</span> <span class="hljs-keyword">mut</span> char_to_id = HashMap::new();
            <span class="hljs-keyword">let</span> <span class="hljs-keyword">mut</span> id = <span class="hljs-number">0</span>;
            w.chars().map(|c| {
                *char_to_id.entry(c).or_insert_with(|| { id += <span class="hljs-number">1</span>; id })
            }).map(|id| id.to_string()).collect::&lt;<span class="hljs-built_in">Vec</span>&lt;_&gt;&gt;().join(<span class="hljs-string">"-"</span>)
        }

        <span class="hljs-keyword">let</span> p = hash_word(&amp;pattern);

        <span class="hljs-keyword">for</span> w <span class="hljs-keyword">in</span> words.iter() {
            <span class="hljs-keyword">if</span> w.len() != pattern.len() {
                <span class="hljs-keyword">continue</span>;
            }
            <span class="hljs-keyword">if</span> p == hash_word(w) {
                result.push(w.clone());
            }
        }

        result

    }
}
</code></pre>
<h2 id="heading-golang">Golang</h2>
<pre><code class="lang-golang"><span class="hljs-function"><span class="hljs-keyword">func</span> <span class="hljs-title">findAndReplacePattern</span><span class="hljs-params">(words []<span class="hljs-keyword">string</span>, pattern <span class="hljs-keyword">string</span>)</span> []<span class="hljs-title">string</span></span> {
    result := <span class="hljs-built_in">make</span>([]<span class="hljs-keyword">string</span>, <span class="hljs-number">0</span>)
    hashWord := <span class="hljs-function"><span class="hljs-keyword">func</span><span class="hljs-params">(w <span class="hljs-keyword">string</span>)</span> <span class="hljs-title">string</span></span> {
        hMap := <span class="hljs-keyword">map</span>[<span class="hljs-keyword">rune</span>]<span class="hljs-keyword">int</span>{}
        id := <span class="hljs-number">0</span>
        hashed := <span class="hljs-built_in">make</span>([]<span class="hljs-keyword">string</span>, <span class="hljs-number">0</span>, <span class="hljs-built_in">len</span>(w))
        <span class="hljs-keyword">for</span> _, c := <span class="hljs-keyword">range</span> w {
            <span class="hljs-keyword">if</span> _, ok := hMap[c]; !ok {
                hMap[c] = id
                id++
            }
            hashed = <span class="hljs-built_in">append</span>(hashed, strconv.Itoa(hMap[c]))
        }
        <span class="hljs-keyword">return</span> strings.Join(hashed, <span class="hljs-string">"-"</span>)
    }
    p := hashWord(pattern)
    <span class="hljs-keyword">for</span> _, w := <span class="hljs-keyword">range</span> words {
        <span class="hljs-keyword">if</span> <span class="hljs-built_in">len</span>(w) != <span class="hljs-built_in">len</span>(pattern) {
            <span class="hljs-keyword">continue</span>
        }
        <span class="hljs-keyword">if</span> p == hashWord(w) {
            result = <span class="hljs-built_in">append</span>(result, w)
        }
    }
    <span class="hljs-keyword">return</span> result
}
</code></pre>
<h1 id="heading-reference">Reference</h1>
<p>LeetCode submissions: <a target="_blank" href="https://leetcode.com/problems/find-and-replace-pattern/submissions/1133020282/">TypeScript</a>,  <a target="_blank" href="https://leetcode.com/submissions/detail/1133316594/">Rust</a>,  <a target="_blank" href="https://leetcode.com/submissions/detail/1133315776/">Golang</a>.</p>
]]></description><link>https://thediligentengineer.com/write-a-find-and-replace-pattern-leetcode-solution-that-beats-10000-of-users-with-typescript-54ms</link><guid isPermaLink="true">https://thediligentengineer.com/write-a-find-and-replace-pattern-leetcode-solution-that-beats-10000-of-users-with-typescript-54ms</guid><category><![CDATA[leetcode]]></category><category><![CDATA[TypeScript]]></category><category><![CDATA[Rust]]></category><category><![CDATA[Go Language]]></category><dc:creator><![CDATA[The Diligent Engineer]]></dc:creator></item><item><title><![CDATA[Inlining in Go]]></title><description><![CDATA[<p>Inlining is the Go compiler's substitution of a function call with the body of the called function, directly at the call site. In the early days of computing this optimization was often done by hand; today it is one of the essential optimizations performed automatically during compilation.</p>
<h2 id="heading-the-mechanism">The Mechanism</h2>
<p>Function Identification: The compiler scans your code, identifying functions suitable for inlining. It favors small functions with simple bodies whose estimated cost falls under the compiler's inlining budget.</p>
<p>Eliminating Overhead: Instead of generating a separate function call, the compiler replaces the call with the function's code, removing the overhead of:</p>
<ul>
<li>Jumping to a different memory location</li>
<li>Passing arguments</li>
<li>Returning results</li>
</ul>
<p>Code Integration: The inlined function's code is seamlessly integrated into the caller's code, creating a single, streamlined block.</p>
<h2 id="heading-advantages-of-inlining">Advantages of Inlining</h2>
<ul>
<li>Performance Boost:<ul>
<li>Reduced function call overhead</li>
<li>Enhanced optimization opportunities</li>
</ul>
</li>
<li>Code Size Reduction: Elimination of redundant code in some cases</li>
<li>Improved Cache Utilization: Better spatial locality of code and data</li>
</ul>
<pre><code class="lang-mermaid">flowchart TB

    X[1. Start compile]
    X--&gt;SC
    F[Executable bin] 

    subgraph SC[Source Code main.go]

        subgraph A[Function main]
            D[function main Body]
        end  

        subgraph B["function a()"]
            C["function a() body"]
        end 
        A--2. Call(1) --&gt;B
        C--3. Append (inlining)--&gt;D

    end
     SC--4. build --&gt; F
</code></pre>
<h2 id="heading-example">Example</h2>
<pre><code class="lang-go"><span class="hljs-function"><span class="hljs-keyword">func</span> <span class="hljs-title">add</span><span class="hljs-params">(x, y <span class="hljs-keyword">int</span>)</span> <span class="hljs-title">int</span></span> {
    <span class="hljs-keyword">return</span> x + y
}

<span class="hljs-function"><span class="hljs-keyword">func</span> <span class="hljs-title">main</span><span class="hljs-params">()</span></span> {
    result := add(<span class="hljs-number">5</span>, <span class="hljs-number">3</span>) <span class="hljs-comment">// Function call</span>
    <span class="hljs-comment">// ...</span>
}
</code></pre>
<p>After inlining:</p>
<pre><code class="lang-go"><span class="hljs-function"><span class="hljs-keyword">func</span> <span class="hljs-title">main</span><span class="hljs-params">()</span></span> {
    result := <span class="hljs-number">5</span> + <span class="hljs-number">3</span> <span class="hljs-comment">// Inlined code</span>
    <span class="hljs-comment">// ...</span>
}
</code></pre>
<h2 id="heading-the-go-compilers-inlining-decisions">The Go Compiler's Inlining Decisions</h2>
<ul>
<li>Function Size: Small, focused functions are more likely to be inlined.</li>
<li>Call Frequency: Frequently called functions are prime candidates.</li>
<li>Loop Presence: Functions containing loops are less likely to be inlined.</li>
<li>Closures and Recursion: These complexities can hinder inlining.</li>
</ul>
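<p>You can see these decisions for yourself: the <code>-m</code> compiler flag prints inlining and escape-analysis diagnostics during a build. For the <code>add</code> example above, a build such as <code>go build -gcflags="-m" main.go</code> typically reports that <code>add</code> can be inlined:</p>

```go
// main.go — build with: go build -gcflags="-m" main.go
// Expected diagnostics include lines like "can inline add"
// and "inlining call to add".
package main

import "fmt"

func add(x, y int) int {
	return x + y
}

func main() {
	fmt.Println(add(5, 3)) // after inlining, roughly fmt.Println(5 + 3)
}
```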
<h2 id="heading-key-considerations">Key Considerations</h2>
<ul>
<li>Over-inlining: This can lead to larger code size and potential cache misses.</li>
<li>Readability: Excessive inlining can impact code clarity.</li>
<li>Benchmarking: Measure performance gains to ensure effectiveness.</li>
</ul>
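<p>Benchmarking makes the trade-off measurable. One way to isolate the effect is to pin a copy of the function out of inlining with the <code>//go:noinline</code> directive and compare the two (a sketch; absolute numbers vary by machine, and the gap for a trivial function like this is small):</p>

```go
package main

import (
	"fmt"
	"testing"
)

func add(x, y int) int { return x + y }

// The directive below forbids the compiler from inlining this copy.
//go:noinline
func addNoInline(x, y int) int { return x + y }

// sink keeps the results observable so the calls are not optimized away.
var sink int

func main() {
	inlined := testing.Benchmark(func(b *testing.B) {
		for i := 0; i < b.N; i++ {
			sink = add(i, i)
		}
	})
	notInlined := testing.Benchmark(func(b *testing.B) {
		for i := 0; i < b.N; i++ {
			sink = addNoInline(i, i)
		}
	})
	fmt.Println("inlined:    ", inlined)
	fmt.Println("not inlined:", notInlined)
}
```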
<h2 id="heading-conclusion">Conclusion</h2>
<p>Function inlining is a potent optimization tool in Go, but its judicious application is crucial. Understanding its mechanics and trade-offs empowers you to write efficient and maintainable Go code. By carefully considering function design, profiling performance, and leveraging compiler hints when necessary, you can effectively harness the power of inlining for optimal code execution.</p>
]]></description><link>https://thediligentengineer.com/inlining-in-go</link><guid isPermaLink="true">https://thediligentengineer.com/inlining-in-go</guid><category><![CDATA[Go Language]]></category><category><![CDATA[compiler]]></category><dc:creator><![CDATA[The Diligent Engineer]]></dc:creator></item><item><title><![CDATA[PostgreSQL-to-Elasticsearch synchronization: Logstash with JDBC input plugin Vs. PGSync]]></title><description><![CDATA[<p>After hours of implementing the PostgreSQL-to-Elasticsearch synchronization service in many different ways, my team and I at <a target="_blank" href="https://www.linkedin.com/company/groove-technology">GT</a> finally decided to use <a target="_blank" href="https://pgsync.com/">PGSync</a>.
Here's a comparison of Logstash with the JDBC input plugin and PGSync for PostgreSQL-to-Elasticsearch synchronization:</p>
<div class="hn-table">
<table>
<thead>
<tr>
<td>Factor</td><td>Logstash with JDBC Input Plugin</td><td>PGSync</td></tr>
</thead>
<tbody>
<tr>
<td>Overview</td><td>General-purpose data pipeline tool with a JDBC plugin for database integration</td><td>Specialized tool for PostgreSQL replication to Elasticsearch</td></tr>
<tr>
<td>Integration</td><td>Connects to various databases, not PostgreSQL-specific</td><td>Optimized for PostgreSQL, leveraging its logical decoding feature</td></tr>
<tr>
<td>Configuration</td><td>Requires defining pipelines and filters for data processing</td><td>Configuration is simpler, focused on replication settings</td></tr>
<tr>
<td>Data Transformation</td><td>Offers extensive filtering and transformation capabilities within pipelines</td><td>Limited to basic filtering and mapping</td></tr>
<tr>
<td>Performance</td><td>Can be slower for large-scale replication due to overhead of pipeline processing</td><td>Generally faster due to direct replication without intermediate processing</td></tr>
<tr>
<td>Scalability</td><td>Can be horizontally scaled by adding more Logstash nodes</td><td>Limited to vertical scaling of a single PGSync instance</td></tr>
<tr>
<td>Error Handling</td><td>Provides mechanisms for retrying failed events and handling errors</td><td>Less robust error handling mechanisms</td></tr>
<tr>
<td>Monitoring</td><td>Integrates with monitoring tools for pipeline visibility</td><td>Limited monitoring capabilities</td></tr>
<tr>
<td>Maintenance</td><td>Requires managing Logstash and its dependencies</td><td>Simpler setup and maintenance</td></tr>
</tbody>
</table>
</div><p>Choosing the right tool depends on your specific needs:</p>
<ul>
<li>If you require extensive data transformation or integration with other data sources, Logstash is a good choice.</li>
<li>If you need high-performance replication with minimal overhead and a PostgreSQL-specific focus, PGSync is a better option.</li>
<li>Consider factors like scalability, error handling, monitoring, and maintenance requirements when deciding.</li>
</ul>
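<p>To give a sense of the configuration gap: PGSync is driven by a single JSON schema file that maps tables to an Elasticsearch index. A minimal sketch (database, index, and column names here are placeholders — consult the PGSync documentation for the exact schema accepted by your version):</p>

```json
[
  {
    "database": "app_db",
    "index": "books",
    "nodes": {
      "table": "book",
      "schema": "public",
      "columns": ["isbn", "title", "description"]
    }
  }
]
```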
<p>Additional considerations:</p>
<ul>
<li>Logstash offers more flexibility for custom data processing and integration.</li>
<li>PGSync is typically more efficient for large-scale replication.</li>
<li>Both tools can be used in conjunction for more complex scenarios.</li>
</ul>
<p>Recommendations:</p>
<ul>
<li>For basic, high-performance replication, PGSync is often the preferred choice.</li>
<li>For more complex data processing and integration needs, Logstash provides greater capabilities.</li>
<li>Evaluate your specific use case and requirements to determine the best tool.</li>
</ul>
]]></description><link>https://thediligentengineer.com/postgresql-to-elasticsearch-synchronization-logstash-with-jdbc-input-plugin-vs-pgsync</link><guid isPermaLink="true">https://thediligentengineer.com/postgresql-to-elasticsearch-synchronization-logstash-with-jdbc-input-plugin-vs-pgsync</guid><category><![CDATA[elasticsearch]]></category><category><![CDATA[elk]]></category><dc:creator><![CDATA[The Diligent Engineer]]></dc:creator></item><item><title><![CDATA[Streaming Large Files over TCP]]></title><description><![CDATA[<p>When dealing with large files in development, the traditional approach of reading the entire file into memory can lead to memory-related issues. To overcome this challenge, streaming becomes a crucial concept. Streaming involves sending files in smaller, manageable chunks rather than all at once. In this article, we will explore how to implement a file server in Golang that can efficiently stream large files over TCP connections.</p>
<p><img src="https://cdn.hashnode.com/res/hashnode/image/upload/v1703314200867/af224a8a-8929-4558-bb97-fed4cd7d67a6.jpeg" alt="cover image" /></p>
<h2 id="heading-setting-the-stage">Setting the Stage</h2>
<p>Let's begin by understanding the importance of streaming and the pitfalls of loading large files into memory. Loading large files at once can lead to memory exhaustion, especially in scenarios where the file size exceeds available system resources.</p>
<h2 id="heading-building-a-simple-file-server">Building a Simple File Server</h2>
<p>To implement a file server in Golang, we'll leverage the <code>net</code> package for handling network connections and the <code>io</code> package for reading and writing data. Additionally, we'll use a buffer to efficiently store and manage the streaming data.</p>
<pre><code class="lang-go"><span class="hljs-comment">// File Server</span>

<span class="hljs-comment">// Listen for incoming connections</span>
listener, err := net.Listen(<span class="hljs-string">"tcp"</span>, <span class="hljs-string">":8080"</span>)
<span class="hljs-keyword">if</span> err != <span class="hljs-literal">nil</span> {
    log.Fatal(err)
}

<span class="hljs-keyword">for</span> {
    <span class="hljs-comment">// Accept connection</span>
    conn, err := listener.Accept()
    <span class="hljs-keyword">if</span> err != <span class="hljs-literal">nil</span> {
        log.Fatal(err)
    }

    <span class="hljs-comment">// Handle connection concurrently</span>
    <span class="hljs-keyword">go</span> handleConnection(conn)
}

<span class="hljs-comment">// Handle Connection</span>
<span class="hljs-function"><span class="hljs-keyword">func</span> <span class="hljs-title">handleConnection</span><span class="hljs-params">(conn net.Conn)</span></span> {
    <span class="hljs-keyword">defer</span> conn.Close()

    <span class="hljs-comment">// Open the file</span>
    file, err := os.Open(<span class="hljs-string">"largefile.dat"</span>)
    <span class="hljs-keyword">if</span> err != <span class="hljs-literal">nil</span> {
        log.Fatal(err)
    }
    <span class="hljs-keyword">defer</span> file.Close()

    <span class="hljs-comment">// Create a buffer for streaming</span>
    buffer := <span class="hljs-built_in">make</span>([]<span class="hljs-keyword">byte</span>, <span class="hljs-number">1024</span>)

    <span class="hljs-comment">// Read and stream file data</span>
    <span class="hljs-keyword">for</span> {
        bytesRead, err := file.Read(buffer)
        <span class="hljs-keyword">if</span> bytesRead &gt; <span class="hljs-number">0</span> {
            conn.Write(buffer[:bytesRead])
        }
        <span class="hljs-keyword">if</span> err == io.EOF {
            <span class="hljs-keyword">break</span>
        }
        <span class="hljs-keyword">if</span> err != <span class="hljs-literal">nil</span> {
            log.Fatal(err)
        }
    }
}
</code></pre>
<h2 id="heading-implementing-the-client">Implementing the Client</h2>
<p>Now that we have our file server, let's create a client program that sends a file to the server. We'll use the <code>net</code> and <code>io</code> packages, along with a buffer for efficient data transfer.</p>
<pre><code class="lang-go"><span class="hljs-comment">// File Client</span>

<span class="hljs-comment">// Dial the server</span>
conn, err := net.Dial(<span class="hljs-string">"tcp"</span>, <span class="hljs-string">"localhost:8080"</span>)
<span class="hljs-keyword">if</span> err != <span class="hljs-literal">nil</span> {
    log.Fatal(err)
}
<span class="hljs-keyword">defer</span> conn.Close()

<span class="hljs-comment">// Open the file to be sent</span>
file, err := os.Open(<span class="hljs-string">"largefile.dat"</span>)
<span class="hljs-keyword">if</span> err != <span class="hljs-literal">nil</span> {
    log.Fatal(err)
}
<span class="hljs-keyword">defer</span> file.Close()

<span class="hljs-comment">// Create a buffer for streaming</span>
buffer := <span class="hljs-built_in">make</span>([]<span class="hljs-keyword">byte</span>, <span class="hljs-number">1024</span>)

<span class="hljs-comment">// Read and send file data</span>
<span class="hljs-keyword">for</span> {
    bytesRead, err := file.Read(buffer)
    <span class="hljs-keyword">if</span> bytesRead &gt; <span class="hljs-number">0</span> {
        conn.Write(buffer[:bytesRead])
    }
    <span class="hljs-keyword">if</span> err == io.EOF {
        <span class="hljs-keyword">break</span>
    }
    <span class="hljs-keyword">if</span> err != <span class="hljs-literal">nil</span> {
        log.Fatal(err)
    }
}
</code></pre>
<h2 id="heading-handling-unknown-file-sizes">Handling Unknown File Sizes</h2>
<p>A common challenge in file streaming is not knowing the size of the file in advance. To address this, we can send the file size as part of the data, allowing the server to anticipate the incoming data and allocate resources accordingly.</p>
<pre><code class="lang-go"><span class="hljs-comment">// Sending File Size</span>

<span class="hljs-comment">// Get file size</span>
fileInfo, err := file.Stat()
<span class="hljs-keyword">if</span> err != <span class="hljs-literal">nil</span> {
    log.Fatal(err)
}

<span class="hljs-comment">// Convert file size to bytes</span>
fileSize := <span class="hljs-built_in">make</span>([]<span class="hljs-keyword">byte</span>, <span class="hljs-number">8</span>)
binary.BigEndian.PutUint64(fileSize, <span class="hljs-keyword">uint64</span>(fileInfo.Size()))

<span class="hljs-comment">// Send file size to the server</span>
conn.Write(fileSize)
</code></pre>
<h2 id="heading-reading-file-size-on-the-server-side">Reading File Size on the Server Side</h2>
<p>On the server side, we need to read the file size from the incoming data to allocate the necessary resources.</p>
<pre><code class="lang-go"><span class="hljs-comment">// Reading File Size on Server Side</span>

<span class="hljs-comment">// Create a buffer for receiving file size</span>
fileSizeBuffer := <span class="hljs-built_in">make</span>([]<span class="hljs-keyword">byte</span>, <span class="hljs-number">8</span>)

<span class="hljs-comment">// Read exactly 8 bytes of file size from the client</span>
<span class="hljs-comment">// (io.ReadFull guards against short reads on the socket)</span>
_, err := io.ReadFull(conn, fileSizeBuffer)
<span class="hljs-keyword">if</span> err != <span class="hljs-literal">nil</span> {
    log.Fatal(err)
}

<span class="hljs-comment">// Convert bytes to uint64</span>
fileSize := binary.BigEndian.Uint64(fileSizeBuffer)
</code></pre>
<h2 id="heading-bringing-it-all-together">Bringing It All Together</h2>
<p>To complete the process, we can now integrate the techniques discussed to stream a large file. This involves sending the file size and subsequently streaming the data in manageable chunks.</p>
<pre><code class="lang-go"><span class="hljs-comment">// Streaming a Large File</span>

<span class="hljs-comment">// ... (Previous code for setting up connection and handling file)</span>

<span class="hljs-comment">// Send file size to the server</span>
conn.Write(fileSize)

<span class="hljs-comment">// Read and stream file data</span>
<span class="hljs-keyword">for</span> {
    bytesRead, err := file.Read(buffer)
    <span class="hljs-keyword">if</span> bytesRead &gt; <span class="hljs-number">0</span> {
        conn.Write(buffer[:bytesRead])
    }
    <span class="hljs-keyword">if</span> err == io.EOF {
        <span class="hljs-keyword">break</span>
    }
    <span class="hljs-keyword">if</span> err != <span class="hljs-literal">nil</span> {
        log.Fatal(err)
    }
}
</code></pre>
<p>Sequence diagram</p>
<pre><code class="lang-mermaid">sequenceDiagram
    participant Client
    participant Server
    Client-&gt;&gt;Server: Initiate TCP connection
    Server-&gt;&gt;Client: Accept TCP connection
    loop Until file is fully transferred
        Client-&gt;&gt;Client: Read file chunk
        Client-&gt;&gt;Server: Send chunk header (size, sequence number, etc.)
        Server-&gt;&gt;Client: Acknowledge chunk header
        Client-&gt;&gt;Server: Send chunk data
        Server-&gt;&gt;Client: Acknowledge chunk data
        Server-&gt;&gt;Server: Write chunk to file
    end
    Client-&gt;&gt;Server: Send file transfer completion signal
    Server-&gt;&gt;Client: Acknowledge completion
    Server-&gt;&gt;Client: Close TCP connection
    Client-&gt;&gt;Server: Close TCP connection
</code></pre>
<ol>
<li>Initiate TCP connection: The client starts the process by establishing a TCP connection with the server.</li>
<li>Accept TCP connection: The server accepts the incoming connection request from the client.</li>
<li>Read file chunk: The client reads a portion of the file into a buffer (chunk).</li>
<li>Send chunk header: The client sends information about the chunk (size, sequence number, etc.) to the server.</li>
<li>Acknowledge chunk header: The server confirms receipt of the chunk header.</li>
<li>Send chunk data: The client sends the actual chunk data to the server.</li>
<li>Acknowledge chunk data: The server confirms receipt of the chunk data.</li>
<li>Write chunk to file: The server writes the received chunk to the destination file.</li>
<li>Loop until completion: Steps 3-8 repeat until the entire file has been transferred.</li>
<li>Send completion signal: The client sends a signal to the server indicating that the file transfer is complete.</li>
<li>Acknowledge completion: The server acknowledges the completion signal.</li>
<li>Close TCP connection: Both the client and server close the TCP connection.</li>
</ol>
<h3 id="heading-key-points">Key Points:</h3>
<ul>
<li>Chunking: The file is divided into smaller chunks for efficient transmission and potential retransmission if errors occur.</li>
<li>Headers: Chunk headers contain metadata about the chunk, such as its size and sequence number, for proper reassembly at the server.</li>
<li>Acknowledgments: Server acknowledgments ensure reliable transfer and allow for retransmission if necessary.</li>
<li>Completion signal: The completion signal marks the end of the file transfer process.</li>
<li>TCP connection management: The TCP connection is established, maintained, and closed to ensure reliable data transfer.</li>
</ul>
<h2 id="heading-conclusion">Conclusion</h2>
<p>In this article, we explored the challenges associated with handling large files in Golang and demonstrated how to implement a file server capable of streaming files over TCP connections. Leveraging the <code>net</code> and <code>io</code> packages, along with effective buffer usage, ensures efficient and reliable large file transfers. By sending the file size as a length prefix ahead of the data, we address the issue of not knowing the file size in advance, enabling seamless and predictable streaming.</p>
]]></description><link>https://thediligentengineer.com/streaming-large-files-over-tcp</link><guid isPermaLink="true">https://thediligentengineer.com/streaming-large-files-over-tcp</guid><category><![CDATA[golang]]></category><category><![CDATA[Go Language]]></category><category><![CDATA[streaming]]></category><dc:creator><![CDATA[The Diligent Engineer]]></dc:creator></item></channel></rss>