A thread of execution is the smallest sequence of programmed instructions that can be managed independently by the operating system. The implementation of threads and processes differs between operating systems, but in most cases a thread is a component of a process.
- Multiple threads can exist within one process
- They execute concurrently and share resources such as memory
- Different processes do not share these resources
- The threads of a process share its executable code and the values of its variables at any given time
So, to sum up
- Processes are typically independent, while threads exist as subsets of a process
- Processes carry considerably more state information than threads, whereas multiple threads within a process share process state as well as memory and other resources
- Processes have separate address spaces, whereas threads share their address space
- Processes interact only through system-provided inter-process communication mechanisms
- Context switching between threads in the same process is typically faster than context switching between processes.
Single threading means (obviously) executing one single thread at a time. Multithreading allows multiple threads to exist within the context of one process. These threads share the process's resources, but are able to execute independently.
- Responsiveness: multithreading can allow an application to remain responsive to input.
- In a one-thread program, if the main execution thread blocks on a long-running task, the entire application can appear to freeze.
- Moving the task to a worker thread, it is possible for the application to remain responsive to user input while executing tasks in the background
- Note: most cases multithreading is not the only way to keep a program responsive, with non-blocking I/O and/or Unix signals being available for gaining similar results
- Faster execution: multithreaded program allow faster operation on computer systems that have multiple CPUs or one or more multi-core processors, or across a cluster of machines
- works only if the threads are independent, i.e. do not have to wait for one another
- Lower resource consumption: using threads, an application can serve multiple clients concurrently using fewer resources than it would need when using multiple process copies of itself.
- For example, the Apache HTTP server uses thread pools: a pool of listener threads for listening to incoming requests, and a pool of server threads for processing those requests.
- Better system utilization: as an example, a file system using multiple threads can achieve higher throughput and lower latency since data in a faster medium (such as cache memory) can be retrieved by one thread while another thread retrieves data from a slower medium (such as external storage) with neither thread waiting for the other to finish.
- Simplified sharing and communication: unlike processes, which require a message passing or shared memory mechanism to perform inter-process communication (IPC), threads can communicate through data, code and files they already share.
- Parallelization: applications looking to use multicore or multi-CPU systems can use multithreading to split data and tasks into parallel subtasks and let the underlying architecture manage how the threads run, either concurrently on one core or in parallel on multiple cores.
- Synchronization: since threads share the same address space, the programmer must be careful to avoid race conditions and other non-intuitive behaviors.
- In order for data to be correctly manipulated, threads will often need to rendezvous in time in order to process the data in the correct order.
- Threads may also require mutually exclusive operations (often implemented using
mutexes
) in order to prevent common data from being simultaneously modified or read while in the process of being modified
- Thread crashes a process: an illegal operation performed by a thread crashes the entire process; therefore, one misbehaving thread can disrupt the processing of all the other threads in the application.
{% callout "Race conditions and mutexes" %} Threads in the same process share the same address space. This allows concurrently running code to efficiently exchange data without the overhead or complexity of an IPC. When shared between threads, however, even simple data structures become prone to race conditions if they require more than one CPU instruction to update: two threads may end up attempting to update the data structure at the same time and find it unexpectedly changing underfoot. Bugs caused by race conditions can be very difficult to reproduce and isolate.
Let's look at this in practice and assume that two threads want to increment the value of a global integer variable by one. Ideally, the following sequence of operations would take place
However, if the two threads run simultaneously without locking or synchronization, the outcome of the operation could be wrong
To prevent this, threading application programming interfaces (APIs) offer synchronization primitives such as
mutexes
to lock data structures against concurrent access.
Mutex is a portmanteau of 'mutual exclusion': one thread of execution never enter its critical section at the same time that another concurrent thread of execution enters its own critical section.
- On uniprocessor systems, a thread running into a locked mutex must sleep and hence trigger a context switch
- On multi-processor systems, the thread may instead poll the mutex in a
spinlock
which intermittently checks if the mutex has been released
Several mutexes exist, one that you often see
- Locks (usually just called
mutex
), behaving as described above - Semaphores: simple data types. Consider a variable A and a boolean variable S. A is only accessed when S is marked true. Thus, S is a semaphore for A
- and many more (but we are not computer scientists)
Mutexes make your code safe for race conditions, but as a trade-off, they can lead to deadlocks. A deadlock is a state in which each member of a group is waiting for some other member to take action, such as sending a message or more commonly releasing a lock.
Conversely, one can also run into a livelock, where the states of the processes involved in the livelock constantly change with regard to one another, none progressing.
Most OSs cannot prevent deadlocks!
{% endcallout %}
Hopefully, by now, you have a good idea of what multi threading and multi processing entails, what are the benefits, and what are the difficulties. Let's write our own little multi threader program now!