Three ways to implement python multithreading concurrent processing

Tag: python's spoof technique Optimal number of threads Ncpu = number of CPUs Ucpu = target CPU usage W/C = ratio of waiting time to calculation ...
Optimal number of threads
Line city
Method 1: use threadpool module
Method 2: use the concurrent.futures module
Method 3: use the vthread module

Tag: python's spoof technique

Optimal number of threads

  • Ncpu = number of CPUs
  • Ucpu = target CPU usage
  • W/C = ratio of waiting time to calculation time

In order to maintain the expected utilization of the processor, the optimal thread pool size is equal to
$$Nthreads=Ncpu*Ucpu*(1+W/C$$

  • cpu intensive tasks, that is, $W < < C $, then $W/C ≈ 0 $, then $Nthreads=Ncpu*Ucpu$

If you want 100% CPU utilization, $Nthreads=Ncpu$

  • IO intensive tasks, that is, most of the time the system interacts with I/O, and this time thread does not occupy the CPU for processing, that is, in this time range, other threads can use the CPU, so some more threads can be configured.
  • Hybrid tasks, both of which take up a certain amount of time

Line city

For the program with increasing number of tasks, each task will generate a thread, which will eventually cause the number of threads out of control. For programs with increasing tasks, a fixed number of threads pool is necessary.

Method 1: use threadpool module

threadpool is an older module, supporting py2 and py3.

import threadpool import time def sayhello (a): print("hello: "+a) time.sleep(2) def main(): global result seed=["a","b","c"] start=time.time() task_pool=threadpool.ThreadPool(5) requests=threadpool.makeRequests(sayhello,seed) for req in requests: task_pool.putRequest(req) task_pool.wait() end=time.time() time_m = end-start print("time: "+str(time_m)) start1=time.time() for each in seed: sayhello(each) end1=time.time() print("time1: "+str(end1-start1)) if __name__ == '__main__': main(

Method 2: use the concurrent.futures module

from concurrent.futures import ThreadPoolExecutor import time def sayhello(a): print("hello: "+a) time.sleep(2) def main(): seed=["a","b","c"] start1=time.time() for each in seed: sayhello(each) end1=time.time() print("time1: "+str(end1-start1)) start2=time.time() with ThreadPoolExecutor(3) as executor: for each in seed: executor.submit(sayhello,each) end2=time.time() print("time2: "+str(end2-start2)) start3=time.time() with ThreadPoolExecutor(3) as executor1: executor1.map(sayhello,seed) end3=time.time() print("time3: "+str(end3-start3)) if __name__ == '__main__': main()

Method 3: use the vthread module

Reference resources: https://pypi.org/project/vthr...

demo1

import vthread @vthread.pool(6) def some(a,b,c): import time;time.sleep(1) print(a+b+c) for i in range(10): some(i,i,i)

demo2: grouping thread pool

import vthread pool_1 = vthread.pool(5,gqueue=1) # open a threadpool with 5 threads named 1 pool_2 = vthread.pool(2,gqueue=2) # open a threadpool with 2 threads named 2 @pool_1 def foolfunc1(num): time.sleep(1) print(f"foolstring1, test3 foolnumb1:") @pool_2 def foolfunc2(num): time.sleep(1) print(f"foolstring2, test3 foolnumb2:") @pool_2 def foolfunc3(num): time.sleep(1) print(f"foolstring3, test3 foolnumb3:") for i in range(10): foolfunc1(i) for i in range(4): foolfunc2(i) for i in range(2): foolfunc3(i)

3 December 2019, 05:53 | Views: 5092

Add new comment

For adding a comment, please log in
or create account

0 comments