编程问答

前端开发问题 Java开发问题 C/C++开发问题 Python开发问题 C#/.NET开发问题 php开发问题 移动开发问题 数据库问题

python multiprocessing vs threading for cpu bound work on wi

2023-03-14Python开发问题

本文介绍了python multiprocessing vs threading for cpu bound work on windows and linux的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着跟版网的小编来一起学习吧！

问题描述

所以我敲了一些测试代码，看看多处理模块在 cpu 绑定工作上与线程相比如何扩展.在 linux 上，我得到了预期的性能提升:

So I knocked up some test code to see how the multiprocessing module would scale on cpu bound work compared to threading. On linux I get the performance increase that I'd expect:

linux (dual quad core xeon):
serialrun took 1192.319 ms
parallelrun took 346.727 ms
threadedrun took 2108.172 ms

我的双核 macbook pro 显示相同的行为:

My dual core macbook pro shows the same behavior:

osx (dual core macbook pro)
serialrun took 2026.995 ms
parallelrun took 1288.723 ms
threadedrun took 5314.822 ms

然后我在一台windows机器上试了一下，得到了一些非常不同的结果.

I then went and tried it on a windows machine and got some very different results.

windows (i7 920):
serialrun took 1043.000 ms
parallelrun took 3237.000 ms
threadedrun took 2343.000 ms

为什么，为什么，Windows 上的多处理方法这么慢?

Why oh why, is the multiprocessing approach so much slower on windows?

这是测试代码:

#!/usr/bin/env python

import multiprocessing
import threading
import time

def print_timing(func):
    def wrapper(*arg):
        t1 = time.time()
        res = func(*arg)
        t2 = time.time()
        print '%s took %0.3f ms' % (func.func_name, (t2-t1)*1000.0)
        return res
    return wrapper


def counter():
    for i in xrange(1000000):
        pass

@print_timing
def serialrun(x):
    for i in xrange(x):
        counter()

@print_timing
def parallelrun(x):
    proclist = []
    for i in xrange(x):
        p = multiprocessing.Process(target=counter)
        proclist.append(p)
        p.start()

    for i in proclist:
        i.join()

@print_timing
def threadedrun(x):
    threadlist = []
    for i in xrange(x):
        t = threading.Thread(target=counter)
        threadlist.append(t)
        t.start()

    for i in threadlist:
        i.join()

def main():
    serialrun(50)
    parallelrun(50)
    threadedrun(50)

if __name__ == '__main__':
    main()

2024-08-22 Python开发问题

Pandas中的GROUP BY AND SUM不丢失列

Group by and Sum in Pandas without losing columns(Pandas中的GROUP BY AND SUM不丢失列)...

2024-08-22 Python开发问题

pandas 有从特定日期开始的按月分组的方式吗？

Is there a way of group by month in Pandas starting at specific day number?( pandas 有从特定日期开始的按月分组的方式吗？)...

2024-08-22 Python开发问题

GROUP BY+新列+基于条件的前一行抓取值

Group by + New Column + Grab value former row based on conditionals(GROUP BY+新列+基于条件的前一行抓取值)...

2024-08-22 Python开发问题

PANDA中的Groupby算法和插值算法

Groupby and interpolate in Pandas(PANDA中的Groupby算法和插值算法)...

2024-08-22 Python开发问题

PANAS-基于列对行进行分组，并将NaN替换为非空值

Pandas - Group Rows based on a column and replace NaN with non-null values(PANAS-基于列对行进行分组，并将NaN替换为非空值)...

2024-08-22 Python开发问题

相关推荐

在xarray中按单个维度的多个坐标分组

Pandas中的GROUP BY AND SUM不丢失列

pandas 有从特定日期开始的按月分组的方式吗？

GROUP BY+新列+基于条件的前一行抓取值

PANDA中的Groupby算法和插值算法

PANAS-基于列对行进行分组，并将NaN替换为非空值

热门文章

热门精品源码

最新VIP资源

python multiprocessing vs threading for cpu bound work on wi

问题描述

推荐答案

相关推荐

在xarray中按单个维度的多个坐标分组

Pandas中的GROUP BY AND SUM不丢失列

pandas 有从特定日期开始的按月分组的方式吗？

GROUP BY+新列+基于条件的前一行抓取值

PANDA中的Groupby算法和插值算法

PANAS-基于列对行进行分组，并将NaN替换为非空值

热门文章

热门精品源码

最新VIP资源