鱼C论坛

 找回密码
 立即注册
查看: 2886|回复: 5

[技术交流] <标准库> itertools模块

[复制链接]
发表于 2015-3-21 09:35:30 | 显示全部楼层 |阅读模式

马上注册,结交更多好友,享用更多功能^_^

您需要 登录 才可以下载或查看,没有账号?立即注册

x
Python:itertools模块

itertools模块包含创建有效迭代器的函数,可以用各种方式对数据进行循环操作,此模块中的所有函数返回的迭代器都可以与for循环语句以及其他包含迭代器(如生成器和生成器表达式)的函数联合使用。

#########################

chain(iter1, iter2, ..., iterN)
给出一组迭代器(iter1, iter2, ..., iterN),此函数创建一个新迭代器来将所有的迭代器链接起来,返回的迭代器从iter1开始生成项,知道iter1被用完,然后从iter2生成项,这一过程会持续到iterN中所有的项都被用完。
  1. &#160;from itertools import chain
  2. &#160;test = chain('AB', 'CDE', 'F')
  3. &#160;for el in test:
  4. &#160;&#160; &#160; print el
复制代码
  1. A
  2. B
  3. C
  4. D
  5. E
  6. F
复制代码


chain.from_iterable(iterables)
一个备用链构造函数,其中的iterables是一个迭代变量,生成迭代序列,此操作的结果与以下生成器代码片段生成的结果相同:
  1. >>> def f(iterables):
  2. &#160;&#160; &#160;for x in iterables:
  3. &#160;&#160; &#160; &#160; &#160;for y in x:
  4. &#160;&#160; &#160; &#160; &#160; &#160; &#160;yield y

  5. >>> test = f('ABCDEF')
  6. >>> test.next()
  7. 'A'
  8. >>> from itertools import chain
  9. >>> test = chain.from_iterable('ABCDEF')
  10. >>> test.next()
  11. 'A'
复制代码


combinations(iterable, r)
创建一个迭代器,返回iterable中所有长度为r的子序列,返回的子序列中的项按输入iterable中的顺序排序:
  1. >>> from itertools import combinations
  2. >>> test = combinations([1,2,3,4], 2)
  3. >>> for el in test:
  4. &#160;&#160; &#160;print el
  5. &#160;&#160; &#160;&#160; &#160; &#160;
  6. &#160;7 (1, 2)
  7. &#160;8 (1, 3)
  8. &#160;9 (1, 4)
  9. 10 (2, 3)
  10. 11 (2, 4)
  11. 12 (3, 4)
复制代码


count([n])
创建一个迭代器,生成从n开始的连续整数,如果忽略n,则从0开始计算(注意:此迭代器不支持长整数),如果超出了sys.maxint,计数器将溢出并继续从-sys.maxint-1开始计算。


cycle(iterable)
创建一个迭代器,对iterable中的元素反复执行循环操作,内部会生成iterable中的元素的一个副本,此副本用于返回循环中的重复项。


dropwhile(predicate, iterable)
创建一个迭代器,只要函数predicate(item)为True,就丢弃iterable中的项,如果predicate返回False,就会生成iterable中的项和所有后续项。
  1. def dropwhile(predicate, iterable):
  2. &#160;&#160; &#160;# dropwhile(lambda x: x<5, [1,4,6,4,1]) --> 6 4 1
  3. &#160;&#160; &#160;iterable = iter(iterable)
  4. &#160;&#160; &#160;for x in iterable:
  5. &#160;&#160; &#160; &#160; &#160;if not predicate(x):
  6. &#160;&#160; &#160; &#160; &#160; &#160; &#160;yield x
  7. &#160;&#160; &#160; &#160; &#160; &#160; &#160;break
  8. &#160;&#160; &#160;for x in iterable:
  9. &#160;&#160; &#160; &#160; &#160;yield x
复制代码


groupby(iterable[, key])
创建一个迭代器,对iterable生成的连续项进行分组,在分组过程中会查找重复项。
如果iterable在多次连续迭代中生成了同一项,则会定义一个组,如果将此函数应用一个分类列表,那么分组将定义该列表中的所有唯一项,key(如果已提供)是一个函数,应用于每一项,如果此函数存在返回值,该值将用于后续项而不是该项本身进行比较,此函数返回的迭代器生成元素(key, group),其中key是分组的键值,group是迭代器,生成组成该组的所有项。


ifilter(predicate, iterable):
创建一个迭代器,仅生成iterable中predicate(item)为True的项,如果predicate为None,将返回iterable中所有计算为True的项。
ifilter(lambda x: x%2, range(10)) --> 1 3 5 7 9
&#160;

ifilterfalse(predicate, iterable)
创建一个迭代器,仅生成iterable中predicate(item)为False的项,如果predicate为None,则返回iterable中所有计算为False的项。
ifilterfalse(lambda x: x%2, range(10)) --> 0 2 4 6 8
&#160;

imap(function, iter1, iter2, iter3, ..., iterN)
创建一个迭代器,生成项function(i1, i2, ..., iN),其中i1,i2...iN分别来自迭代器iter1,iter2 ... iterN,如果function为None,则返回(i1, i2, ..., iN)形式的元组,只要提供的一个迭代器不再生成值,迭代就会停止。
  1. >>> from itertools import *
  2. >>> d = imap(pow, (2,3,10), (5,2,3))
  3. >>> for i in d: print i
  4. &#160;&#160; &#160;
  5. 32
  6. 9
  7. 1000&#160;&#160;&#160;
  8. >>>
  9. >>> d = imap(pow, (2,3,10), (5,2))
  10. >>> for i in d: print i
  11. &#160;&#160; &#160;
  12. 32
  13. 9
  14. >>>
  15. >>> d = imap(None, (2,3,10), (5,2))
  16. >>> for i in d : print i
  17. &#160;&#160; &#160;
  18. (2, 5)
  19. (3, 2)
复制代码


islice(iterable, [start, ]stop[, step])
创建一个迭代器,生成项的方式类似于切片返回值: iterable[start : stop : step],将跳过前start个项,迭代在stop所指定的位置停止,step指定用于跳过项的步幅。与切片不同,负值不会用于任何start,stop和step,如果省略了start,迭代将从0开始,如果省略了step,步幅将采用1.
  1. def islice(iterable, *args):
  2. &#160;&#160; &#160;# islice('ABCDEFG', 2) --> A B
  3. &#160;&#160; &#160;# islice('ABCDEFG', 2, 4) --> C D
  4. &#160;&#160; &#160;# islice('ABCDEFG', 2, None) --> C D E F G
  5. &#160;&#160; &#160;# islice('ABCDEFG', 0, None, 2) --> A C E G
  6. &#160;&#160; &#160;s = slice(*args)
  7. &#160;&#160; &#160;it = iter(xrange(s.start or 0, s.stop or sys.maxint, s.step or 1))
  8. &#160;&#160; &#160;nexti = next(it)
  9. &#160;&#160; &#160;for i, element in enumerate(iterable):
  10. &#160;&#160; &#160; &#160; &#160;if i == nexti:
  11. &#160;&#160; &#160; &#160; &#160; &#160; &#160;yield element
  12. &#160;&#160; &#160; &#160; &#160; &#160; nexti = next(it)

  13. #If start is None, then iteration starts at zero. If step is None, then the step defaults to one.
  14. 15 #Changed in version 2.5: accept None values for default start and step.
复制代码

&#160;
izip(iter1, iter2, ... iterN)
创建一个迭代器,生成元组(i1, i2, ... iN),其中i1,i2 ... iN 分别来自迭代器iter1,iter2 ... iterN,只要提供的某个迭代器不再生成值,迭代就会停止,此函数生成的值与内置的zip()函数相同。
  1. def izip(*iterables):
  2. &#160;&#160; &#160;# izip('ABCD', 'xy') --> Ax By
  3. &#160;&#160; &#160;iterables = map(iter, iterables)
  4. &#160;&#160; &#160;while iterables:
  5. &#160;&#160; &#160; &#160; &#160;yield tuple(map(next, iterables))
复制代码


izip_longest(iter1, iter2, ... iterN, [fillvalue=None])
与izip()相同,但是迭代过程会持续到所有输入迭代变量iter1,iter2等都耗尽为止,如果没有使用fillvalue关键字参数指定不同的值,则使用None来填充已经使用的迭代变量的值。
  1. def izip_longest(*args, **kwds):
  2. &#160;&#160; &#160; # izip_longest('ABCD', 'xy', fillvalue='-') --> Ax By C- D-
  3. &#160;&#160; &#160; fillvalue = kwds.get('fillvalue')
  4. &#160;&#160; &#160; def sentinel(counter = ([fillvalue]*(len(args)-1)).pop):
  5. &#160;&#160; &#160; &#160; &#160; yield counter() &#160; &#160; &#160; &#160; # yields the fillvalue, or raises IndexError
  6. &#160;&#160; &#160; fillers = repeat(fillvalue)
  7. &#160;&#160; &#160; iters = [chain(it, sentinel(), fillers) for it in args]
  8. &#160;&#160; &#160; try:
  9. &#160;&#160; &#160; &#160; &#160; for tup in izip(*iters):
  10. &#160;&#160; &#160; &#160; &#160; &#160; &#160;yield tup
  11. &#160;&#160; &#160;except IndexError:
  12. &#160;&#160; &#160; &#160; &#160;pass
复制代码


permutations(iterable[, r])
创建一个迭代器,返回iterable中所有长度为r的项目序列,如果省略了r,那么序列的长度与iterable中的项目数量相同:
  1. def permutations(iterable, r=None):
  2. &#160;&#160; &#160; # permutations('ABCD', 2) --> AB AC AD BA BC BD CA CB CD DA DB DC
  3. &#160;&#160; &#160; # permutations(range(3)) --> 012 021 102 120 201 210
  4. &#160;&#160; &#160;pool = tuple(iterable)
  5. &#160;&#160; &#160;n = len(pool)
  6. &#160;&#160; &#160;r = n if r is None else r
  7. &#160;&#160; &#160;if r > n:
  8. &#160;&#160; &#160; &#160; &#160;return
  9. &#160;&#160; &#160; indices = range(n)
  10. &#160;&#160; &#160; cycles = range(n, n-r, -1)
  11. &#160;&#160; &#160; yield tuple(pool[i] for i in indices[:r])
  12. &#160;&#160; &#160; while n:
  13. &#160;&#160; &#160; &#160; &#160; for i in reversed(range(r)):
  14. &#160;&#160; &#160; &#160; &#160; &#160; &#160; cycles[i] -= 1
  15. &#160;&#160; &#160; &#160; &#160; &#160; &#160; if cycles[i] == 0:
  16. &#160;&#160; &#160; &#160; &#160; &#160; &#160; &#160; &#160; indices[i:] = indices[i+1:] + indices[i:i+1]
  17. &#160;&#160; &#160; &#160; &#160; &#160; &#160; &#160; &#160; cycles[i] = n - i
  18. &#160;&#160; &#160; &#160; &#160; &#160; &#160; else:
  19. &#160;&#160; &#160; &#160; &#160; &#160; &#160; &#160; &#160; j = cycles[i]
  20. &#160;&#160; &#160; &#160; &#160; &#160; &#160; &#160; &#160; indices[i], indices[-j] = indices[-j], indices[i]
  21. &#160;&#160; &#160; &#160; &#160; &#160; &#160; &#160; &#160; yield tuple(pool[i] for i in indices[:r])
  22. &#160;&#160; &#160; &#160; &#160; &#160; &#160; &#160; &#160; break
  23. &#160;&#160; &#160; &#160; &#160; else:
  24. &#160;&#160; &#160; &#160; &#160; &#160; &#160; return
复制代码

&#160;
product(iter1, iter2, ... iterN, [repeat=1])
创建一个迭代器,生成表示item1,item2等中的项目的笛卡尔积的元组,repeat是一个关键字参数,指定重复生成序列的次数。
  1. def product(*args, **kwds):
  2. &#160;&#160; &#160;# product('ABCD', 'xy') --> Ax Ay Bx By Cx Cy Dx Dy
  3. &#160;&#160; &#160;# product(range(2), repeat=3) --> 000 001 010 011 100 101 110 111
  4. &#160;&#160; &#160;pools = map(tuple, args) * kwds.get('repeat', 1)
  5. &#160;&#160; &#160;result = [[]]
  6. &#160;&#160; &#160;for pool in pools:
  7. &#160;&#160; &#160; &#160; &#160;result = [x+[y] for x in result for y in pool]
  8. &#160;&#160; &#160;for prod in result:
  9. &#160;&#160; &#160; &#160; &#160;yield tuple(prod)
复制代码


repeat(object[, times])
创建一个迭代器,重复生成object,times(如果已提供)指定重复计数,如果未提供times,将无止尽返回该对象。
  1. def repeat(object, times=None):
  2. &#160;&#160; &#160;# repeat(10, 3) --> 10 10 10
  3. &#160;&#160; &#160;if times is None:
  4. &#160;&#160; &#160; &#160; &#160;while True:
  5. &#160;&#160; &#160; &#160; &#160; &#160; &#160;yield object
  6. &#160;&#160; &#160;else:
  7. &#160;&#160; &#160; &#160; &#160;for i in xrange(times):
  8. &#160;&#160; &#160; &#160; &#160; &#160; &#160;yield object
复制代码


starmap(func[, iterable])
创建一个迭代器,生成值func(*item),其中item来自iterable,只有当iterable生成的项适用于这种调用函数的方式时,此函数才有效。
  1. def starmap(function, iterable):
  2. &#160;&#160; &#160;# starmap(pow, [(2,5), (3,2), (10,3)]) --> 32 9 1000
  3. &#160;&#160; &#160;for args in iterable:
  4. &#160;&#160; &#160; &#160; &#160;yield function(*args)
复制代码


takewhile(predicate[, iterable])
创建一个迭代器,生成iterable中predicate(item)为True的项,只要predicate计算为False,迭代就会立即停止。
  1. def takewhile(predicate, iterable):
  2. &#160;&#160; &#160;# takewhile(lambda x: x<5, [1,4,6,4,1]) --> 1 4
  3. &#160;&#160; &#160;for x in iterable:
  4. &#160;&#160; &#160; &#160; &#160;if predicate(x):
  5. &#160;&#160; &#160; &#160; &#160; &#160; &#160;yield x
  6. &#160;&#160; &#160; &#160; &#160;else:
  7. &#160;&#160; &#160; &#160; &#160; &#160; &#160;break
复制代码

&#160;
tee(iterable[, n])
从iterable创建n个独立的迭代器,创建的迭代器以n元组的形式返回,n的默认值为2,此函数适用于任何可迭代的对象,但是,为了克隆原始迭代器,生成的项会被缓存,并在所有新创建的迭代器中使用,一定要注意,不要在调用tee()之后使用原始迭代器iterable,否则缓存机制可能无法正确工作。
  1. def tee(iterable, n=2):
  2. &#160;&#160; &#160;it = iter(iterable)
  3. &#160;&#160; &#160;deques = [collections.deque() for i in range(n)]
  4. &#160;&#160; &#160;def gen(mydeque):
  5. &#160;&#160; &#160; &#160; &#160;while True:
  6. &#160;&#160; &#160; &#160; &#160; &#160; &#160;if not mydeque: &#160; &#160; &#160; &#160; &#160; &#160; # when the local deque is empty
  7. &#160;&#160; &#160; &#160; &#160; &#160; &#160; &#160; &#160;newval = next(it) &#160; &#160; &#160; # fetch a new value and
  8. &#160;&#160; &#160; &#160; &#160; &#160; &#160; &#160; &#160;for d in deques: &#160; &#160; &#160; &#160;# load it to all the deques
  9. &#160;&#160; &#160; &#160; &#160; &#160; &#160; &#160; &#160; &#160; &#160;d.append(newval)
  10. &#160;&#160; &#160; &#160; &#160; &#160; &#160;yield mydeque.popleft()
  11. &#160;&#160; &#160;return tuple(gen(d) for d in deques)

  12. #Once tee() has made a split, the original iterable should not be used anywhere else; otherwise,&#160;
  13. the iterable could get advanced without the tee objects being informed.
  14. #This itertool may require significant auxiliary storage (depending on how much temporary data needs to be stored).&#160;
  15. In general, if one iterator uses most or all of the data before another iterator starts, it is faster to use list() instead of tee().
复制代码

本帖被以下淘专辑推荐:

想知道小甲鱼最近在做啥?请访问 -> ilovefishc.com
回复

使用道具 举报

发表于 2015-4-14 06:32:30 | 显示全部楼层
不错,看看
想知道小甲鱼最近在做啥?请访问 -> ilovefishc.com
回复 支持 反对

使用道具 举报

发表于 2015-4-17 01:52:43 | 显示全部楼层
顶一下
想知道小甲鱼最近在做啥?请访问 -> ilovefishc.com
回复 支持 反对

使用道具 举报

发表于 2015-4-17 01:53:51 | 显示全部楼层
顶一下
想知道小甲鱼最近在做啥?请访问 -> ilovefishc.com
回复 支持 反对

使用道具 举报

发表于 2015-5-13 01:34:22 | 显示全部楼层
谢谢分享!
想知道小甲鱼最近在做啥?请访问 -> ilovefishc.com
回复 支持 反对

使用道具 举报

发表于 2019-4-25 18:17:37 | 显示全部楼层
学习了
想知道小甲鱼最近在做啥?请访问 -> ilovefishc.com
回复 支持 反对

使用道具 举报

您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

小黑屋|手机版|Archiver|鱼C工作室 ( 粤ICP备18085999号-1 | 粤公网安备 44051102000585号)

GMT+8, 2024-4-28 23:33

Powered by Discuz! X3.4

© 2001-2023 Discuz! Team.

快速回复 返回顶部 返回列表