Producing a summary ("pivot"?) table

邵骁

2023-03-14

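Question:

I would like a way to summarise a database table so that rows sharing a common ID are combined into a single row of output.

My tools are SQLite and Python 2.x.

For example, given the following table of fruit prices at my local supermarkets…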

+--------------------+--------------------+--------------------+
|Fruit               |Shop                |Price               |
+--------------------+--------------------+--------------------+
|Apple               |Coles               |$1.50               |
|Apple               |Woolworths          |$1.60               |
|Apple               |IGA                 |$1.70               |
|Banana              |Coles               |$0.50               |
|Banana              |Woolworths          |$0.60               |
|Banana              |IGA                 |$0.70               |
|Cherry              |Coles               |$5.00               |
|Date                |Coles               |$2.00               |
|Date                |Woolworths          |$2.10               |
|Elderberry          |IGA                 |$10.00              |
+--------------------+--------------------+--------------------+

…I want to produce a summary table showing the price of each fruit at each supermarket. Blank cells should be NULL.

+----------+----------+----------+----------+
|Fruit     |Coles     |Woolworths|IGA       |
+----------+----------+----------+----------+
|Apple     |$1.50     |$1.60     |$1.70     |
|Banana    |$0.50     |$0.60     |$0.70     |
|Cherry    |$5.00     |NULL      |NULL      |
|Date      |$2.00     |$2.10     |NULL      |
|Elderberry|NULL      |NULL      |$10.00    |
+----------+----------+----------+----------+

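I believe the literature calls this a "pivot table" or a "pivot query", but apparently SQLite does not support PIVOT. (The solution to that question used hard-coded LEFT JOINs. That does not really appeal to me, because I do not know the "column" names in advance.)

Right now I do this by iterating through the whole table in Python and accumulating a dict of dicts, which is a bit clunky. I am open to better solutions, in either Python or SQLite, that give the data in tabular form.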
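As a point of reference, the dict-of-dicts accumulation described above might look something like the following minimal sketch. The function name pivot_rows and the abbreviated sample rows are assumptions made for illustration, not the asker's actual code.

```python
from collections import defaultdict

# Abbreviated sample rows shaped like the fruit table: (fruit, shop, price)
rows = [('Apple', 'Coles', 1.50),
        ('Apple', 'Woolworths', 1.60),
        ('Cherry', 'Coles', 5.00),
        ('Elderberry', 'IGA', 10.00)]


def pivot_rows(rows):
    """Accumulate {fruit: {shop: price}} and the sorted list of shops seen.

    pivot_rows is a hypothetical helper name used only for this sketch.
    """
    table = defaultdict(dict)
    shops = set()
    for fruit, shop, price in rows:
        table[fruit][shop] = price
        shops.add(shop)
    return dict(table), sorted(shops)


table, shops = pivot_rows(rows)
print('Fruit'.ljust(12) + '\t'.join(shops))
for fruit in sorted(table):
    # dict.get returns None for a shop that has no price for this fruit
    print(fruit.ljust(12) + '\t'.join(str(table[fruit].get(s)) for s in shops))
```

The column names are discovered while iterating, so nothing about the shops has to be known in advance. Each print call takes a single string, so the sketch should behave the same on Python 2.x and Python 3.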

Answer:

On the Python side, you could use some itertools magic to rearrange your data:

data = [('Apple',      'Coles',      1.50),
        ('Apple',      'Woolworths', 1.60),
        ('Apple',      'IGA',        1.70),
        ('Banana',     'Coles',      0.50),
        ('Banana',     'Woolworths', 0.60),
        ('Banana',     'IGA',        0.70),
        ('Cherry',     'Coles',      5.00),
        ('Date',       'Coles',      2.00),
        ('Date',       'Woolworths', 2.10),
        ('Elderberry', 'IGA',        10.00)]

from itertools import groupby, islice
from operator import itemgetter
from collections import defaultdict

stores = sorted(set(row[1] for row in data))
# for each fruit, build a {store: price} mapping that returns None for missing stores
pivot = ((fruit, defaultdict(lambda: None, (islice(row, 1, None) for row in rows)))
         for fruit, rows in groupby(sorted(data), itemgetter(0)))

print 'Fruit'.ljust(12), '\t'.join(stores)
for fruit, prices in pivot:
    print fruit.ljust(12), '\t'.join(str(prices[s]) for s in stores)

Output:

Fruit        Coles      IGA     Woolworths
Apple        1.5        1.7     1.6
Banana       0.5        0.7     0.6
Cherry       5.0        None    None
Date         2.0        None    2.1
Elderberry   None       10.0    None
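The question also leaves the door open to an SQLite-side solution. One possible approach, sketched here as an alternative rather than as part of the answer above, is conditional aggregation: read the distinct shop names first, then build one MAX(CASE ...) column per shop. The table name prices, its column names, and the helper pivot_query are all assumptions for illustration.

```python
import sqlite3

data = [('Apple', 'Coles', 1.50), ('Apple', 'Woolworths', 1.60),
        ('Apple', 'IGA', 1.70), ('Banana', 'Coles', 0.50),
        ('Banana', 'Woolworths', 0.60), ('Banana', 'IGA', 0.70),
        ('Cherry', 'Coles', 5.00), ('Date', 'Coles', 2.00),
        ('Date', 'Woolworths', 2.10), ('Elderberry', 'IGA', 10.00)]

# Hypothetical schema: the real table and column names will differ.
conn = sqlite3.connect(':memory:')
conn.execute('CREATE TABLE prices (fruit TEXT, shop TEXT, price REAL)')
conn.executemany('INSERT INTO prices VALUES (?, ?, ?)', data)


def pivot_query(conn):
    """Return (shops, rows) with one output column per distinct shop."""
    shops = [r[0] for r in
             conn.execute('SELECT DISTINCT shop FROM prices ORDER BY shop')]
    # One aggregate column per shop. The shop value is bound as a parameter;
    # only the column alias is interpolated, with double quotes escaped.
    cols = ', '.join(
        'MAX(CASE WHEN shop = ? THEN price END) AS "%s"' % s.replace('"', '""')
        for s in shops)
    sql = 'SELECT fruit, %s FROM prices GROUP BY fruit ORDER BY fruit' % cols
    return shops, conn.execute(sql, shops).fetchall()


shops, rows = pivot_query(conn)
print('Fruit'.ljust(12) + '\t'.join(shops))
for row in rows:
    print(row[0].ljust(12) + '\t'.join(str(v) for v in row[1:]))
```

A CASE expression with no ELSE branch yields NULL, and MAX ignores NULLs, so a fruit with no price at a given shop comes back as NULL (None in Python). The cost is two queries instead of one, since the column list has to be known before the pivot statement can be built.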
