present unpacked data frame as multi indexed

mentioned in issue #7 (closed)

I've quickly implemented the reduced tuple option: ('FIELD NAME', self.slot, self.link, self.pos) and got the following performance on my MBP 13" fall 2014:

1 loop, best of 5: 3.49 sec per loop

the index conversion can be done as:

events.columns = pd.MultiIndex.from_tuples(events.columns, names=['field_name','slot','link','position'])

and takes

CPU times: user 4 µs, sys: 1 µs, total: 5 µs
Wall time: 8.34 µs
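
For readers who want to try the conversion in isolation, here is a minimal sketch. It assumes the unpacker yields a flat index of 4-tuples as column names; the `events` frame is rebuilt by hand from the first rows of the example table in this issue, so the construction step is illustrative only and not the unpacker's actual code.

```python
import pandas as pd

# Column keys as 4-tuples: (field name, slot, link, position).
tuple_columns = [
    ("Latency", -1, -1, -1),
    ("VFAT HIT", 1, 0, 0),
    ("VFAT HIT", 1, 0, 2),
    ("VFAT HIT", 1, 0, 3),
    ("VFAT HIT", 1, 1, 0),
    ("VFAT HIT", 1, 1, 2),
]

# First three rows of the example table in this issue.
events = pd.DataFrame(
    [
        [8, 7, 13, 69, 18, 97],
        [6, 94, 54, 77, 75, 52],
        [25, 24, 123, 78, 56, 14],
    ]
)

# Keep the tuples as plain (flat) column labels first, to mimic the
# assumed state of the frame before the conversion.
events.columns = pd.Index(tuple_columns, tupleize_cols=False)

# The conversion itself, as quoted above.
events.columns = pd.MultiIndex.from_tuples(
    events.columns, names=["field_name", "slot", "link", "position"]
)

print(events.columns.nlevels)
```

The conversion only relabels the columns; the data values are left untouched.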

Even if pandas.MultiIndex did not exist, using tuples for the data frame column names would be beneficial. It would simplify the unpacker (no need to "patch" the dictionary keys) as well as the data frame usage (no need to build strings for the column names).

Since pandas.MultiIndex exists, slicing/selection is made very easy, as you presented.

Regarding performance, you write:
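
As a rough illustration of the kind of selection this enables, the sketch below rebuilds a small frame from the example table in this issue and applies three standard pandas selections. The variable names (`vfat_only`, `link0`, `subset`) are made up for this example, and the level names follow the `from_tuples` call quoted earlier.

```python
import pandas as pd

columns = pd.MultiIndex.from_tuples(
    [
        ("Latency", -1, -1, -1),
        ("VFAT HIT", 1, 0, 0),
        ("VFAT HIT", 1, 0, 2),
        ("VFAT HIT", 1, 0, 3),
        ("VFAT HIT", 1, 1, 0),
        ("VFAT HIT", 1, 1, 2),
    ],
    names=["field_name", "slot", "link", "position"],
)
events = pd.DataFrame(
    [
        [8, 7, 13, 69, 18, 97],
        [6, 94, 54, 77, 75, 52],
        [25, 24, 123, 78, 56, 14],
    ],
    columns=columns,
).sort_index(axis=1)  # sorted columns keep label-based slicing well defined

# All VFAT HIT columns, keeping every index level.
vfat_only = events.xs("VFAT HIT", axis=1, level="field_name", drop_level=False)

# Everything on link 0; the "link" level is dropped from the result.
link0 = events.xs(0, axis=1, level="link")

# VFAT HIT, slot 1, any link, positions 0 and 2 only.
subset = events.loc[:, pd.IndexSlice["VFAT HIT", 1, :, [0, 2]]]

print(vfat_only.shape, link0.shape, subset.shape)
```

With string column names, each of these selections would need string parsing or pattern matching on the column labels.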

1 loop, best of 5: 3.49 sec per loop

How does it compare to the previous implementation?

slightly faster. Was:

1 loop, best of 5: 3.9 sec per loop

Maybe I was not 100% clear: adding tuples to VFATs speeds up the code. Adding tuples everywhere (required for multi-indexing, as all column names should then be tuples of the same length, otherwise only the minimum tuple length can be used) slows it down a little, but this impact is minor compared to the gain from the "just VFAT" tuplization. Overall the code runs ~10% faster than with f-strings.
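
To make the "same length tuples" point concrete, here is a small sketch of one way to pad shorter keys. The `pad_key` helper is hypothetical and not taken from the unpacker; the `-1` filler is an assumption based on the `(Latency, -1, -1, -1)` column shown in the example table.

```python
import pandas as pd


def pad_key(key, width=4, fill=-1):
    """Return `key` as a tuple of exactly `width` items, padded with `fill`.

    Hypothetical helper: a bare field name such as "Latency" becomes
    ("Latency", -1, -1, -1); a full 4-tuple is returned unchanged.
    """
    key = key if isinstance(key, tuple) else (key,)
    return key + (fill,) * (width - len(key))


raw_keys = ["Latency", ("VFAT HIT", 1, 0, 0), ("VFAT HIT", 1, 0, 2)]
padded = [pad_key(k) for k in raw_keys]

# With uniform tuple lengths, every key maps cleanly onto the four levels.
index = pd.MultiIndex.from_tuples(
    padded, names=["field_name", "slot", "link", "position"]
)
print(padded[0], index.nlevels)
```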

It was clear and makes a lot of sense. All good then!

mentioned in merge request !3 (merged)

closed via merge request !3 (merged)

mentioned in commit f959f00c

	(Latency, -1, -1, -1)	(VFAT HIT, 1, 0, 0)	(VFAT HIT, 1, 0, 2)	(VFAT HIT, 1, 0, 3)	(VFAT HIT, 1, 1, 0)	(VFAT HIT, 1, 1, 2)
0	8	7	13	69	18	97
1	6	94	54	77	75	52
2	25	24	123	78	56	14
3	92	103	68	60	6	98
4	46	1	37	12	54	124

reg_name	Latency	VFAT HIT
slot	-1	1
link	-1	0			1
position	-1	0	2	3	0	2
0	8	7	13	69	18	97
1	6	94	54	77	75	52
2	25	24	123	78	56	14
3	92	103	68	60	6	98
4	46	1	37	12	54	124

reg_name	VFAT HIT
slot	1
link	0			1
position	0	2	3	0	2
0	7	13	69	18	97
1	94	54	77	75	52
2	24	123	78	56	14
3	103	68	60	6	98
4	1	37	12	54	124

reg_name	VFAT HIT
slot	1
position	0	2	3
0	7	13	69
1	94	54	77
2	24	123	78
3	103	68	60
4	1	37	12
