Anybody can reshape my list

More
5 years 10 months ago #17335 by smmsamm
How can I have some melting function in your software?

I am beginner in data mining,

I have a list of 10000 rows and about 200 columns (with about 100000 different values). the Simple data is like this :

look,1,2,3,4,5,6,7,8
book,4,5,6,7,8,102,104,107
look,6,7,8,9
hook,100,101,102
cook,7,8,9
build,102,103,104,107
hook,103,104,105
...


At first I need to make unique list of words:

look,1,2,3,4,5,6,7,8,9
book,4,5,6,7,8,102,104,107
hook,100,101,102,103,104,105
cook,7,8,9
build,102,103,104,107

Now I need to find lines with at least 3 (or n) similar values and generate a new list:

look,1,2,3,4,5,6,7,8,9
book,4,5,6,7,8,102,104,107
cook,7,8,9
*************

book,4,5,6,7,8,102,104,107
build,102,103,104,107
*************

hook,100,101,102,103,104,105
build,102,103,104,107
*************

please Tell me which software can help me and how?
Anybody can generate/show me a sample?
Please help me in anyway!
Thank you

P.s. Someone said me, python melt function can help me!

Please Log in or Create an account to join the conversation.

More
5 years 10 months ago #17336 by admin
Replied by admin on topic Anybody can reshape my list
Please stop posting same thing all the time or we ban you.

Mike
ETL Architect

Please Log in or Create an account to join the conversation.

More
5 years 10 months ago #17338 by smmsamm
Replied by smmsamm on topic Anybody can reshape my list
Oh excuse me I thought each forum parts is apart of others, I tried to delete two other my topics but I couldn't, please you delete those, but maybe here someone can show me a demo.
Thank you

Please Log in or Create an account to join the conversation.

More
5 years 10 months ago #17339 by admin
Replied by admin on topic Anybody can reshape my list
Good starting point for you would be installing the software and watching the tutorials

Mike
ETL Architect

Please Log in or Create an account to join the conversation.

More
5 years 10 months ago - 5 years 10 months ago #17342 by Peter.Jonson
Hi Smmsamm.

Mike asked me to have a look at it for you.

Making unique list of words is quite easy.

What you need to do is to transform the original list

look,1,2,3,4,5,6,7,8
book,4,5,6,7,8,102,104,107
look,6,7,8,9
hook,100,101,102
cook,7,8,9
build,102,103,104,107
hook,103,104,105

Into

look,1
look,2
look,3
look,4
look,5
look,6
look,7
look,8
book,4
book,5
book,6
book,7
book,8
book,102
book,104
book,107

Sort it, de duplicate it and convert it into original format.

5 MINUTES JOB

Peter Jonson
ETL Developer
Last edit: 5 years 10 months ago by Peter.Jonson.
The following user(s) said Thank You: smmsamm

Please Log in or Create an account to join the conversation.

More
5 years 10 months ago - 5 years 10 months ago #17344 by smmsamm
Replied by smmsamm on topic Anybody can reshape my list
Thank you, I think this method will generate first part, it is simple in programming but even with nested query it is difficult to fetch and finding duplicate data in unknown fields, I need to see etl tools can generate part two:

finding lines with at least 3 (or n) similar values (in variant unknown columns|fields) and generate a new list:

look,1,2,3,4,5,6,7,8,9
book,4,5,6,7,8,102,104,107
cook,7,8,9
*************

book,4,5,6,7,8,102,104,107
build,102,103,104,107
*************

hook,100,101,102,103,104,105
build,102,103,104,107
*************
Last edit: 5 years 10 months ago by smmsamm.

Please Log in or Create an account to join the conversation.