Skip to content
This repository has been archived by the owner on Dec 22, 2021. It is now read-only.

Analysis on #34, calculating percentage of scripts present in dataset #74

Closed
Soumya0803 opened this issue Mar 19, 2019 · 4 comments
Closed

Comments

@Soumya0803
Copy link
Contributor

Finding out the total number of scripts of the three types present in the dataset and calculating the percentage of each script and the three scripts considered together.

@birdsarah
Copy link
Contributor

birdsarah commented Mar 22, 2019

@Soumya0803 - can you elaborate why this is a meaningful / instructive thing to do? How does it advance the broader goal of #34?

@Soumya0803
Copy link
Contributor Author

Soumya0803 commented Mar 24, 2019

@birdsarah I was trying to find out which among the three has the greatest contribution to fingerprinting and start looking for that script first in the dataset followed by the other two and find out any individual characteristics and after that look for similarities.

@birdsarah
Copy link
Contributor

Hi. I'm still not completely sure that I understand. Let's break it down a little.

I was trying to find out which among the three has the greatest contribution to fingerprinting

The question is what is "contribution to fingerprinting" - these three scripts are not all the fingerprinting scripts, they're just known examples that we can look at. How does looking at the prevalence of these scripts relate to the population of all fingerprinting scripts?

find out any individual characteristics and after that look for similarities

This bit sounds right.

@birdsarah
Copy link
Contributor

This issue isn't actionable to general community, closing.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants