Open-source system/service/database alternative to a search algorithm [closed] - database

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 1 year ago.
Improve this question
I have a custom algorithm (written in java) to search in a list of strings (A) the string that is the longest substring of another string (B) (the longest common substring alg it's not suited in this case because the list of strings is big 100k+ ).
EX:
B -> "sadsaf dsfsc adsa 4 sad3 dfa fgs adsafd"
A -> ["fdsdf dsa", "adsa", "4 sad3", "cdsdfds dsa", "cx d45"]
And the result is "4 sad3" since its a subtring of B and also is
longer than "adsa" which is also a substring of B
I'm trying to find an alternative to a search algorithm using a system/service/database in order to externalize this algorithm .
What i've tried till now is:
mysql but it's pretty slow and it requires at least a ssd and a powerful cpu
elasticsearch using percolate query which i didn't benchmark yet but seems promising
redis but i didn't found a way to replicate the alg using their syntax
So any suggestion regarding a system/service/database that can do this relatively decent in terms of performance is appreciated, since the more options i'll have then the better (faster) the solution will be.

Related

What does this tip mean? [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 4 years ago.
Improve this question
I need to write such a program:
Write a program that will fill an array with 30 elements of type int randomly selected from range 0 to 10. Print the array content on the screen. Then count and print out how many times each value appears in the array (tip: histogram).
Can somebody explain what this tip means?
Histogram is a representation, you can look it up online. Simply put in your, showing an item and its frequency(no:of occurrences) in a pictorial form. Check this image
Number column has the items that are present in your array, and count column has number of times each of them occur in your array(frequency). Frequency adds up to your array size - 30. Representing an item with its frequency is a histogram. And the graph is a pictorial representation.
The tip tells you what printing representation(number and its count) is.

Data sets for emotion detection in text [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 5 years ago.
Improve this question
I'm implementing a system that could detect the human emotion in text. Are there any manually annotated data sets available for supervised learning and testing?
Here are some interesting datasets:
https://dataturks.com/projects/trending
The field of textual emotion detection is still very new and the literature is fragmented in many different journals of different fields. Its really hard to get a good look on whats out there.
Note that there a several emotion theories psychology. Hence there a different ways of modeling/representing emotions in computing. Most of the times "emotion" refers to a phenomena such as anger, fear or joy. Other theories state that all emotions can be represented in a multi-dimensional space (so there is an infinite number of them).
Here are a some (publicly available) data sets I know of (updated):
EmoBank. 10k sentences annotated with Valence, Arousal and Dominance values (disclosure: I am one of the authors). https://github.com/JULIELab/EmoBank
The "Emotion Intensity in Tweets" data set from the WASSA 2017 shared task. http://saifmohammad.com/WebPages/EmotionIntensity-SharedTask.html
The Valence and Arousal Facebook Posts by Preotiuc-Pietro and
others:
http://wwbp.org/downloads/public_data/dataset-fb-valence-arousal-anon.csv
The Affect data by Cecilia Ovesdotter Alm:
http://people.rc.rit.edu/~coagla/affectdata/index.html
The Emotion in Text data set by CrowdFlower
https://www.crowdflower.com/wp-content/uploads/2016/07/text_emotion.csv
ISEAR:
http://emotion-research.net/toolbox/toolboxdatabase.2006-10-13.2581092615
Test Corpus of SemEval 2007 (Task on Affective Text)
http://web.eecs.umich.edu/~mihalcea/downloads.html
A reannotation of the SemEval Stance data with emotions:
http://www.ims.uni-stuttgart.de/data/ssec
If you want to go deeper into the topic, here are some surveys I recommend (disclosure: I authored the first one).
Buechel, S., & Hahn, U. (2016). Emotion Analysis as a Regression Problem — Dimensional Models and Their Implications on Emotion Representation and Metrical Evaluation. In ECAI 2016.22nd European Conference on Artificial Intelligence (pp. 1114–1122). The Hague, Netherlands (available: http://ebooks.iospress.nl/volumearticle/44864).
Canales, L., & Martínez-Barco, P. (n.d.). Emotion Detection from text: A Survey. Processing in the 5th Information Systems Research Working Days (JISIC 2014), 37 (available: http://www.aclweb.org/anthology/W14-6905).

Database which has categorized the english words into matching emotion [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
We don’t allow questions seeking recommendations for books, tools, software libraries, and more. You can edit the question so it can be answered with facts and citations.
Closed 4 years ago.
Improve this question
is there a database or api which has the categorized version of English words into the matching emotion?
e.g: - http://www.psychpage.com/learning/library/assess/feelings.html
One useful resource is the NRC Word-Emotion Association Lexicon compiled by Saif Mohammad. It lists the sentiments (positive, negative) and emotions (anger, anticipation, disgust, fear, joy, sadness, surprise, trust) for around 14,000 English words.
I would take a look into the topic of http://en.wikipedia.org/wiki/Sentiment_analysis If you are good with doing it in Python take a look at this demo: http://text-processing.com/demo/sentiment/ Which is able to get if a sentence is positive or negative using NLTK.

Count a 2D array within a range [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
This question does not appear to be about a specific programming problem, a software algorithm, or software tools primarily used by programmers. If you believe the question would be on-topic on another Stack Exchange site, you can leave a comment to explain where the question may be able to be answered.
Closed 9 years ago.
Improve this question
I have a map that I want to separately count the patterns of different numbers.
Without VB, I want to be able to create a dynamic counter that will be able to count the patterns of numbers.
For example:
I want to count how many times, even if it overlaps that this pattern occurs in the map
2 2
2 2
Counting I can see the pattern occurs six times but I'm struggling to create a simple array formula that will be able to do so
I've been told of success with and IF function with nested AND functions so I know it can be done without VB.
Use the formula
=COUNTIFS(A1:E15,2,B1:F15,2)
notice how the two areas are adjacent - one column offset from each other.
You can extend this to find two-by-two regions:
=COUNTIFS(A1:E14,2,B1:F14,2,A2:E15,2,B2:F15,2)
just be very careful about how the different ranges are offset.
An alternative way to write this which, I suspect, will be more efficient for large ranges is:
=SUMPRODUCT((A1:E14=2)*(B1:F14=2)*(A2:E15=2)*(B2:F15=2))

Coding site with test harness for basic data structures like linked list,graph,strings etc [closed]

Closed. This question is off-topic. It is not currently accepting answers.
Want to improve this question? Update the question so it's on-topic for Stack Overflow.
Closed 11 years ago.
Improve this question
Are there any coding sites with test harness for basic data structures and with basic problems like arrays,strings,linked list,graph adjacency list etc, so that I can brush up on basic codes like reversing linked list, checking if linked list is palindrome, sorting linked list, finding a substring in a string etc. This would make basic DS brushing fun.
I know there are sites like codechef and topcoder, but from what I have seen, these don't have such basic list,graph conceptual DS questions and test harness.
And, it can also be perhaps used for undergrad course in DS..

Resources