Watson speech-to-text numericals - ibm-watson

Working with Speech to Text service and noticed that it doesn't recognize numbers when you say them. For example if I say three hundred and forty three it won't write '343' and writes 'three hundred and forty three' instead. Siri on the other hand can convert to three hundred and forty three to 343 any way I can get around this, using Watson?

We are working on this feature, but we don't have a release date yet. I will update this answer once we have it.

Related

Ive got a pipe that consists of 5 pieces, each including 5 properties

Inlet -> front -> middle -> rear -> outlet
Those five properties have a value anything between 4 - 40. Now i want to calculate a specific match for each of those values that is either a full 10 or a 5 when a single property is summed from each pipe piece. There might be hundreds of different pipe pieces all with different properties.
So if i have all 5 pieces and when summed, their properties go like 54,51,23,71,37. That is not good and not what im looking.
Instead 55,50,25,70,40. That would be perfect.
My trouble is there are so many of the pieces that it would be insane to do the miss'matching manually, and new ones come up frequently.
I have manually inserted about 100 of these already into SQLite, but should be easy to convert into any excel or other database formats, so answer can be related to anything like mysql or googlesheets.
I need the calculation that takes every piece in account and results either in "no match" or tells me the id of each piece that is required for a match and if multiple matches are available, it separates them.
Edit: Even just the math needed to do this kind of calculation would be a lot of help here, not much of a math guy myself. I guess there should be a reference piece i need to use and then that gets checked against every possible scenario.
If the value you want to verify is in A1, use: =ROUND(A1/5,0)*5
If the pipes may not be shorter than the given values, use =CEILING(A1,5)

how to efficient compute multi flink stream result from one source

for example:
There is a video play log, with format as:
user_id, video_id, play_duration, video_duration, country, request_date
I want to compute:
a. number of play event in last minute
b. number of play event today
c. the top10 most popular video (be played times) in last hour
d. the top10 most active user (play times) in last hour
In above 4 tasks, they consume same source, but with three different time window.
Are there any efficient solution to fulfill these jobs without start 3 or 4 jobs to duplicate consumption.
You can have a single job that splits the stream 4 ways, and then attach one of these window operators to each of the duplicated streams. You can do the splitting with either split/select, or with side outputs.

List of single word English nouns?

i'm trying to find a list of popular English nouns, any list ranging from about 6000 - 20000 words would be fine. I have a list at the moment consisting of almost 150,000 nouns but many of these words are not really nouns such as 'I', 'You', 'Main' etc. It also contains entries which consist of multiple words. I could write a short script to try and filter this list but i thought there might be a better option available. I have tried a few places on the web for stuff like English language learning but most of them only contain things like 'top 1500 nouns'.
Some time back I used some input from Stanford NLP (http://nlp.stanford.edu/), I think it's worth looking at. If they don't provide some static database they definitely have tools that will help you in cleaning up your list.

How to build a system to auto create paragraphs of text based on basic data

For example, given a basketball box score with the final score and the score at the end of each of the quarters, it could put together a summary of sorts:
Wildcats Win Close Game!
Despite a late charge by the Flying Eagles, the Wildcats hold on to win by 2.
The Wildcats started the game behind, though a surge through the middle of the
game put them ahead at the start of the fourth quarter. But then the Flying
Eagles put on the pressure late, though in the end, it wasn't enough.
I obviously don't want to build a huge if statement for every possible case, and I'm fine if the stories are identical for similar data sets. I don't have a language preference, mostly I'm curious about how to build something about this. References, keywords to search on, or places to learn this kinda thing would be greatly appreciated.

Best way of splitting website into Cities/Countries?

i am running a dating website, or better say, try to. I want to be as generic
as possible in terms of, coverage for countries.
Since it's a local dating oriented website, i have to keep track of cities
and so on so i am running into a few problems:
I need to have information about cities, people could join from
(a country i don't know anything about, i really can't decide about a city
or anything)
When people join from different countries, they would like to see people
near by. How would you approach this sort or problem?
This may seem like 2 easy points but they really make me some trouble these
days.
** Try to approach them as Database related issues **
I circled around SE and find this to be the best point to ask, not any other
SE website.
Thank you very much
The full name of a city in the USA is {country_name, state_name, city_name}. I say that assuming that, on a global scale, what I call state_name is not unique. I can easily imagine some region of a Latin American country being named "Nevada" or "Colorado".
You also have the problem of spelling. An American living in Germany might say she lived in Cologne, North Rhine-Westphalia, Germany. A German living there would call it "Köln".
"Nearby" is an even tougher nut to crack. I used to live in a big city. It wasn't unusual to drive an hour on an Interstate highway to pick up my date. I thought of that as "nearby". Now I live in a small city. Now 30 minutes seems like a long drive, but it's probably only 15 or 20 miles instead of 65 or 70.
If you live in a border town, 15 miles might be in a different country. Maybe two different countries.
I think your best bet is to use a geolocation API or service to get the latitude and longitude of new cities when they're inserted. The calculation of distance given two points is straightforward, but it leads to badly performing queries unless you use a bounding box. (You don't want to calculate the distance to every city in Texas to find people "near" Waco.)
Get the latitude/longitude for all of the cities. Don't worry about exact distances.
Play with excel and find the maximum change for latitude and/or longitude (Δlat or Δlong) for given distances (10miles, 25 miles, 50 miles), and then when you search for others, just search the database for within (Firstuser'sLongitude +/- Δlong) and within (Firstuser'sLatitude +/-Δlong).
I don't think people will mind the difference between 10miles and 10*2^0.5 miles.

Resources