Designing a database for word frequency and text analysis [closed] - database

Closed. This question is opinion-based. It is not currently accepting answers.
Want to improve this question? Update the question so it can be answered with facts and citations by editing this post.
Closed 9 years ago.
I have a bunch of articles, on which I want to do word frequency and trend analysis.
The articles are tagged with date, author, theme and subject. I want to use these tags to slice the data so that I can get the most common words used by a specific author (or group of authors), theme(s) or subject(s), both overall and over time (trend).
How would I design this database (relational or other), or should I create a data cube?

Rizzoma.com built this with CouchDB (NoSQL) and Sphinx (a full-text search engine).
You can build it another way if you prefer, or test the existing solution and replicate it.
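
For the relational route the question asks about, one possible layout is an articles table holding the tags plus a per-article word count table. The following is only a minimal sketch using Python's built-in sqlite3 module; the table names, column names, tokenizer and sample rows are all assumptions made for illustration, not something taken from the question or from Rizzoma.

```python
import re
import sqlite3
from collections import Counter

# In-memory database; the schema below is a hypothetical layout.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE articles (
    id        INTEGER PRIMARY KEY,
    published TEXT NOT NULL,   -- ISO date such as '2013-07-15'
    author    TEXT NOT NULL,
    theme     TEXT NOT NULL,
    subject   TEXT NOT NULL
);
CREATE TABLE word_counts (
    article_id INTEGER NOT NULL REFERENCES articles(id),
    word       TEXT NOT NULL,
    n          INTEGER NOT NULL,
    PRIMARY KEY (article_id, word)
);
""")

def add_article(published, author, theme, subject, text):
    """Store the article's tags and its per-word counts."""
    cur = conn.execute(
        "INSERT INTO articles (published, author, theme, subject) "
        "VALUES (?, ?, ?, ?)",
        (published, author, theme, subject),
    )
    article_id = cur.lastrowid
    # Naive tokenizer (an assumption): lowercase runs of letters/apostrophes.
    counts = Counter(re.findall(r"[a-z']+", text.lower()))
    conn.executemany(
        "INSERT INTO word_counts (article_id, word, n) VALUES (?, ?, ?)",
        [(article_id, word, n) for word, n in counts.items()],
    )

def top_words(author, limit=3):
    """Most common words for one author, across all of their articles."""
    return conn.execute(
        """SELECT w.word, SUM(w.n) AS total
           FROM word_counts w JOIN articles a ON a.id = w.article_id
           WHERE a.author = ?
           GROUP BY w.word
           ORDER BY total DESC, w.word
           LIMIT ?""",
        (author, limit),
    ).fetchall()

def yearly_trend(word):
    """Total occurrences of one word per publication year."""
    return conn.execute(
        """SELECT substr(a.published, 1, 4) AS year, SUM(w.n)
           FROM word_counts w JOIN articles a ON a.id = w.article_id
           WHERE w.word = ?
           GROUP BY year
           ORDER BY year""",
        (word,),
    ).fetchall()

# Made-up sample data, only to exercise the queries.
add_article("2012-03-01", "alice", "tech", "databases", "data data index")
add_article("2013-07-15", "alice", "tech", "search", "index search data")
add_article("2013-09-02", "bob", "science", "biology", "cell data")

print(top_words("alice"))
print(yearly_trend("data"))
```

Filtering by theme or subject would follow the same pattern with a different WHERE clause. Whether this scales to a large corpus, or whether a cube or a search engine fits better, depends on data volume that the question does not state.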

Related

Does the amount of data matter in data analytics? [closed]

Closed. This question is opinion-based. It is not currently accepting answers.
Closed 3 years ago.
So I want to know whether data analytics can be done using a small amount of data, such as 100 to 1000 records stored in a database. If I do so, is it still called data analytics?
Somebody told me that it is not called data analytics at all if you are analysing a small amount of data.
So I am confusing data analytics with big data. Can anyone clarify this for me?
Many thanks in advance.
If you are analyzing data to discover information that aids decision making, it is called data analytics regardless of the size of the data. However, yours might not be called big data analytics; that is probably what your friend or colleague meant.

In order to have a search feature on data in a database, which one has better performance: Elasticsearch or Sphinx? [closed]

Closed. This question needs to be more focused. It is not currently accepting answers.
Closed 4 years ago.
We have a database that stores 10 million records every day. I want to run a filter query on data within a specific date range, but it is very slow on SQL Server, so I decided to move the data to a full-text search engine such as Elasticsearch or Sphinx for faster search. I wonder which one has better search performance.
I have used Elasticsearch before and am familiar with its features, but I have no experience with Sphinx.
Thanks.

CouchDB: Write and update documents, not read [closed]

Closed. This question needs to be more focused. It is not currently accepting answers.
Closed 5 years ago.
Quick question: Is there a way to let a user write (/create) documents and let him update and read his own documents, but not read documents of other users in a shared database?
No. Fine-grained security is implemented in design documents for updates, not for reads. You can:
use a database per user and replicate them to the shared database, or
have each user add only to their own _users document, since _users implements this as a special case.
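
The database-per-user option could be wired up roughly as follows. This is a hedged sketch that only builds the request for CouchDB's `_replicate` endpoint and does not send it. The server URL, the shared database name `articles` and both helper functions are hypothetical. The `userdb-` plus hex-encoded username naming mirrors what CouchDB's couch_peruser feature uses, and is assumed here rather than required.

```python
import json

def per_user_db_name(username: str) -> str:
    """Private database name: 'userdb-' + hex of the UTF-8 username.

    Assumption: this follows the couch_peruser naming convention.
    """
    return "userdb-" + username.encode("utf-8").hex()

def replication_request(server: str, username: str, shared_db: str) -> dict:
    """Describe a POST to /_replicate that copies one user's private
    database into the shared database. Nothing is sent here."""
    return {
        "method": "POST",
        "url": f"{server}/_replicate",
        "body": {
            "source": f"{server}/{per_user_db_name(username)}",
            "target": f"{server}/{shared_db}",
        },
    }

# Hypothetical server and database names, for illustration only.
request = replication_request("http://localhost:5984", "alice", "articles")
print(json.dumps(request, indent=2))
```

With this layout each user gets read and write access only to their own database, and the shared database is kept unreadable for ordinary users, which is how the read restriction is actually enforced.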

Images In SQL Server [closed]

Closed. This question is opinion-based. It is not currently accepting answers.
Closed 8 years ago.
Is it better to save an image path as varchar in a database, or to save the image itself as blob/varbinary? The image will appear on the front end, and I will be using classic ASP to implement that.
Generally speaking, I find that it is easier to save the image path in the database. This makes the database more portable, and I find that it is easier to control access to the data.
There is a discussion here that goes into more depth. It comes to the same conclusion I do: file paths are generally better.
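
As an illustration of the path-based approach, here is a minimal sketch in Python with sqlite3 standing in for SQL Server. The table layout, the `IMAGE_ROOT` directory and the helper names are assumptions. The column stores a path relative to a configurable root, so the files can move without touching the rows.

```python
import sqlite3
from pathlib import PurePosixPath

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE images ("
    " id INTEGER PRIMARY KEY,"
    " title TEXT NOT NULL,"
    " path TEXT NOT NULL)"  # relative path only; the file lives on disk
)

# Hypothetical directory where the web server keeps the image files.
IMAGE_ROOT = PurePosixPath("/var/www/images")

def add_image(title, relative_path):
    """Record where the file lives; the image bytes stay out of the database."""
    cur = conn.execute(
        "INSERT INTO images (title, path) VALUES (?, ?)",
        (title, relative_path),
    )
    return cur.lastrowid

def image_location(image_id):
    """Resolve the stored relative path against the current root."""
    row = conn.execute(
        "SELECT path FROM images WHERE id = ?", (image_id,)
    ).fetchone()
    return None if row is None else str(IMAGE_ROOT / row[0])

logo_id = add_image("Logo", "branding/logo.png")
print(image_location(logo_id))
```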

Benefits of Using Datasets [closed]

Closed. This question is opinion-based. It is not currently accepting answers.
Closed 8 years ago.
I understand that datasets contain datatables and that they can house the relationships between those datatables. I am making a simple form that gets data from SQL Server by way of stored procedures that return the records I need subject to certain parameters. It is not strictly necessary that I model relationships between the datatables. Are there other benefits of using a dataset to contain them, or am I just as well off leaving them free-standing?
For example, you can automatically perform actions on related tables (such as automatically deleting related rows), or you can add constraints that depend on those relations.
It will also help you draw a database diagram.
