Anomaly detection with Apache Flink

I would like to know whether there is an open issue or any related work (publication, platform) on anomaly detection with Apache Flink for a streaming data scenario.
So far I have only found the one from Mux, by Scott Kidder, from 2017. Is there something more recent or a work in progress? Thanks!
EDIT
I also found flink-htm, which uses the HTM.java framework.

I got help from the Flink community through the mailing list, and these are some of the ideas they gave me (thanks to Marta):
Anomaly detection with Flink is covered in these presentations from Microsoft [1] and Salesforce [2] at Flink Forward. This blog post [3] also describes how to build that kind of application using Kinesis Data Analytics (which is based on Flink).

Flink CLI vs Flink Web Console

We have a requirement to replace the Flink web console UI and make all of its functionality available through CLI utilities. For some of that functionality, such as starting jobs and taking savepoints, we are already using the Flink CLI.
My questions are:
Does the Flink CLI have parity with the Flink web UI console?
If not, are there alternative ways to do, without the UI, what is possible via the Flink console (such as checking or monitoring the backpressure of a job)?
I am trying to find a solution where an on-call engineer can completely monitor and operate Flink from the command line / terminal, without needing to go to the web UI.
Thanks in advance.
In theory the Flink CLI plus the REST API provide a superset of the functionality available via the web UI. But some things, like identifying a busy task that's causing backpressure, can be done much more quickly with the web UI. For monitoring and troubleshooting I think you'll need to build some tooling, set up a metrics dashboard (e.g., using Grafana in combination with your preferred metrics reporter), or both.
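As a rough illustration of the kind of tooling meant here, the following Python sketch polls the REST API for backpressure instead of opening the web UI. It assumes the JobManager REST endpoint is reachable at a base URL such as `http://localhost:8081`, and it accepts both `backpressureLevel` and `backpressure-level` as field names because the spelling differs between Flink versions. The helper names (`backpressure_url`, `backpressured_subtasks`, `report`) are made up for this example.

```python
import json
from urllib.request import urlopen


def backpressure_url(base_url, job_id, vertex_id):
    """Build the REST URL for the backpressure sample of one job vertex."""
    return f"{base_url.rstrip('/')}/jobs/{job_id}/vertices/{vertex_id}/backpressure"


def backpressured_subtasks(payload):
    """Return the subtask indices whose backpressure level is not 'ok'."""
    busy = []
    for sub in payload.get("subtasks", []):
        # Assumption: the field name depends on the Flink version, so try both.
        level = sub.get("backpressureLevel", sub.get("backpressure-level", "ok"))
        if str(level).lower() != "ok":
            busy.append(sub.get("subtask"))
    return busy


def fetch_json(url, timeout=5):
    """GET a URL and decode the JSON body."""
    with urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


def report(base_url):
    """Print backpressured subtasks for every vertex of every running job."""
    for job in fetch_json(f"{base_url.rstrip('/')}/jobs/overview").get("jobs", []):
        if job.get("state") != "RUNNING":
            continue
        detail = fetch_json(f"{base_url.rstrip('/')}/jobs/{job['jid']}")
        for vertex in detail.get("vertices", []):
            sample = fetch_json(backpressure_url(base_url, job["jid"], vertex["id"]))
            busy = backpressured_subtasks(sample)
            if busy:
                print(f"{job['name']} / {vertex['name']}: subtasks {busy}")
```

Calling `report("http://localhost:8081")` from a terminal session or a cron job would give an on-call engineer a text-only view; the first request for a vertex may return an empty sample while Flink is still collecting it.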

Zeppelin: Need to know more about Zeppelin

I have recently started learning Zeppelin. I know we can use Angular, PostgreSQL, etc. within it through interpreters. I have gone through its tutorial as well, but it is not as descriptive as I expected. I have several questions, which may help other beginners too.
1> How can we create an API for Zeppelin (if possible)? Most client-side apps use an API, so is it possible to create one in Zeppelin, and in which language? If possible, I am thinking of creating the API in Java or Node.js (JS).
2> Is it possible to integrate Zeppelin graphs into any UI (Angular or HTML)?
3> How can we deploy a Zeppelin-based application in a production environment?
If you have any good tutorial source, please attach it.
If I have asked unrelated questions, please point it out and I will change them.
Thanks in advance for your help and your precious time!
1> Apache Zeppelin has a broad, well-documented REST API [1]. You can use any language to work with the API.
2> Yes [2]. You can embed a paragraph result in your website.
3> You can use the binary package or build from source [3].
For tutorials, [4] contains a lot of code in its setup section.
--
[1] http://zeppelin.apache.org/docs/0.8.0/usage/rest_api/notebook.html
[2] http://zeppelin.apache.org/docs/0.8.0/usage/other_features/publishing_paragraphs.html
[3] http://zeppelin.apache.org/docs/0.8.0/quickstart/install.html
[4] http://zeppelin.apache.org/docs/0.8.0/
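To make the first two points more concrete, here is a small Python sketch against the notebook REST API and the paragraph-publishing link format. It assumes a Zeppelin server at a base URL such as `http://localhost:8080` with no authentication; the function names are invented for this example, and the note and paragraph IDs used with them are placeholders.

```python
import json
from urllib.request import Request, urlopen


def list_notes_request(base_url):
    """Request that lists all notes (GET /api/notebook)."""
    return Request(f"{base_url.rstrip('/')}/api/notebook", method="GET")


def run_note_request(base_url, note_id):
    """Request that runs all paragraphs of a note (POST /api/notebook/job/<noteId>)."""
    return Request(f"{base_url.rstrip('/')}/api/notebook/job/{note_id}", method="POST")


def paragraph_iframe_url(base_url, note_id, paragraph_id):
    """URL of a single paragraph result, suitable for the src of an <iframe>."""
    return f"{base_url.rstrip('/')}/#/notebook/{note_id}/paragraph/{paragraph_id}?asIframe"


def note_names(response_body):
    """Map note id to note name from the JSON returned by the list call."""
    # Assumption: newer releases report "path" instead of "name".
    return {n["id"]: n.get("name", n.get("path", "")) for n in response_body.get("body", [])}


def send(request, timeout=10):
    """Send a prepared request and decode the JSON response."""
    with urlopen(request, timeout=timeout) as resp:
        return json.load(resp)
```

For example, `note_names(send(list_notes_request("http://localhost:8080")))` would return the notes on a local server, and the string from `paragraph_iframe_url` can be pasted into an `<iframe>` tag in an Angular or plain HTML page.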

Is there any way to index Kafka outputs in Apache Solr?

I'm new to Apache Solr and I want to index data from Kafka into Solr. Can anyone give a simple example of doing this?
The easiest way to get started on this would probably be to use Kafka Connect.
Connect is part of the Apache Kafka package, so it should already be installed on your Kafka node(s). Please refer to the quickstart for a brief introduction on how to run Connect.
For writing data to Solr there are two connectors that you could try:
https://github.com/jcustenborder/kafka-connect-solr
https://github.com/MSurendra/kafka-connect-solr
While I don't have any experience with either of them, I'd probably try Jeremy's first, based on the date of its latest commit and the fact that he works for Confluent.
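If you want to see what such a sink boils down to before setting up a connector, the Python sketch below shows the Solr half by hand. This is not how either connector is implemented and is not a replacement for Kafka Connect. It assumes the Kafka record values are JSON objects, that the records arrive as `(key, value)` pairs from whatever consumer client you use, and that Solr is reachable at a URL such as `http://localhost:8983/solr`. The function names and the collection name are hypothetical.

```python
import json
from urllib.request import Request, urlopen


def records_to_docs(records):
    """Turn (key, value) pairs from Kafka into Solr documents.

    Assumption: each value is a JSON object; the record key becomes the
    document id when the value does not already carry one.
    """
    docs = []
    for key, value in records:
        doc = json.loads(value)
        if isinstance(key, bytes):
            key = key.decode("utf-8")
        doc.setdefault("id", key)
        docs.append(doc)
    return docs


def solr_update_request(solr_url, collection, docs, commit=True):
    """Build a POST to Solr's JSON update handler for a batch of documents."""
    url = f"{solr_url.rstrip('/')}/{collection}/update"
    if commit:
        url += "?commit=true"
    body = json.dumps(list(docs)).encode("utf-8")
    return Request(url, data=body,
                   headers={"Content-Type": "application/json"}, method="POST")


def index_batch(solr_url, collection, records, timeout=10):
    """Send one batch of Kafka records to Solr and return the parsed reply."""
    request = solr_update_request(solr_url, collection, records_to_docs(records))
    with urlopen(request, timeout=timeout) as resp:
        return json.load(resp)
```

A connector adds what this sketch leaves out: offset management, retries, batching policy, and scaling across tasks, which is why Kafka Connect is still the easier route for anything beyond an experiment.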

Configure SOLR indexing with AEM 6.1

I am trying to configure SOLR indexing with AEM 6.1 but am struggling with the following:
What type of installation should be used for SOLR (embedded vs. remote)?
How do I configure the selected installation with AEM 6.1?
How do I determine whether the indexing has completed successfully?
Any help would be extremely appreciated.
I did a lot of research on SOLR with AEM 6.0, so I think I can answer your questions.
It depends on the client's requirements. If the client needs quick search responses and has a lot of content, you should prefer remote over embedded. If the site has little content, embedded could be sufficient. A remote installation allows a sharded cluster, so you can improve concurrent search response times by adding more shards.
The AEM documentation covers this well. You can find it here: https://docs.adobe.com/docs/en/aem/6-1/deploy/platform/queries-and-indexing.html
Finally, you can check that indexing is configured correctly by enabling the logging configuration for search (look for "Enable Logging" at the previous URL). You can test your search with the tools in ACS AEM Tools (http://adobe-consulting-services.github.io/acs-aem-tools/).
Hope this will help you.
Stéphane
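For a remote installation, one additional sanity check is to ask Solr directly how many documents it holds. The Python sketch below uses Solr's standard select handler for that; it is not an AEM feature. The core name `oak` and the base URL `http://localhost:8983/solr` are assumptions, so substitute whatever your remote server is configured with, and note that the function names are made up for this example.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen


def count_query_url(solr_url, core, query="*:*"):
    """Build a select URL that returns only the match count (rows=0)."""
    params = urlencode({"q": query, "rows": 0, "wt": "json"})
    return f"{solr_url.rstrip('/')}/{core}/select?{params}"


def num_found(response):
    """Extract the number of matching documents from a select response."""
    return response["response"]["numFound"]


def indexed_documents(solr_url, core, timeout=10):
    """Query Solr and return how many documents the core currently holds."""
    with urlopen(count_query_url(solr_url, core), timeout=timeout) as resp:
        return num_found(json.load(resp))
```

If `indexed_documents("http://localhost:8983/solr", "oak")` stays at zero after content has been created in AEM, the index configuration or the connection between AEM and Solr is the first thing to check.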
I stumbled upon these projects while searching for AEM Solr integration:
https://github.com/headwirecom/aem-solr-search
https://github.com/infielddesign/aem-id-solrindexer
Maybe it will help you with the Solr integration and configuration.
And as already mentioned above, the type of installation highly depends on your customer requirements.

Any Good Examples of Mobile Sync Framework Solutions

We are investigating using the Microsoft Mobile Sync Framework and I would like to put together a quick prototype. I am trying to pitch the Mobile Sync Framework over another sync provider. I have zero experience with it so far, so I am looking for good examples of how to use it to sync a backend relational database to a SQL Server Compact database to enable working in an offline scenario. It would be very helpful to see a sample syncing solution with Oracle as the back end, but SQL Server should be fine for getting an understanding.
Thanks,
Tom
Have you checked out Oracle Database Lite?
It includes a full sync solution for mobile devices, and is (obviously) compatible with an Oracle back end.
You can download it here:
http://www.oracle.com/technetwork/database/database-lite/downloads/index.html
The download includes full documentation and several source code examples.
See here: Sync Framework Resources. Hope that thread helps.
For an end-to-end solution for syncing between a mobile device and a server, have a look at the SyncComm project.
Also, this article from syncguru should give you a head start on writing your own provider for Oracle. It's quite straightforward to plug such a provider into SyncComm, but I can provide further guidance if need be.
