NiFi as a Operate in DataFlow Service
Introduction
With the overall availability of Cloudera DataFlow for the Public Cloud (CDF-PC), our clients can now self-serve deployments of Apache NiFi information flows on Kubernetes clusters in a price efficient means offering auto scaling, useful resource isolation and monitoring with KPI-based alerting.
You could find extra data on this launch announcement weblog put up and on this technical deep dive weblog put up. Any buyer prepared to run NiFi flows effectively at scale ought to now think about adopting CDF-PC.
Nonetheless, for sure use circumstances, we wish to go one step additional. Right this moment, when clients wish to course of recordsdata as they land right into a bucket, or expose microservices that can be referred to as intermittently, they should run NiFi flows as a protracted working utility which isn’t value efficient.That is the rationale we wish to present our clients with a very serverless choice for working NiFi flows and to introduce NiFi as a Operate in DataFlow Service, which is on the market as a personal Tech Preview as of right this moment. NiFi as a Operate in DataFlow Service gives an environment friendly, value optimized, scalable strategy to run NiFi flows in a very serverless trend. That is significantly highly effective every time the use case is occasion pushed and there’s no want for NiFi situations to at all times be up and working.
For individuals not acquainted with NiFi, NiFi as a Operate in DataFlow Service permits the primary no-code UI permitting builders to take management of the complete lifecycle of capabilities. In a matter of minutes, you may develop and deploy capabilities for all cloud suppliers.
Features as a Service
Features as a Service (FaaS) is a class of cloud computing providers that each one primary cloud suppliers are providing (AWS Lambda, Azure Features, Google Cloud Features, and many others). It permits clients to run micro functions which are triggered on particular occasions with out the complexity of constructing and sustaining the structure related to the operation and launching of the functions.
It additionally successfully gives a serverless structure and may be very broadly used when constructing microservices functions. By serverless we imply that assets are provisioned solely when and whereas information is being processed by the applying. This manner you don’t want at all times up and working assets to serve your utility. That is essentially the most value efficient means of working functions that solely must course of information following particular occasions.
With NiFi as a Operate, DataFlow Service will allow builders to carry out operate lifecycle administration utilizing the NiFi no-code designer and the DF Service Catalog after which run that move backed by cloud suppliers’ managed FaaS.
Occasion pushed use circumstances
When configuring a operate in a FaaS resolution, a set off should be specified. This half is managed by the cloud supplier to search for particular occasions to occur and fireplace the configured operate when such occasions happen. There are a lot of triggers offered by the cloud suppliers for each FaaS resolution on the market. Some quite common triggers are those firing the operate every time a file lands in a bucket, every time a message is obtained in a subject of a message queue resolution, every time an HTTP request is made to a selected endpoint, and plenty of others.
How does this translate in NiFi? To be used circumstances the place NiFi must course of information following a selected occasion, NaaF gives an environment friendly, value optimized, and scalable strategy to run NiFi flows to course of the information with out the necessity to have a protracted working move.
Good examples of actual life use circumstances for NiFi as a Operate embrace processing recordsdata as quickly as they’re obtained in a bucket, actual time ingestion of logs obtained from a really giant variety of sources, exposing HTTP endpoints to supply microservices and processing information obtained from sensors within the IoT area. NaaF working within the Cloud suppliers’ FaaS makes it straightforward to chain collectively a number of capabilities and supply nearly limitless scaling with no ops.
With NiFi as a Operate, the DataFlow service will speed up the event of your operate utilizing the NiFi no-code UI, supply an ever rising set of processors and integrations to course of your information, and allow a sturdy SDLC resolution round it. Through the use of NiFi as a Operate, you don’t must code your capabilities your self anymore. Simply design your move in NiFi and you’ll be up and working in a couple of minutes whereas leveraging the 400+ processors already out there!
You wish to compress some information as quickly because it lands into S3? Create your dataflow and switch it into an AWS Lambda operate with an S3 set off in a couple of minutes:

You wish to create a nearly limitless scalable HTTPS gateway for pushing information into Kafka? That’s straightforward, simply develop your move and use the API Gateway Set off:

Conclusion
With the addition of NiFi as a Operate within the DataFlow Service, Cloudera permits Apache NiFi as the primary no-code UI for constructing and working capabilities very effectively for a really big selection of use circumstances. Watch our Dwell Demo Jam: when and methods to use NiFi Stateless to run information move.



