FLaNK AI Weekly 18 March 2024

FLaNK AI Weekly 18 March 2024

18-March-2024

FLaNK Stack Weekly

Tim Spann @PaaSDev

https://pebble.is/PaaSDev

https://vimeo.com/flankstack

https://www.youtube.com/@FLaNK-Stack

https://www.threads.net/@tspannhw

https://medium.com/@tspann/subscribe

https://www.cloudera.com/campaign/apache-nifi-for-dummies.html

https://ossinsight.io/analyze/tspannhw

Congrats to my wife for being the youngest Leader of our local Elks!

CODE + COMMUNITY

Please join my meetup group NJ/NYC/Philly/Virtual.

http://www.meetup.com/futureofdata-princeton/

https://www.meetup.com/futureofdata-newyork/

https://www.meetup.com/futureofdata-philadelphia/

*This is Issue #129 *

https://github.com/tspannhw/FLiPStackWeekly

https://www.cloudera.com/solutions/dim-developer.html

New Releases

https://cldr-steven-matison.github.io//blog/CEM-2.1.2-Release/

Articles

Image Processing with Custom Python and Apache NiFi 2.0
https://medium.com/@tspann/image-processing-with-custom-python-and-nifi-2-0-06eadc62c03c

Mixtral Deep Dive
https://dzone.com/articles/mixtral-generative-sparse-mixture-of-experts-in-da

AI Augmented DevRel part 1
https://medium.com/@tspann/ai-augmented-devrel-part-1-4058af905a89

Next Level Flink with Nussknacker
https://medium.com/@tspann/next-level-flink-with-nussknacker-fe7294e2ef21

Mixtral Generative Sparse Mixture of Experts in DataFlows
https://medium.com/@tspann/mixtral-generative-sparse-mixture-of-experts-in-dataflows-59744f7d28a9

https://news.mit.edu/2024/researchers-enhance-peripheral-vision-ai-models-0308

https://medium.com/@1709deepesh/connecting-apache-nifi-to-microsoft-graph-reading-emails-with-invokehttp-processors-6d84db9fa157

https://community.cloudera.com/t5/Community-Articles/How-to-call-a-CML-Deployed-Model-From-Apache-NiFi-in-10/ta-p/374853

https://www.infoq.com/news/2024/03/java-22-so-far/

https://www.infoq.com/news/2024/03/lapce-rust-editor

https://www.infoq.com/news/2024/03/mistral-ai-aws/

https://www.infoq.com/news/2024/03/anthropic-claude-ai/

https://www.quantamagazine.org/new-breakthrough-brings-matrix-multiplication-closer-to-ideal-20240307/

https://venturebeat.com/ai/hugging-face-is-launching-an-open-source-robotics-project-led-by-former-tesla-scientist/?

https://dbos-project.github.io/

https://www.datanami.com/2024/03/07/cloudera-unveils-next-phase-of-open-data-lakehouse-to-unlock-enterprise-ai/?

https://www.decodable.co/blog/taxonomy-of-data-change-events

https://www.europarl.europa.eu/news/en/press-room/20240308IPR19015/artificial-intelligence-act-meps-adopt-landmark-law

https://medium.com/plain-simple-software/the-llm-app-stack-2024-eac28b9dc1e7

https://www.slideshare.net/JulienSIMON5/an-introduction-to-computer-vision-with-hugging-face

https://huggingface.co/learn/nlp-course/chapter1/2?fw=pt

https://huggingface.co/timm

https://github.com/huggingface/pytorch-image-models

https://www.slideshare.net/JulienSIMON5/an-introduction-to-computer-vision-with-hugging-face

https://www.infoq.com/news/2024/03/azure-openai-your-data-ga/

https://developers.redhat.com/articles/2024/03/13/kafka-tiered-storage-deep-dive?

https://community.cloudera.com/t5/What-s-New-Cloudera/Cloudera-DataFlow-adds-Change-Data-Capture-processors-flow/ba-p/381727

Videos

Streaming Traffic Cameras
https://www.youtube.com/watch?v=85ECRGJBEQU&ab_channel=DatainMotion-HowToBeaStreamingEngineer

Python Processor
https://www.youtube.com/watch?v=jF5FSY0xFiQ&t=9s&ab_channel=DatainMotion-HowToBeaStreamingEngineer

Preview of TCF Pro Talk
https://youtu.be/ce9lhtbp48M?si=Svjb2-bIIPXLwXD1

Feb 22, 2024 NYC Meetup

https://www.slideshare.net/slideshows/2024-feb-ai-meetup-nyc-genaillmsmldata-codeless-generative-ai-pipelines/266444687

Feb 28, 2024 NYC Flink Meetup

https://www.slideshare.net/slideshows/2024-february-28-nyc-meetup-unlocking-financial-data-with-realtime-pipelines/266539528

Feb 29, 2024 Conf42 Python 2024

https://www.slideshare.net/slideshows/conf42python-using-apache-nifi-apache-kafka-risingwave-and-apache-iceberg-with-stock-data-and-llm/266521940

https://www.slideshare.net/slideshows/conf42pythonbuilding-apache-nifi-20-python-processors/266522007

https://www.youtube.com/watch?v=awxzG7laWx4&ab_channel=Conf42

https://www.youtube.com/watch?v=FD16_oZ65Ug&ab_channel=Conf42

March 11, 2024 Princeton 23 Orchard Event

https://www.slideshare.net/slideshows/2024-build-generative-ai-for-nonprofits/266748822

march 15, 2024 Trenton TCF

https://www.slideshare.net/slideshows/tcfpro24-building-realtime-generative-ai-pipelines/266807785

Events

March 27, 2024: Startup Grind. Jersey City
https://www.startupgrind.com/events/details/startup-grind-princeton-presents-startup-grind-princeton-amp-nj-big-data-alliance-generative-ai-reverse-pitch/

March 28, 2024: Pinot + NiFi + Flink + Kafka Meetup NYC
https://www.meetup.com/real-time-analytics-meetup-ny/events/299290822/

April 2, 2024: XtremeJ 2024. Virtual.
https://xtremej.dev/2023/schedule/

April 8-11, 2024: NLIT Summit. Seattle.
https://www.fbcinc.com/e/nlit/default.aspx

April 11, 2024: Conf42 LLM. Virtual.
https://www.conf42.com/llms2024

April 12, 2024: AI Max Conference. 23 Orchard Princeton
https://www.startupgrind.com/events/details/startup-grind-princeton-presents-startup-grind-hosts-ai-max-summit/

April 2024: AI Meetup NJ
https://www.meetup.com/nj-gai/

May 8-9, 2024: Data Summit 2024. Boston, MA.
https://www.dbta.com/DataSummit/2024/default.aspx

Cloudera Events
https://www.cloudera.com/about/events.html

More Events:
https://www.linkedin.com/pulse/schedule-2024-tim-spann–y4coe

Code

https://github.com/tspannhw/FLaNK-python-processors

Models

https://github.com/urchade/GLiNER
https://github.com/deepseek-ai/DeepSeek-VL
https://github.com/sieve-community/fast-asd
https://arxiv.org/abs/2403.09611
https://github.com/xai-org/grok-1

Datasets

https://hf.co/datasets

Tools

https://github.com/echasnovski/mini.nvim/blob/main/readmes/mini-indentscope.md
https://datavolo.io/2024/03/data-engineering-for-advanced-rag-small-to-big-with-pinecone-langchain-and-datavolo/
https://docs.pinecone.io/docs/metadata-filtering
https://python.langchain.com/docs/modules/data_connection/retrievers/parent_document_retriever
https://docs.pinecone.io/reference/list
https://colab.research.google.com/drive/1AvPpRvzLvGPMG3vVgcRDOvSdar_lsTks
https://github.com/pi-ra/beesy-issue-tracker
https://palindromicity.blogspot.com/2020/06/dotifi-generating-dot-files-from-apache.html
https://github.com/flxzt/rnote
https://github.com/teableio/teable
https://localsend.org/#/
https://github.com/leobeeson/llm_benchmarks
https://www.cognition-labs.com/blog
https://github.com/openvinotoolkit/openvino_notebooks/tree/recipes/recipes/defect_detection_anomalib
https://brave.com/
https://github.com/PeerDB-io/peerdb
https://osm2pgsql.org/
https://arc.net/
https://github.com/truera/trulens
https://github.com/has2k1/plotnine
https://github.com/altair-viz/altair
https://github.com/quarto-dev/quarto-cli
https://github.com/tobymao/sqlglot
https://github.com/quarkiverse/quarkus-langchain4j
https://github.com/ELLA-Diffusion/ELLA
https://github.com/skills-cogrammar/C7-Lecture-Backpack
https://github.com/LucasPickering/slumber
https://webhook.site/
https://github.com/paveldedik/ludic
https://github.com/bananaml/fructose
https://github.com/betwixt-labs/bebop
https://github.com/jafioti/luminal
https://github.com/soorajshankar/logScreen
https://github.com/flydelabs/flyde
https://lite.ip2location.com/ip2location-lite
https://github.com/getindata/flink-http-connector
https://github.com/phospho-app/phospho
https://github.com/phospho-app/fastassert
https://github.com/developersdigest/llm-answer-engine
https://docs.litellm.ai/docs/proxy/quick_start
https://letsbuild.ai/
https://github.com/albertan017/LLM4Decompile
https://vector.dev/
https://github.com/ArroyoSystems/arroyo
https://github.com/stanfordnlp/pyvene
https://www.mewho.com/titan/
https://radicle.xyz/
https://github.com/ianand/spreadsheets-are-all-you-need

Tips

https://www.datainmotion.dev/2020/05/one-minute-nifi-tip-calcite-sql-notes.html

Cool Tool

These are amazing diagrams and graphics.

https://drawify.com/templates/341/personal-user-manual

© 2020-2024 Tim Spann

Leave a Reply

Your email address will not be published. Required fields are marked *