flink-extended/dl-on-flink
Deep Learning on Flink aims to integrate Flink and deep learning frameworks (e.g. TensorFlow, PyTorch, etc) to enable distributed deep learning training and inference on a Flink cluster.
JavaApache-2.0
Issues
- 0
运行Linear.java python报dl_on_flink_framework依赖不存在
#764 opened by I-am-DJ - 2
- 0
[Tensorflow On Flink] Batch train heartbeat failed
#540 opened by wuchaochen - 2
looks like property key REMOTE_CODE_ZIP_FILE("remote_code_zip_file") has no effect when training without input
#744 opened by 1202zhyl - 0
- 1
- 1
TFUtils python API adapt to the new Java API
#725 opened by Sxnan - 1
Supports Flink iteration for batch training
#713 opened by Sxnan - 1
- 5
worker重启问题
#720 opened by xdyun - 0
- 3
[Flink AI Flow] Kindly provide an example about real time feature engineering of window and global features
#591 opened by LongxingTan - 0
Quick start example run indefinitely on Tensorflow 2.3
#706 opened by Sxnan - 5
- 2
File system scheme 'queue' not implemented
#696 opened by 1202zhyl - 2
- 1
- 1
- 0
- 0
- 1
[Flink AI Flow] Support database upgrade and downgrade
#541 opened by wuchaochen - 1
- 1
[Flink AI Flow] [Discussion Needed] Metadata client in Java cannot register workflow with ContextExtractor
#487 opened by bgeng777 - 0
[AIFlow] AIFlow base docker image for deployment
#440 opened by Sxnan - 0
[AIFlow] Add job retry mechanism
#442 opened by Sxnan - 0
[AirFlow] 周期性作业恢复失败
#627 opened by wuchaochen - 0
没有install_aiflow.sh这个文件
#653 opened by Curry30h - 0
Failed to execute goal org.apache.maven.plugins:maven-javadoc-plugin:2.9.1:jar (attach-javadocs) on project flink_ai_extended: Execution attach-javadocs of goal org.apache.maven.plugins:maven-javadoc-plugin:2.9.1:jar failed: Plugin org.apache.maven.plugins:maven-javadoc-plugin:2.9.1 or one of its dependencies could not be resolved: Could not transfer artifact org.apache.maven:maven-project:jar:2.2.1 from/to central
#643 opened by Curry30h - 0
[Flink AI Flow] Flink Job cannot be cancelled due to wrong flink command generation
#539 opened by bgeng777 - 1
[Flink AI Flow]The wdl example cannot be run
#632 opened by wuchaochen - 0
- 0
[Flink AI Flow] The state of dagrun is running even all tasks are finished
#467 opened by jiangxin369 - 0
- 0
- 0
[Airflow] Add helm chart to deploy AIFlow to kubernetes
#444 opened by Sxnan - 1
Performance issues in flink-ai-flow/ (by P3)
#517 opened by DLPerf - 1
Performance issues in flink-ai-flow/ai_flow/test/util/model_util/iris_data_utils.py(P2)
#491 opened by DLPerf - 1
[Flink AI Flow] Executor support CeleryExecutor
#463 opened by aqua7regia - 0
- 0
- 3
[Flink AI Flow][Bug] When using FlinkPythonProcessor, not calling execution_context.statement_set.add_insert_sql will result in error, and missing job_id
#475 opened by lisy09 - 1
- 0
- 1
[Flink AI Flow] Dagrun may not scheduled
#547 opened by jiangxin369 - 1
- 1
- 2
- 1
- 1
[Notification Service] Sending event with HA client to non-HA server takes too much time
#506 opened by jiangxin369 - 1
[Flink AI Flow][Bug][unit test] When running run_tests.sh, sometime grpc UNAVAILABLE happens
#480 opened by lisy09