bmiller1009/deduper
General deduping engine for JDBC sources with output to JDBC/csv targets
KotlinApache-2.0
Issues
- 0
Remove row ids from dupe output
#51 opened by bmiller1009 - 0
- 0
- 0
- 0
Update README after Asnyc code merge
#46 opened by bmiller1009 - 1
Add library build instructions to README
#41 opened by bmiller1009 - 0
Host dokka content on git
#47 opened by bmiller1009 - 0
Update javadocs for async merged code
#45 opened by bmiller1009 - 0
Refactor Consumer classes
#44 opened by bmiller1009 - 0
- 1
- 0
- 1
- 0
- 1
Add the ability for deduper to source hash values from a Kafka topic as well as write them to a target topic
#36 opened by bmiller1009 - 0
- 0
Improve and expand unit testing
#10 opened by bmiller1009 - 1
Publish library to maven central
#33 opened by bmiller1009 - 0
Add Dokka documentation
#34 opened by bmiller1009 - 0
Fill out proper README
#11 opened by bmiller1009 - 0
Performance metrics
#35 opened by bmiller1009 - 0
Improve csv output
#4 opened by bmiller1009 - 0
Use trove4j to store the long representation of string hashes when building up hash list in type loop
#28 opened by bmiller1009 - 0
- 0
- 1
- 1
- 1
Add command-line functionality
#8 opened by bmiller1009 - 1
- 1
- 1
Make dupe persistence more efficient
#19 opened by bmiller1009 - 0
- 0
- 0
- 0
Add ability to gather and persist hashes
#12 opened by bmiller1009 - 0
- 0
Change Builder to take in Csv/Sql target JNDI objects rather than just strings
#26 opened by bmiller1009 - 0
Option to delete dupe/target persistence
#6 opened by bmiller1009 - 0
- 0
Improve logging
#7 opened by bmiller1009 - 0
- 0
- 0
Add new jndi entries programmatically
#20 opened by bmiller1009 - 0
Make hash column primary key in SQL Persistor
#22 opened by bmiller1009 - 0
Dupe Count report should contain a count of all dupes as well as unique dupes
#21 opened by bmiller1009 - 0
File output defaults
#16 opened by bmiller1009 - 1
- 0
- 0
- 0
CI/CD pipeline
#9 opened by bmiller1009