nightscape/spark-excel

Co-maintainers wanted

Opened this issue · 25 comments

Due to personal and professional constraints, the development of this library has been rather slow.
If you find value in this library, please consider stepping up as a co-maintainer.
Help is very welcome, e.g. in the following areas:

  • Additional features
  • Code improvements and reviews
  • Bug analysis and fixing
  • Documentation improvements
  • Build / test infrastructure

If you think you can contribute in any of the above areas, please leave a comment below and I will get in touch with you on how to best proceed.

Thank you!

Hi @carlo-quinonez, thank you!
Issue grooming would be quite helpful.
Regarding your trouble with Databricks: Is there already an existing issue that describes the problem? If so, you could mention me in a comment there and I'll try to help.
Otherwise please open a new one.

Hi @nightscape - @carlo-quinonez is my corporate account and my company wants us to use personal accounts for working on public projects.

Can you make me a collaborator so I can tag issues?

Hello @nightscape!
I am trying to write multiple dataframes into a single Excel file that contains multiple sheets.

How do I provide a name for each of those sheets?

Here's what I am trying to do:
df1.write().format("com.crealytics.spark.excel").option("sheetName","My Sheet 1").option("header", "true").save("hdfs://127.0.0.1:9000/var/www/" + outFile + ".xls");

df2.write().format("com.crealytics.spark.excel").option("sheetName","My Sheet 2").option("header", "true").mode(SaveMode.Append).save("hdfs://127.0.0.1:9000/var/www/" + outFile + ".xls");

Hey @NiranjanBalasubramani, wrong place 😉
This issue is about contributing to the project.
Questions are probably best asked on StackOverflow.

Hey!
I would like to be a co-maintainer. How can I help? I have extensive experience with Scala and other JVM languages.

Hi @daviddelucca, great! I've added you as a collaborator 😃 👍

Hi @nightscape, I would like to contribute to this project. I already have a fix for issue #327 that works for me locally. In my local working copy I have also added an option similar to the Spark CSV parser's columnNameOfCorruptRecord.

Hey @Harivignesh3419 sounds great!
Can you open a PR for it?
I'll add you as a contributor after your first successful PR 👍

Hi @nightscape, I would like to contribute to this project. Areas where I can contribute:

  • Fixing the two issues with input_file_name() and with loading multiple Excel files via a glob pattern (#52 #74 #97)
  • Upgrading to the Spark data source API 2.0
  • Simplifying the project dependencies by dropping write support (we can discuss the details later)

Thanks

Hi @quanghgx, great!!
For the first two points it would probably be good to start with the open PR that makes Excel a FileFormat.
That would have to be rebased onto the latest master branch first, though.
AFAIR the existing tests were already passing, but I added new tests for reading and writing multiple files and I think those failed.

Thanks @nightscape, that's great. I wasn't aware of this branch before. I'll put the details in the PR thread. Give me some time.

Maybe I can help with the documentation, I am new to Spark and Scala.

@rudolph-m that would be great! Being new to sth. leaves no room for assumptions about implicit knowledge.

Let me know how I can assist. I found this repo while looking for a way to read Excel files in Spark, and it was not easy for me to understand how to use the library.

@rudolph-m could you maybe open a new issue regarding improving the documentation and mention there what you found hard to discover and what information would have helped you?
We could then provide you with the information, and you could expand it into something understandable.
WDYT?

Hello, I would like to contribute to the development and maintenance of this repo. Please let me know how and where I can be of help.

Hey @SubhasisMukherjee, great!!
Do you have some Scala experience, or mostly Python?

Hey @nightscape, I primarily work with Python and PySpark. I have also learnt Scala recently. I have no project experience with it yet, though I am comfortable reading and modifying Scala code.

@SubhasisMukherjee hope you had nice holidays and a good start into the new year 😃
Sorry for not getting back earlier...
Do you happen to use spark-excel in some cloud environment like Azure, AWS or GCS?

@nightscape I can go through some of the issues and see if they can be quickly resolved. If I get some more time, I don't mind helping to maintain this project. Let me know if you need anything from me.

@williamdphillips great! I added you to the project members 👍

Hi,
I can take complete ownership of maintaining the package. Let me know whether you have received any responses to this call for co-maintainers.