DOC Increase prominence of starting from existing issues #31660

betatim · 2025-06-25T13:00:11Z

What does this implement/fix? Explain your changes.

This PR makes a few changes to our contributing documentation. The goal is to increase the prominence of the advice to start from known issues and ways of helping out that do not involve writing (unsolicited) code.

I think we can safely state that we are selective (not just somewhat selective) when it comes to new estimators as well as new features. The linked FAQ entry applies to all kinds of code contributions.

Any other comments?

We could probably also work on the PR template to include questions similar to the ones for new features. Maybe in a new PR. We could also make bigger changes to the contributing docs; I was looking at the NumPy guide, which feels nice and organised. But again, maybe something for the future.

What do people think?

Try to set expectations regarding new features and contributing by writing code.

github-actions · 2025-06-25T13:01:07Z

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

Generated for commit: f0358af. Link to the linter CI: here

lucyleeow

Thanks @betatim! This is a nice improvement.

lucyleeow · 2025-06-26T04:03:11Z

doc/developers/contributing.rst

-adding new algorithms, and the best way to contribute and to help the project
-is to start working on known issues.
+Scikit-learn is :ref:`selective <selectiveness>` when it comes to
+adding new algorithms and features. This means the best way to contribute


Suggested change

adding new algorithms and features. This means the best way to contribute

adding new algorithms, features and enhancements. This means the best way to contribute

is this too much? 😬

Unsure. I'd argue that an enhancement is a new feature.

StefanieSenger · 2025-06-26T06:33:21Z

Thanks a lot, @betatim!

Below, we have the sentence "We are glad to accept any sort of documentation:" (currently line 705) quite prominently. Given our discussion, I think that's also not fully true, and we should rather hint that there are, of course, conditions?

lucyleeow · 2025-06-26T06:37:12Z

Good point. "Thoughtful documentation contributions, not copied directly from an LLM..." 😅

betatim · 2025-06-26T07:27:43Z

Happy to change it, in particular because I am not quite sure what that sentence is trying to say right now. It feels like documentation isn't a thing you can accept - you can accept contributions to the documentation.

Maybe something like "The project contains many kinds of documentation, contributions to any of these are welcome:"? What do you think?

I would skip the comment about LLMs, feels like going into the details too soon and it is already discussed in the "Automated contributions policy" section.

StefanieSenger · 2025-06-26T07:37:52Z

Maybe something like "The project contains many kinds of documentation, contributions to any of these are welcome:"? What do you think?

Hm, I think we want to foreshadow that not just any addition to the documentation will be accepted, for expectation management.

Maybe:
"We welcome thoughtful contributions to the documentation and are happy to review additions in the following areas:"
(?)

betatim · 2025-06-26T09:10:36Z

I added it

StefanieSenger

It's an improvement, thank you @betatim!

I think I would choose much clearer language (see my comments). In reality, it probably doesn't matter so much, because the people who are not considerate are the same ones who don't read this. 🤷

CONTRIBUTING.md

doc/developers/contributing.rst

StefanieSenger · 2025-06-26T09:27:10Z

@reshamas maybe also has some ideas?

Co-authored-by: Stefanie Senger <[email protected]>

reshamas · 2025-06-26T23:42:51Z

Adding here the transcript to Andy's video for reference:
https://github.com/data-umbrella/data-umbrella-scikit-learn-sprint/blob/master/1_transcript_ACM_contributing_sklearn.md

slide 14:

I've been with the project for a long time and basically any pull requests that I do will have a long discussion and will undergo many iterations before it gets merged, if it gets merged.

slide 15:

So there's also other ways to contribute than finding an issue and working on them. You can also just fix something in the docs that's unclear. You don't necessarily need to open an issue for this, so just like improve the documentation if there's something you don't like about it. Or just open issues. Open issues about unclear docs, about features that you find weird, about examples that are not helpful, about bugs you run into.

Particularly slide 19:

That is because scikit-learn is already quite a mature library and so it's moving quite slowly.

Yeah so if you want to add a major feature to scikit-learn, that's probably not something you can do in a day. Adding a new model to scikit-learn is usually something that takes many months and it's not something you should try to attempt at the beginning. So really start with something simple and then maybe if you got your first two pull requests and you can work at adding like a smaller feature but don't count on adding a big feature anytime soon. That is because scikit-learn is already quite a mature library and so it's moving quite slowly. And so it's hard to add anything big or make any big new changes. Also there might be a lot of interesting issues that are not appropriately tagged. So if you're interested in particular topic you can just search the topic on the issue tracker or on the pull requests and see if there's something interesting happening there.

reshamas · 2025-06-26T23:52:49Z

@betatim @StefanieSenger
I think this PR is a good start. I have extensive changes I would like to make, but need to map them out (in the same way I did with #31519), but I can do that at a later time.

betatim · 2025-06-27T06:38:03Z

@reshamas I think taking a "big picture" view of revising things is a good idea. Like you said, it will require a bit of preparation. Is it ok if we do that outside of this PR and don't wait for it before merging this?

It isn't clear to me what we should do in this PR with the snippets of Andy's talk. To me it feels like a lot of what he said is reflected already in the guide: start small, PRs often go unmerged (no matter who you are), adding a new feature/algorithm is one of the hardest things known to humankind.

Tweak contributing docs

1551ce2

Try to set expectations regarding new features and contributing by writing code.

betatim changed the title ~~Tweak contributing docs~~ DOC Increase prominence of starting from existing issues Jun 25, 2025

github-actions bot added the Documentation label Jun 25, 2025

lucyleeow approved these changes Jun 26, 2025

Set expectations for documentation contributions

ba9d8e8

StefanieSenger approved these changes Jun 26, 2025

betatim and others added 2 commits June 26, 2025 12:46

Apply suggestions from code review

754b02a

Co-authored-by: Stefanie Senger <[email protected]>

White space

f0358af

Co-authored-by: Stefanie Senger <[email protected]>

reshamas mentioned this pull request Jun 27, 2025

DOC add a link to GitHub Discussions on home page #30089

Open
