Add `from_dense` and `to_dense` methods by eriknw · Pull Request #382 · python-graphblas/python-graphblas

eriknw · 2023-02-04T19:15:29Z

Closes #378.

@jim22k, I think we discussed to/from dense years ago, and I think we punted on methods like in this PR b/c it's surprisingly awkward to do in GraphBLAS. This PR shows that, indeed, this is difficult to do well, so I think they're good methods to add.

I also added sort=True to e.g. to_csr, because the C spec says column indices within a row may be unsorted, but sometimes we want them to be sorted. to_coo already has sort=True, so adding sorting more places seems natural.

Currently, from_dense is also used to create a fully dense Vector/Matrix from a scalar, which requires the shape to be provided. I think I like this better than a separate method such as Matrix.from_scalar(1, nrows=3, ncols=4).

This also improves some handling of sub-array dtypes.

I still need to add documentation. I think the APIs and behaviors need the most careful review.

coveralls · 2023-02-04T19:53:21Z

coverage: 99.396% (-0.08%) from 99.48%
when pulling b4a80a4 on eriknw:from_dense
into 1bbce69 on python-graphblas:main.

graphblas/core/matrix.py

scripts/check_versions.sh

graphblas/core/vector.py

eriknw · 2023-02-08T17:43:57Z

missing_value= is a nice addition to from_dense. However, missing_value= doesn't make any sense for building a dense object from a scalar, so I feel like this is now squeezing too much functionality into from_dense.

Compare:

def from_dense(cls, value, missing_value=None, dtype=None, *, nrows=None, ncols=None, name=None):
    # handle scalars and arrays

vs

    def from_dense_scalar(cls, value, nrows, ncols, dtype=None, *, name=None):
    def from_dense_array(cls, values, missing_value=None, dtype=None, *, name=None):

I prefer two methods. But, what should we name them?

eriknw · 2023-02-08T18:00:21Z

I prefer two methods. But, what should we name them?

Perhaps:

from_dense for arrays
Add fill_value=None to the main constructor to fill with a scalar
- e.g., Vector(size=5, fill_value=10)
or from_iso_value for scalar

… `io.from_numpy`

eriknw · 2023-02-09T04:45:20Z

Splitting from_dense into two functions (one for "dense from scalar") simplified the implementation, so +1 from me.

I still need to update documentation. Do we like from_iso_value?

SultanOrazbayev · 2023-02-09T13:03:52Z

Splitting from_dense into two functions (one for "dense from scalar") simplified the implementation, so +1 from me.

Yes, it is more readable now.

I still need to update documentation. Do we like from_iso_value?

Looks good, I'm not sure how frequently this specific function is used (e.g. in algorithms).

eriknw · 2023-02-10T01:37:54Z

Updated and ready for review.

eriknw · 2023-02-15T16:04:14Z

We discussed this in our meeting today, and we settled on from_scalar instead of from_iso_value. We should also add a couple more notes such as this being cheap / low storage in SuiteSparse:GraphBLAS, and maybe show an alternative example for how to create a new matrix with a scalar value and a mask.

We could consider adding a mask= keyword to from_scalar, but this implies a shape, which makes things a little awkward.

eriknw · 2023-02-15T18:29:47Z

from_iso_value is renamed to from_scalar, and I made a note about SuiteSparse:GraphBLAS being efficient.

@jim22k, what note about masks did you want to add?

I think this PR is ready to merge, but I'll wait for an approval (for a couple days).

jim22k · 2023-02-15T19:56:57Z

The note could say that if you want to create an iso-valued Matrix or Vector with the same structure as an existing object, use obj.apply(binary.second, right=scalar).

…calar A more flexible way with any mask is e.g.: ```python w = Vector(v.dtype, size=v.size) w(~v.S) << value ```

eriknw · 2023-02-17T05:11:03Z

Thanks @jim22k and @SultanOrazbayev for your input on this PR! Even though I'm getting credit for commits, I know y'all deserve credit for helping too, because your engagement does make things better ❤️

This is going in! 🚀

Add from_dense and to_dense methods

44a9e6d

eriknw added feature Something is missing io Data input, output, and conversions labels Feb 4, 2023

eriknw requested review from SultanOrazbayev and jim22k February 4, 2023 19:15

eriknw mentioned this pull request Feb 4, 2023

Improve construction when inferring sub-array dtype (a.k.a. array subdtype) #381

Merged

oops; use cls, not self

3483ff5

eriknw marked this pull request as draft February 5, 2023 05:07

Add documentation (first draft)

c42e9ab

eriknw marked this pull request as ready for review February 5, 2023 17:56

eriknw added 2 commits February 6, 2023 22:21

bump ruff

d8902a1

That was fast!

e4fa913

SultanOrazbayev reviewed Feb 7, 2023

View reviewed changes

graphblas/core/matrix.py Show resolved Hide resolved

SultanOrazbayev reviewed Feb 7, 2023

View reviewed changes

scripts/check_versions.sh Show resolved Hide resolved

eriknw added 2 commits February 7, 2023 13:23

Add warning that to_dense can create very large arrays

0b2ac41

Add a test

687d443

eriknw commented Feb 8, 2023

View reviewed changes

graphblas/core/vector.py Outdated Show resolved Hide resolved

graphblas/core/vector.py Outdated Show resolved Hide resolved

Merge branch 'main' into from_dense

9751e99

Add from_iso_value, missing_value= to from_dense, and deprecate…

fe9d2c0

… `io.from_numpy`

eriknw added the deprecation Something is being removed label Feb 9, 2023

Update documentation

0a93f93

eriknw added 2 commits February 9, 2023 22:09

better (haha, what was I thinking before?)

0bd936a

bump ruff

93d533a

Rename from_iso_value to from_scalar

cefc282

Update flake8-bugbear and make improvements

06ee503

jim22k approved these changes Feb 15, 2023

View reviewed changes

Add comment for how to create iso-valued objects with structure and s…

b4a80a4

…calar A more flexible way with any mask is e.g.: ```python w = Vector(v.dtype, size=v.size) w(~v.S) << value ```

eriknw merged commit d226a51 into python-graphblas:main Feb 17, 2023

eriknw mentioned this pull request Feb 19, 2023

Read Matrix Market with fast_matrix_market #391

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Add `from_dense` and `to_dense` methods#382

Add `from_dense` and `to_dense` methods#382
eriknw merged 15 commits intopython-graphblas:mainfrom
eriknw:from_dense

eriknw commented Feb 4, 2023 •

edited

Loading

Uh oh!

coveralls commented Feb 4, 2023 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

eriknw commented Feb 8, 2023

Uh oh!

eriknw commented Feb 8, 2023 •

edited

Loading

Uh oh!

eriknw commented Feb 9, 2023

Uh oh!

SultanOrazbayev commented Feb 9, 2023

Uh oh!

eriknw commented Feb 10, 2023

Uh oh!

eriknw commented Feb 15, 2023

Uh oh!

eriknw commented Feb 15, 2023

Uh oh!

jim22k commented Feb 15, 2023

Uh oh!

eriknw commented Feb 17, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Comments

Conversation

eriknw commented Feb 4, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coveralls commented Feb 4, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

eriknw commented Feb 8, 2023

Uh oh!

eriknw commented Feb 8, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

eriknw commented Feb 9, 2023

Uh oh!

SultanOrazbayev commented Feb 9, 2023

Uh oh!

eriknw commented Feb 10, 2023

Uh oh!

eriknw commented Feb 15, 2023

Uh oh!

eriknw commented Feb 15, 2023

Uh oh!

jim22k commented Feb 15, 2023

Uh oh!

eriknw commented Feb 17, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

eriknw commented Feb 4, 2023 •

edited

Loading

coveralls commented Feb 4, 2023 •

edited

Loading

eriknw commented Feb 8, 2023 •

edited

Loading